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(54) Title: 186 HUMAN SECRETED PROTEINS 



(57) Abstract 



The present invention relates to 186 novel human secreted proteins and isolated nucleic acids containing the coding regions of the 
genes encoding such proteins. Also provided are vectors, host cells, antibodies, and recombinant methods for producing human secreted 
proteins. The invention further relates to diagnostic and therapeutic methods useful for diagnosing and treating disorders related to these 
novel human secreted proteins. 



60/043,314 11 april 1997(11.04.97) US 

60/043,569 1 1 aprU 1997 (1 1.04.97) US 

60/043,311 11 aprin997 (11.04.97) US 

60/043,671 1 1 april 1997 (1 1 .04.97) US 

60/043,674 1 1 april 1997 (11.04.97) US 

60/043.669 11 april 1997(11.04.97) US 

60/043,312 11 april 1997(11.04.97) US 

60/043,313 11 april 1997(11.04.97) US 

60/043.672 1 1 april 1997 (1 1.04.97) US 

60/043,578 U april 1997 (11.04.97) US 

60/043,576 11 aprill 997 (11.04.97) US 

60/043.670 II april 1997 (11.04.97) US 

60«)47,600 23 May 1997(23.05.97) US 

60/047,615 23 May 1997 (23.05.97) US 

60/047,597 23 May 1997 (23.05.97) US 

60/047.502 23 May 1997 (23.05.97) US 

60rt)47,633 23 May 1997 (23.05.97) US 

60/047,583 23 May 1997 (23.05.97) US 

60/047,617 23 May 1997 (23.05.97) US 

60/047.618 23 May 1997 (23.05.97) US 

60/047.503 23 May 1997 (23.05.97) US 

60/047,592 23 May 1997 (23.05.97) US 

60/047,581 23 May 1997 (23.05.97) US 

60/047,584 23 May 1997 (23.05.97) US 

60/047,500 23 May 1997(23.05.97) US 

60/047.587 23 May 1997 (23.05.97) US 

60/047,492 23 May 1997 (23.05.97) US 
60/047,598 23 May 1997 (23.05.97) US 
60/047.613 23 May 1997 (23.05.97) US 
60/047,582 23 May 1997 (23.05.97) US 



60/047,596 23 May 1997 (23.05.97) US 
60/047.612 23Mayl997a3.05.97) US 
60/047,632 23 May 1997 (23.05.97) US 
60/047.601 23 May 1997 (23.05.97) US 
60/047.595 23 May 1997 (23.05.97) US 
60/047.599 23 May 1997 (23.05.97) US 
60/047,588 23 May 1997 (23.05.97) US 
60/047.585 23 May 1997 (23.05.97) US 
60/047,586 23 May 1997 (23.05.97) US 
60/047.590 23 May 1997(23.05.97) US 
60/047,594 23 May 1997 (23.05.97) US 
60/047,589 23 May 1997 (23.05.97) US 
60/047.593 23 May 1997 (23.05.97) US 
60/047,614 23 May 1997(23.05.97) US 
60/047,501 23 May 1997 (23.05.97) US 
60/048.974 06 June 1997 (06.06.97) US 
60/048,964 06 June 1997 (06.06.97) US 
60/049.610 13 June 1997 (13.06.97) US 
60/051,926 08 July 1997 (08.07.97) US 
60/052,874 16 July 1997 (16.07.97) US 
60/055.724 18 August 1997 (1 8.08.97) US 
60/056,886 22 August 1997 (22.08.97) US 
60/056.877 22 August 1997(22.08.97) US 
60/056,889 22 August 1997(22.08.97) US 
60/056.893 22 August 1997 (22.08.97) US 
60/056,630 22 August 1997 (22.08.97) US 
'60/056,878 22 August 1997(22.08.97) US 
60/056.662 22 August 1997 (22.08.97) US 
60/056,872 22 August 1997 (2Z08.97) US 
60/056.882 22 August 1997 (22.08.97) US 



60/056,637 22 August 1997 (22.08.97) US 

60/056.903 22 August 1997 (22.08.97) US 

60/056.888 22 August 1997 (22.08.97) US 

60/056,879 22 August 1997 (22.08.97) US 

60/056,880 22 August 1997 (22.08.97) US 

60/056,894 22 August 1997(22.08.97) US 

60/056.911 22 August 1997 (22.08.97) US 

60/056.636 22 August 1997 (22.08.97) US 

60/056.874 22 August 1997 (22.08.97) US 

60/056,910 22 August 1997 (22.08.97) US 

60/056,864 22 August 1997 (22.08.97) - US 

60/056,631 22 August 1997 (22.08.97) US 

60/056,845 22 August 1997 (22.08.97) US 

60/056,892 22 August 1997 (22.08.97) US 

60/056,632 22 August 1997 (22.08.97) US 

60/056,664 22 August 1997 (22.08.97) US 

60/056.876 22 August 1997 (22.08.97) US 

60/056.881 22 August 1997 (22.08.97) US 

60/056,909 22 August 1997 (22.08.97) US 

60/056.875 22 August 1997 (22.08.97) US 

60/056,862 22 August 1997 (22.08.97) US 

60/056.887 22 August 1997 (22.08.97) US 

60/056,908 22 August 1997 (22.08.97) US 

60/056,884 22 Aujgust 1997 (22.08.97) US 
60/057,761 05 September 1997 (05 09.97).US 
60/057,650 05 September 1997 (05 09.97).US 
60/057,669 05 September 1997 (05 09.97).US 
60/058,785 12 September 1997 (12.09,97) US 

60/061.060 02 October 1997 (02.10.97) US 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the ?CT. 



XL 


Albania 


ES 


Spain 


LS 


l^otho 


SI 


Slovenia 


AM 


Armenia 


Fl 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG . 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TV 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belartts 


IS 


Iceland 


MW 


Malawi 


US 


United States of Am«icB 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CC 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6tc d'lvoiie 


KP 


Democratic People*s 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






CZ 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 98/39448 



1 



PCTAJS98/04493 



186 Human Secreted Proteins 

Field of the Invention 
This invention relates to newly identified polynucleotides and the polypeptides 
encoded by these polynucleotides, uses of such polynucleotides and polypeptides, and 
5 their production. 

Background of the Invention 

Unlike bacterium, which exist as a single compartment surrounded by a 
membrane, human cells and other eucaryotes are subdivided by membranes into many 
functionally distinct compartments. Each membrane-bounded compartment, or 

10 organelle, contains different proteins essential for the function of the organelle. The cell 
uses "sorting signals," which are amino acid motifs located within the protein, to target 
proteins to particular cellular organelles. 

One type of sorting signal, called a signal sequence, a signal peptide, or a leader 
sequence, directs a class of proteins to an organelle called the endoplasmic reticulum 

1 5 (ER). The ER separates the membrane-bounded proteins from all other types of 

proteins. Once localized to the ER, both groups of proteins can be further directed to 
another organelle called the Golgi apparatus. Here, the Golgi distributes the proteins to 
vesicles, including secretory vesicles, the cell membrane, lysosomes, and the other 
organelles. 

20 Proteins targeted to the ER by a signal sequence can be released into the 

extracellular space as a secreted protein. For example, vesicles containing secreted 
proteins can fuse with the cell membrane and release their contents into the extracellular 
space - a process called exocytosis. Exocytosis can occur constitutively or after receipt 
of a triggering signal. In the latter case, the proteins are stored in secretory vesicles (or 

25 secretory granules) until exocytosis is triggered. Similarly, proteins residing on the cell 
membrane can also be secreted into the extracellular space by proteolytic cleavage of a 
"linker" holding the protein to the membrane. 

Despite the great progress made in recent years, only a small number of genes 
encoding human secreted proteins have been identified. These secreted proteins include 

30 the conunercially valuable human insulin, interferon, Factor VID, human growth 
hormone, tissue plasminogen activator, and erythropoeitin. Thus, in light of the 
pervasive role of secreted proteins in human physiology, a need exists for identifying 
and characterizing novel human secreted proteins and the genes that encode them. This 
knowledge will allow one to detect, to treat, and to prevent medical disorders by using 

35 secreted proteins or the genes that encode them. 



I 
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Summary of the Invention 

The present invention relates to novel polynucleotides and the encoded 
polypeptides. Moreover, the present invention relates to vectors, host cells, antibodies, 
5 and recombinant methods for producing the polypeptides and polynucleotides. Also 
provided are diagnostic methods for detecting disorders related to the polypeptides, and 
therapeutic methods for treating such disorders. The invention further relates to 
screening methods for identifying binding partners of the polypeptides. 

10 Detailed Description 

The following definitions are provided to facilitate understanding of certain 
terms used throughout this specification. 

In the present invention, "isolated" refers to material removed from its original 

15 environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of mauer, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. 

20 In the present invention, a "secreted" protein refers to those proteins capable of 

being directed to the ER, secretory vesicles, or the extracellular space as a result of a 
signal sequence, as well as those proteins released into the extracellular space without 
necessarily containing a signal sequence. If the secreted protein is released into the 
extracellular space, the secreted protein can undergo extracellular processing to produce 

25 a "mature" protein. Release into the extracellular space can occur by many 
mechanisms, including exocytosis and proteolytic cleavage. 

As used herein , a "polynucleotide" refers to a molecule having a nucleic acid 
sequence contained in SEQ ID NO:X or the cDNA contained within the clone deposited 
with the ATCC. For example, the polynucleotide can contain the nucleotide sequence 

30 of the full length cDNA sequence, including the 5* and 3' untranslated sequences, the 
coding region, with or without the signal sequence, the secreted protein coding region, 
as well as fragments, epitopes, domains, and variants of the nucleic acid sequence. 
Moreover, as used herein, a "polypeptide" refers to a molecule having the translated 
amino acid sequence generated from the polynucleotide as broadly defined. 

35 In the present invention, the full length sequence identified as SEQ ID NO:X 

was often generated by overlapping sequences contained in multiple clones (contig 
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analysis). A representative clone containing all or most of the sequence for SEQ ID 
NO:X was deposited with the American Type Culture Collection (" ATCC"). As 
shown in Table 1, each clone is identified by a cDNA Clone ID (Identifier) and the 
ATCC Deposit Number. The ATCC is located at 12301 Park Lawn Drive, Rockville, 
5 Maryland 20852, USA. The ATCC deposit was made pursuant to the terms of the 
Budapest Treaty on the international recognition of the deposit of microorganisms for 
purposes of patent procedure. 

A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences contained 
10 in SEQ ID NO:X, the complement thereof, or the cDNA contained within the clone 
deposited with the ATCC. "Stringent hybridization conditions" refers to an overnight 

incubation at 42° C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 

75 mM sodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 
10% dextran sulfate, and 20 |ig/ml denatured, sheared salmon sperm DNA, followed 

1 5 by washing the filters in 0. Ix SSC at about 65''C. 

Also contemplated are nucleic acid molecules that hybridize to the 
polynucleotides of the present invention at lower stringency hybridization conditions. 
Changes in the stringency of hybridization and signal detection are primarily 
accomplished through the manipulation of formamide concentration (lower percentages 
20 of formamide result in lowered stringency); salt conditions, or temperature. For 

example, lower stringency conditions include an overnight incubation at 37°C in a 

solution comprising 6X SSPE (20X SSPE = 3M NaCl; 0.2M NaH^PO,; 0.02M EDTA, 
pH 7.4), 0.5% SDS, 30% formamide, 100 ug/ml salmon sperm blocking DNA; 

followed by washes at 50°C with IXSSPE, 0.1% SDS. In addition, to achieve even 

25 lower stringency, washes performed following stringent hybridization can be done at 
higher salt concentrations (e.g. 5X SSC). 

Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 

30 Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 

conunercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, due 
to problems with compatibility. 

Of course, a polynucleotide which hybridizes only lo polyA-f sequences (such 

35 as any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
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complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone). 

5 The polynucleotide of the present invention can be composed of any 

polyribonucleotide or polydeoxribonucleotide, which may be unmodified RNA or DNA 
or modified RNA or DNA. For example, polynucleotides can be composed of single- 
and double-stranded DNA. DNA that is a mixture of single- and double-stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and 

10 double-stranded regions, hybrid molecules comprising DNA and RNA that may be 
single-stranded or, more typically, double-stranded or a mixture of single- and double- 
stranded regions. In addition, the polynucleotide can be composed of triple-su-anded 
regions comprising RNA or DNA or both RNA and DNA. A polynucleotide may also 
contain one or more modified bases or DNA or RNA backbones modified for stability 

1 5 or for other reasons. "Modified" bases include, for example, tritylated bases and 
unusual bases such as inosine. A variety of modifications can be made to DNA and 
RNA; thus, "polynucleotide" embraces chemicaUy. enzymatically, or metabolically 
modified forms. 

The polypeptide of the present invention can be composed of amino acids joined 
20 to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. and 
may contain amino acids other than the 20 gene-encoded amino acids. The 
polypeptides may be modified by either natural processes, such as postlranslational 
processing, or by chemical modification techniques which are well known in the art. 
Such modifications are well described in basic texts and in more detailed monographs, 
25 as well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 
30 branched , for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidaUon, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
35 nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphotidylinositol, cross-lmking, cychzation, disulfide bond 
formation, demelhylation, formation of covalem cross-links, formation of cysteine. 
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formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylaiion, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins 
such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. 
H. Freeman and Company, New York (1993); POSTTRANSLATIONAL 
COVALENTMODmCATION OF PROTEINS, B. C. Johnson, Ed., Academic 
Press, New York, pgs. 1-12 (1983); Seifter et al., Meth Enzymol 182:626-646 (1990); 
Rattan et al., Ann NY Acad Sci 663:48-62 (1992).) 

"SEQ ID NO:X" refers to a polynucleotide sequence while "SEQ ID NO: Y" 
refers to a polypeptide sequence, both sequences identified by an integer specified in 
Table 1. 

"A polypeptide having biological activity" refers to polypeptides exhibiting 
activity similar, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological assay, 
with or without dose dependency. In the case where dose dependency does exist, it 
need not be identical to that of the polypeptide, but rather substantially similar to the 
dose-dependence in a given activity as compared to the polypeptide of the present 
invention (i.e., the candidate polypeptide will exhibit greater activity or not more than 
about 25-fold less and, preferably, not more than about tenfold less activity, and most 
preferably, not more than about three-fold less activity relative to the polypeptide of the 
present invention.) 

Polynucleotides and Polypeptides of the Invention 

FEATURES OF PROTEIN ENCODED BY GENE NO: 1 

This gene is expressed primarily in testes tumor and to a lesser extent in fetal 

brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly of the testes, and defects of the central nervous system 
such as seizure and neurodegenerative disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly cancer of the testes and central nervous system. 
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10 



expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g.. testes and other reproductive tissue, brain and 
other tissue of the nervous system, and blood cells, and spleen, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level. i.e.. the expression level in healthy tissue 

or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

conesponding to this gene are useful for treatment/diagnosis of testicular cancer and 

treatment of central nervous system disorders since this gene is primarily expressed m 

the testes tumor and developing brain. 



FEATURES OF PROTEIN ENCODED BY GENE NO: 2 

This gene is expressed primarily in cancer tissues, such as breast cancer and 
1 5 Wilm's tumor, and to a lesser extent in fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, and/or tumors, particularly, those found in the breast, and developmental 
20 abnormalities or disorders. Similarly, polypeptides and antibodies directed to tiiese 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the glandular tissues, expression of this gene at significantiy higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g.. mammary 
25 tissue, and fetal tissue and. cancerous and wounded tissues) or bodily fluids (e.g., 
sertim. plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those compnsmg a 
30 sequence shown in SEQ ID NO. 314 as residues: Pro-1 1 to Thr-18, Leu-43 to Pro-50, 
Gly-64 to Leu-72, and Leu-8 1 to Lys-86. 

The tissue distribution indicates tiiat polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of cancers and/or tumors, 
particularly, those found in die breast since expression is mainly in cancer/tumor 
35 tissues. May serve as therapeutic proteins for proliferation/differentiation of fetal 



tissues. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 3 

This gene is expressed primarily in CD34 depleted buffy coat and to a lesser 
extent in spleen, chronic lymphocytic leukemia. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood disorders or 
leukemias, diseases of the immune system. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g.. blood cells, and spleen, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

15 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders or 

20 leukemias, diseases of the immune system since expression is in tissues related to 
inmiune function. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 4 

This gene is expressed primarily in CD34 depleted buffy coat. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood disorders or 
lymphocytic diseases. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

30 of the lissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inunune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 

35 having such a disorder, relative to the standard gene expression level, i.e., the 

expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 



1 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders since 
expression is in tissues related to immune function. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 5 

This gene is expressed primarily in CD34 depleted buffy coat. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood or immune 

10 diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., blood cells, and cancerous 

15 and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO. 317 as residues: 

20 Pro-13 toLys-21. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/diagnosis of blood disorders since 
expression is in tissues related to immune function. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 6 

This gene is expressed primarily in CD34 depleted buffy coat. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell lype(s) present in a 
biological sample and for diagnosis of diseases and conditions: blood or immune 
30 diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 

useful in providing immunological probes for differential identification of the lissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., and blood cells, and 
35 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
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in healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO. 318 as residues: 
Lys-31 to Lys-39. 

The tissue distribution indicates that polynucleotides and polypeptides 
5 corresponding to this gene are useful for treatment/diagnosis of blood diseases since it 
is expressed in tissues related to immune function. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 7 

This gene is expressed primarily in CD34 depleted buffy coat and to a lesser 

10 extent in pineal gland. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diseases of the immune 
system and brain associated diseases. Similarly, polypeptides and antibodies directed to 

15 these polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, and pineal gland, and cancerous and wounded tissues) or 

20 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for treatment/diagnosis of blood disorders, 

innmune diseases or brain associated diseases (specifically of the pineal gland) since 
expression is in tissues related to immune function, 

FEATURES OF PROTEIN ENCODED BY GENE NO: 8 

30 The translation product of this gene shares sequence homology with an organic 

cation transporter which is thought to be important in organic cation uptake in the 
kidney and liver. (See Accession No. 2343059.) Preferred polypeptide fragments 
comprise the amino acid sequence ITIAIQMICLVNXELYPTFVRNXGVMVCSSLCDIGGIITP 
^\a^I^WQAi.PLILFAVLGLLAAGVTLLLPETKGVALPETMKDAE^^GRKAKPK£OT 

35 VQTSEPSGT (SEQ ID NO: 6 1 5) or TMKDAENLGRKAKPKENT (SEQ ID NO: 6 1 6) as well 
as N-terminal and C-terminal deletions of these fragments. Also preferred are 
polynucleotide fragments encoding these polypeptide fragments. 
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This gene is expressed primarily in liver. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatic and renal 

5 diseases where drug elimination/cation exchange (organic cation uptake) in the liver and 
kidney are problematic. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic or renal system, expression of this gene at significantly higher 

10 or lower levels may be routinely detected in certain tissues and cell types (e.g., kidney 
and liver, and cancerous and wounded tissues) or bodily fiuids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healthy tissue or bodily fluid from an individual not having the 

1 5 disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 320 as residues: Asn-64 to Asn-74, and Gln-81 to Gly-87. 

The tissue distribution and homology to organic cation transporter indicate that 
polynucleotides and polypeptides corresponding to this gene are useful as a polyspecific 
transporter that is important for drug elimination in the liver (and possibly kidney) since 

20 expression is found in the liver. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 9 

This gene is expressed primarily in eosinophil induced with IL-5 and to a lesser 
extent in fetal liver and spleen. This gene also maps to chromosome 15, and therefore 

25 can be used in linkage analysis as a marker for chromosome 15. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diseases of the immune 
system, particularly allergies or asthma. Similarly, polypeptides and antibodies directed 

30 to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the inunune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, liver, and spleen, and cancerous and wounded tissues) or 

35 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
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Standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating/diagnosis of diseases involving 
esosinphil reactions since expression seems to be concentrated in eosinophils and other 
tissues involved in immunity such as the liver and spleen. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 10 

This gene is expressed primarily in tissues of hematopoietic lineage and to a 
lesser extent in Hodgkins lymphoma. Any frame shifts in this sequence can easily be 
clarified using known molecular biology techniques. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, and immune deficiency or dysfunction. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of tiie tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the immune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., hematopoietic cells, lymphoid and reticuloendothelial tissues, and 
cancerous tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment/ diagnosis for lymphomas or 
immune dysfuction or as a therapeutic protein useful in immune modulation based on 
expression in anergic T-cells and lymphomas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 11 

This gene is expressed primarily in neutrophils and to a lesser extent in activated 
lymphoid cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the cell type present in a biological sample and 
for diagnosis of diseases and conditions: inflamation. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
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of the above tissues or cells, particularly of the inimune system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., blood cells and lymphoid tissue, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

5 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an mdividual not having the disorder. Preferred epitopes include those 
comprising a sequence shown in SEQ ID NO. 323 as residues: Glu-40 to Lys-46. 
The tissue distribution indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useful for modulation of an immune reaction or as a 
growth factor for the differentiation or proliferation of neutrphils for the treatment of 
neutropenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 12 
1 5 This gene is expressed primarily in brain and to a lesser extent in activated T- 

cells. It is likely that the open reading frame containing the predicted signal peptide 
continues in tiie 5' direction. Preferred polypeptide fragments comprise tiie amino acid 
sequence PRVRNSPEDLGLSLTGDSCKL (SEQ ID NO:617). 

Therefore, polynucleotides and polypeptides of the invention are useful as 
20 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurodegenerative 
disorders including ischemic shock, alzheimers and cognitive disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
25 number of disorders of the above tissues or cells, particularly of die central nervous 
system, expression of this gene at significantiy higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., blood cells, and brain, and otiier tissue of 
the nervous system and cancerous and wounded tissues) or bodily fluids (e.g.. serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
30 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 324 as residues: Ser-5 to Glu-14, lle-21 to Pro-35. Ser-65 to Asp-81, Cys-89 to 
Val-96, Lys-136 to Ser-145. Ile-152 to Met-169. and Arg-189 to Lys-196. 
35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnostic/treatment for cancers of the given 
tissue or in the treatment of neurological disorders of the CNS. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 13 
This gene was also recently cloned by other groups, naming this calcium-activated 
potassium channel gene, hKCa4. (See Accession No. AF033021, see also, Accession 
5 No. 2584866.) This gene is mapped to human chromosome 19ql3.2. A second signal 
sequence likely exists upstream from the predicted signal sequence as described in 
Table 1. Preferred polypeptide fragments comprise: QADDLQATVAALCVLRGGGPWAG 
SWLSPKTPGAMGGDLVLGLGALRRRKRLL (SEQ NO: 61 8); or EQEKSLAGWALVLAXXGIGL 
MVLHAEMLWFGGCSAVNATGHI^DTLWLIPITFI.TIGYGDVVPGTMWGKJVCLCT 

10 TALLVAVVARKLEFNKAEIfflVHNFMMDIQYTK£MKESAARVLQEAWMF^ 

XHQRXLLAAINAFRQVRLKHRKLREQVNSMVDISKMHMILYDLQQ^^SSSHRALEKQIDT^ 
KLDALTELLSTALGPRQLPEPSQQSK (SEQ ID NO: 619), as well as N-terminal and C- 
terminal deletions. Also preferred are polynulcleotide fragments encoding these 
polypeptide fragments. 

1 5 This gene is expressed primarily in breast lymph node and T-cells, and to a 

lesser extent in placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematologic and 

20 immune disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing inununological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inmiune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., lymphoid 

25 tissue, blood cells and placenta, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 

30 sequence shown in SEQ ID NO. 325 as residues: Arg-13 to Lys-23. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment/diagnosis of hematologic and 
diseases involving immune modulation based or distribution in the lymph node and T- 
cells. 



35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 14 

This gene was recently cloned by another group, calling it PAPS synethase. 
(See Accession No. el204135.) Preferred polypeptide fragments comprise the amino 
acid sequence YQAHHVSRNKRGQVVGTRGGFRGCTVWLTGLSGAGK (SEQ ID NO: 620). 
5 Also preferred are the polynucleotide fragments encoding this polypeptide fragment. 

It has been discovered that this gene is expressed primarily in benign prostate 
hyperplasia, Human Umbilical Vein Endothelial Cells and to a lesser extent in smooth 
muscle and Human endometrial stromal cells-treated with estradiol. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: inflamation, ischemia, 
and restenosis, based on endothelial cell and smooth muscle cell expression, and 
prostate diseases such as benign prostate hyperplasia or prostate cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

15 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the prostate or vessels 
of the circulatory system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., prostate, endothelial 
cells, smooth muscle, and endometrium, and cancerous and wounded tissues) or bodily 

20 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. Preferred epitopes include those comprising a 
sequence shown in SEQ ID NO. 326 as residues: Arg-21 to Asp-26, Lys-35 to Lys-44, 

25 Glu-49 to Asn-58. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating/diagnosing diseases or conditions 
where the endothelial cell lining of the veins and arteries of underlying smooth muscle 
are involved. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 15 

This gene is expressed primarily in human 6 week embryo and to a lesser extent 
in placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: developmental 
anomalies or fetal deficiencies. Similarly, polypeptides and antibodies directed to these 
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polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly developmental in nature, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic 
5 tissue, and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., semm, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
10 NO. 327 as residues Lys-50 to Glu-57. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for detection of developmental abnormalities. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 16 

1 5 This gene is expressed primarily in kidney and amygdala and to a lesser extent 

in fetal tissues. This gene is mapped to chromosome 14, and therefore is useful in 
linkage analysis as a marker for chromosome 14. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) present in a biological sample and 

20 for diagnosis of diseases and conditions: kidney diseases, neurological disorders and 
developmental abnormalities. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s). For a number of disorders of the above tissues, particularly of the renal 
system or developing fetal tissues, expression of this gene at significantly higher or 

25 lower levels may be routinely detected in certain tissues and cell types (e.g., kidney, 

amygdala, and fetal tissues, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment or diagnosis of conditions affecting 
the brain, kidneys and fetal development. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 17 
This gene is expressed primarily in ovarian cancer. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: solid tumors similar to 
ovarian cancer Similarly, polypeptides and antibodies directed to these polypeptides are 

5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., ovarian and other 
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO. 329 as residues Ser-51 to Val-56. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the treatment of solid tumors of the 
reproductive system such as ovarian cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 18 
This gene is expressed primarily in brain meduUoblastoma, Preferred polypeptide 
fragments comprise the amino acid sequence: IRHEQHPNFSLEMHSKGSSLLLFLPQL 
ILILPVCAHLHEELNC (SEQ ID NO: 643) and SPnSEEKGHLLLQAERHPWVAGALVGVSG 
GLTLTTCSGPTEKPATKNYFLKRLLQEMHIRAN (SEQ ID NO: 644), as well as N-terminal 
and C-terminal deletions. Also preferred are polynucleotide fragments encoding these 
polypeptide fragments. 
20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors particularly of 
the CNS or Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the Central nervous system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., brain and other 
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
30 sample taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating medulloblastoma or similar tumors. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 19 
This gene is expressed primarily in adipocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions: obesity. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the adipose tissues 
expression of this gene at significantly higher or lower levels may be routinely detected 

15 in certain tissues and cell types (e.g., adipocytes and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

20 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating obesity by regulating the function and 
number of adipocytes 

FEATURES OF PROTEIN ENCODED BY GENE NO: 20 

25 This gene is expressed primarily in B cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, of the immune system with an emphasis on B cell lymphoma. Similarly, 

30 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the tumors of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely delected in certain tissues and cell types (e.g., blood cells, and lymphoid 

35 tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of B cell derived 
tumors based on its expression in b cell lymphomas 

FEATURES OF PROTEIN ENCODED BY GENE NO: 21 

This gene is expressed primarily in immune cells and to a lesser extent in fetal 

tissues 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: inflammatory diseases 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., cells of the inrmiune system, and fetal 
tissues, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. Preferred epitopes include those comprising a sequence shown in SEQ ID 
NO:333 as residues Asp-10 to Pro-19, Ser-74 to Tyr-79, Glu-95 to Lys-1 10. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of diseases involving alterations in T 
cell activity. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 22 

It has been discovered that this gene is expressed primarily in ovarian tumor. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors particularly of 
the ovary. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
tumors of the reproductive organs, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., ovarian 
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and other reproductive tissue and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
5 individual not having the disorder. Prefenred epitopes include those comprising a 
sequence shown in SEQ ID NO. 334 as residues: Leu-22 to Gln-27. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of ovarian tumors as it 
has only been identified in ovarian tumors. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 23 

It has been discovered that this gene is expressed primarily in fetal tissues and to 
a lesser extent in osteoclastoma cell line 

Therefore, polynucleotides and polypeptides of the invention are useful as 
1 5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: osteoporosis or arthritis 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
?0 skeletal expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., bone cells, and fetal tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
25 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of conditions of abnormal bone 
remodeling due to enhanced activity of osteoclasts. This may be useful as a specific 
marker for malignancies derived from osteoclasts or their precursors. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 24 

The translation product of this gene shares sequence homology with a 
periplasmic ribonuclease which is thought to be important in degrading extracellular 
polynucleotides 

35 It has been discovered that this gene is expressed primarily in serum treated 

smooth muscle cells 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: vascular disease such as 
restenosis. Similarly, polypeptides and anUbodies directed to these polypeptides are 

5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the vasculature expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., smooth muscle, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 

10 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disonier, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. Preferred 
epitopes include those comprising a sequence shown in SEQ ID NO: 336 as residues: 
Gln-30 to Lys-36, and Pro-41 to Arg-48. 

1 5 The tissue distribution and homology to ribonucleases indicate that 

polynucleotides and polypeptides corresponding to this gene are useful for treatment of 
pathological conditions of smooth muscle associated with bacterial or viral infiltration 

FEATURES OF PROTEIN ENCODED BY GENE NO: 25 
20 This gene is expressed primarily in Early Stage Human Brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: human brain 
development and related diseases. Similarly, polypeptides and antibodies directed to 
25 these polypeptides are useful in providing immunological probes for differential 

identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the human brain development and related diseases, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, and 
30 cancerous and wounded tissues) or bodily fluids (e.g., semm, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to this gene indicate that polynucleotides 
35 and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases affecting human brain development and related diseases. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 26 

It has been discovered that this gene is expressed primarily in human brain 

tissue. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: human brain diseases 
and other diseases related to brain diseases, which may be caused by brain diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

10 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
human brain diseases, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

15 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 
20 and polypeptides correspxjnding to this gene are useful for diagnosis and treatment of 
human brain diseases and other diseases related. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 27 

It has been discovered that this gene is expressed primarily in Anergic T-cells. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune diseases, 
inflammatory diseases and diseases related to T lymph cells. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 

30 probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the immune diseases, 
inflammatory diseases and diseases related to T lymph cells, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 

35 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
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expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for immune diseases, 
5 inflammatory diseases and diseases related to T lymph cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 28 

The translation product of this gene shares sequence homology with Shigella 

flexneri positive transcriptional regulator CriR (criR) gene which is thought to be 
1 0 important in regulation of gene expression. 

This gene is expressed primarily in human synovial sarcoma and normal human 

brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue{s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions: human brain diseases 
particularly sarcomas of the synovium. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of die above 
tissues or cells, particularly of the human brain and synovium and other related human 

20 brain diseases, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain (e.g., synovial tissue, and brain and other tissue of the 
nervous system, and cancerous and wounded tissues) or bodily fluids (e,g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 

25 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of human synovial 
sarcoma and other related human brain diseases. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 29 

This gene is expressed in bone marrow, infant brain, fetal liver and spleen, 
prostate and to a lesser extent in pineal gland, adipose tissue, kidney, adrenal gland, 
umbilical vein endothelial cells, and T cells. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for identification of the tissue(s) or cell lype(s) present in a biological sample 
and for diagnosis of diseases and conditions: diseases related to bone marrow or 
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hematoplastic tissues, prostate, kidney, adrenal gland, and cardiovascular tissue or 
organs. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the diseases related to 
5 hematoplastic tissues, immune system, prostate, kidney, adrenal gland, and 

cardiovascular tissue or organs, expression of this gene at significandy higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., bone marrow, 
hematopoietic cells, pineal gland, adipose tissue, kidney, adrenal gland, endothelial 
cells, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 

10 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to the gene indicate that polynucleotides 

15 and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases related to hematoplastic tissues, immune system, prostate, kidney, adrenal 
gland, and cardiovascular tissue or organs. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 30 
20 This gene is expressed primarily in meningea and to a lesser extent in breast and 

adult brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Diseases of the 

25 meningea and related brain diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the meningea and related brain diseases, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 

30 tissues and cell types (e.g., miningea, mammary tissue, and brain and other tissue of 
the nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

35 disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of diseases of the 
meningea and related brain diseases. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 31 

This gene is expressed in meningea, fetal spleen, osteoblast and to a lesser 
extent in activated T-cells, endometrial stromal cells, fetal lung, HL-60. thymus, testis 
and endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: meningeal disease, 
osteoporosis, immune diseases, and hematoplastic diseases. Similarly, polypeptides 
and antibodies directed to these polypeptides are useful in providing immunological 
probes for identification of the tissue(s) or cell type(s). For a number of disorders of the 

15 above tissues or cells, particularly of the meningeal diseases, osteoporosis, inimune 
diseases, and hematoplastic diseases, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, endometrium, lung, thymus, testis, and endothelial cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

20 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to gene indicate that polynucleotides and 
polypeptides corresponding to this gene are useful for diagnosis and treatment of 

25 meningeal, osteoporosis, inunune diseases, hematoplastic diseases, testis diseases and 
lung diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 32 

This gene is expressed primarily in human thymus and to a much lesser extent 
30 in infant brain, T-cells, smooth muscle, endothelial cells, bone marrow, human ovarian 
tumor and keratinocytes testes, osteoclastoma, breast, and tonsils. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Diseases involving the 
35 thymus, particularly thymic cancer and diseases involving T-cell maturation. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
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number of disorders of the above tissues or cells, particularly of the thymus, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., thymus, brain, and other tissue of the nervous system, 
blood cells, bone marrow, ovaries, and testes, and other reproductive tissue, manunary 
S tissue, tonsils, melanocytes and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
10 The tissue distribution and homology to gene indicate that polynucleotides and 

polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases of the thymus particularly thymic cancer and diseases involving T-cell 
maturation. 



15 FEATURES OF PROTEIN ENCODED BY GENE NO: 33 

This gene is expressed primarily in human tonsils, and placenta, and to a lesser 
extent in adipocytes, melanocyte, and infant brain. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: inflammatory diseases, 
immune diseases, and obesity. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inflanunatory diseases, immune diseases, and obesity, expression of 

25 this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., tonsils, placenta, adipocytes, melanocytes, and brain and 
other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

30 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to this gene indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for diagnosis and treatment of 
diseases such as inflammation, immune diseases, and obesity. 

35 



wo 98/39448 



PCT/US98y04493 



FEATURES OF PROTEIN ENCODED BY GENE NO: 34 

This gene is expressed in activated T cells, and to a lesser extent in pituitary, 
testis, and breast lymph node. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: diseases relating to T 
cells. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

10 disorders of the immune system, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., pituitary, testes 
and other reproductive tissue, mammary tissue, and lymphoid tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 

15 disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of immune disorders. 

go FEATURES OF PROTEIN ENCODED BY GENE NO: 35 

This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological disorders. 

25 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
diseases relating to neurological disorders, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 

30 brain, and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., semm, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 36 
This gene is expressed primarily in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neurological disorders. 
Similarly, polypeptides and antibodies directed to these polyp>eptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

10 diseases relating to neurological disorders, expression of this gene at significantly 

higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

15 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of neurological 
disorders. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 37 
This gene is expressed primarily in human ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: ovarian cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
ovarian disorders such as those involving germ cells, ovarian follicles, stromal cells, 

30 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., ovary and other reproductive tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

35 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of ovariopathy. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 38 

This gene is expressed primarily in lymph node breast cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful hi providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the breast cancer, 
10 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., mammary tissue and lymphoid tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
15 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for used as a diagnostic marker for breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 39 

20 This gene is expressed primarily in brain and to a lesser extent in other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuronal disorders such 
as trauma, brain degeneration, and brain tumor. Similarly, polypeptides and antibodies 

25 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, panicularly of the brain, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded 

30 tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

35 corresponding to this gene are useful for diagnosis and therapeutic treatment of 
neuronal disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 40 

This gene is expressed in early stage human embryo, adrenal gland tumor, and 
5 immune tissues such as fetal liver, fetal spleen, T-cell, and myoloid progenitor cell line 
and to a lesser extent in ovary, colon cancer, and a few orther tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumorigenesis including 

10 adrenal gland tumor, colon cancer and various other tumors, developmental and 

inunune disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useiiil in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the cancer tissues, early stage human tissues, and immune system, 

15 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, spleen, blood cells, bone marrow, ovary 
and other reproductive tissue, and colon, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

20 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and therapeutic treatment of immune 
and developmental disorders, and tumorigenesis. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 41 

This gene is expressed primarily in fetal lung, endothelial cells, liver, thymus 
and a few other immune tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

30 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: immune disorders such 
as immune deficiency and autoinunune diseases, pulmonary diseases, liver diseases, 
and tumor matasis. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

35 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the fetal lung, liver, endothelial cells, and immune tissues, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
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tissues and cell types (e.g., lung, endothelial cells, liver, thymus, and other tissue of 
the immune system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
5 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of immune disorders and pulmonary 
and hepatic diseases. Its promoter may also be used for immune system and lung- 
10 specific gene therapies. The expression of this gene in endothelial cells indicates that it 
may also involve in angiogenesis which therefore may play role in tumor matasis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 42 

This gene is expressed primarily in liver, thyroid, parathyroid and to a lesser 

1 5 extent in fetal lung, stomach and early embryos. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic regulation, 
obesity, heptic failure, heptacellular tumors or thyroiditis and thyroid tumors. Similarly, 

20 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the digestive/endocrine 
system expression of this gene at significantiy higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., liver, thyroid, parathyroid, lung, 

25 stomach, and embryonic tissue, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

30 The tissue distribution and the extracellular locations indicates that 

polynucleotides and polypeptides corresponding to this gene are useful for the detection 
and treatment of digestive/endocrine disorders, including metabolic regulation, heptic 
failure, malabsortion, gastritis and neoplasms. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 43 

This gene is expressed primarily in Schizophrenic adult brain, pituitary, front 
cortex, hypothalmus and to a lesser extent in retina, adipose and stomach cancer and 
placenta. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: schizophrenia and other 
neurological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inununological probes for differential identification 

10 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nerve system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., retinal 
tissue, adipose, stomach, and placenta, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

IS cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in treatment/detection of disorders in the nerve 

20 system, including schizophrenia, neurodegeneration, and neoplasia. Additionally, a 
secreted protein in brain may serve as an endocrine. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 44 

The translation product of this gene shares sequence homology with GTP 
25 binding proteins which are thought to be important in signal transduction and protein 
transport. 

This gene is expressed primarily in umbilical vein and microvascular endothelial 
cells, GM-CSF treated macrophage, anergic T cells, osteoblast, osteoclast, CD34+ cells 
and to a lesser extent in gall bladder. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: bone formation and 
growth, osteonecrosis, osteoporosis, angiogenesis and/or hematopoeisis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

35 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the skeletal and 
hematopoeisis systems, expression of this gene at significantly higher or lower levels 
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may be routinely detected in certain tissues and cell types (e.g., endothelial cells, blood 
cells, bone, and gall bladder, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
5 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to GTP binding proteins indicate that 
polynucleotides and polypeptides corresponding to this gene arc useful for 
treatment/detection of bone formation and growth, osteonecrosis, osteoporosis, and/or 
10 hematopoeisis because its involvement in the growth signaling or angiogenesis. 

• 

FEATURES OF PROTEIN ENCODED BY GENE NO: 45 

The translation product of this gene shares sequence homology with signal 
sequence receptor gamma subunit which is thought to be important in protein 

15 translocation on endoplasmic reticulum. 

This gene is expressed primarily in adrenal gland, salivary gland, prostate, and 
to a lesser extent in endothelial cells and smooth muscle. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: protein secretion. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
secretory organs, expression of this gene at significantly higher or lower levels may be 

25 routinely detected in certain tissues and cell types (e.g., adrenal gland, salivary gland, 
prostate, endothelial cells, and smooth muscle, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution and homology to SSR gamma subunit indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for endocrine 
disorders, prostate cancer, xerostomia or sialorrhea. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 46 

This gene is expressed primarily in osteoclastoma cells and to a lesser extent in 
melanocyte, amygdala, brain, and stomach. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: ossification, 
osteoporosis, fracture, osteonecrosis, osteosarcoma. Similarly, polypeptides and 
5 antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the skeletal systems, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., melanocytes, amygdala, brain and other tissue of the nervous system, 
10 and stomach, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

1 5 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful in intervention of ossification, osteoporosis, 
fracture, osteonecrosis and osteosarcoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 48 
20 The translation product of this gene shares sequence homology with proline rich 

proteins which is thought to be important in protein-protein interaction. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: neurological and 
psychological disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central nerve system and endocrine system, expression of this gene at 

30 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

35 fluid from an individual not having the disorder. 

The tissue distribution and homology to proline-rich proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful in intervention 
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and detection of neurological diseases, including trauma, neoplasia, degenerative or 
metabolic conditions in the central nerve system. Additionally, the gene product may be 
a secreted by the brain as an endocrine. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 49 

The translation product of this gene shares sequence homology with the AOCB 
gene from Aspergillus nidulans which is important in asexual development. 

This gene is expressed primarily in infant brain and to a lesser extent in the 
developing embryo, trachea tumors, B-cell lymphoma and synovial sarcoma. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell lype(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurodegenerative 
diseases, leukemia and sarcoma's. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 

1 5 identification of the tissue(s) or ceil type(s). For a number of disorders of the above 
tissues or cells, particularly of the brain and immune system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and ceil 
types (e.g., embryonic tissue, blood cells, trachea, and synovial tissue, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 

20 spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution in infant brain and sarcoma's and homology to a gene 
involved in a key step of eukaryotivc development (fungal spore formation) indicates 

25 that the protein product of this clone could play a role in neurological diseases such as 
schizophrenia, particularly in infants. The existence of the gene in a B-cell lymphoma . 
indicates the gene may be used in the treatment and detection of leukemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 50 

30 This gene is expressed primarily in fetal lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary disorders 
including lung cancer. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tis.sue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the pulmonary system, expression of this gene at significantly higher or 
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, lower levels may be routinely detected in certain tissues and cell types (e.g., lung, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution of this gene only in fetal lung indicates that it plays a key 
role in development of the pulmonary system. This would suggest that misregulaiion of 
the expression of this protein product in the adult could lead to lymphoma or sarcoma 
formation, particularly in the lung. It may also be involved in predisposition to certain 
10 pulmonary defects such as pulmonary edema and embolism, bronchitis and cystic 
flbrosis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 51 

This gene is expressed primarily in hematopoietic cell types and fetal cells and to 

1 5 a lesser extent in all tissue types. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in the immune 
system and hematopoeisis. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
pardcuiarly of the immune and hematopoiedc systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, and fetal tissue, and cancerous and wounded tissues) 

25 or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution of this gene predominantly in hematopoeitic cells and in 

30 the developing embryo indicates that polynucleotides and polypeptides corresponding to 
this gene are useful for detection and treatment of lymphomas and disease states 
affecting the immune system or hematopoeisis disorders such as leukemia, AIDS, 
arthritis and asthma.. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 52 

This gene is expressed primarily in prostate and to a lesser extent in fetal spleen, 
fetal Hver, infant brain and T cell leukemias. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: prostate disorders, 
prostate cancer, leukemia. Similarly, polypeptides and antibodies directed to these 

5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the inmiune system, and/or prostate gland expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., thymus, spleen, liver, brain and other tissue of the nervous system, and 

10 blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution of this gene in prostate indicates that polynucleotides and 

polypeptides corresponding to this gene are useful for detection or treatment of prostate 
disorders or prostate cancer. Its distribution in fetal liver and fetal spleen indicates it 
may play a role in the immune system and its misregulation could lead to immune 
disorders such as leukemia, arthritis and asthma. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 53 

The translation product of this gene shares sequence homology with dynein. 
This gene is expressed primarily in brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neuro-degenerative 
diseases of the brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
30 particularly neuro-degenerative diseases expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
35 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 



wo 98/39448 



37 



PCT/US98/04493 



The predominant tissue distribution in the brain and homology to dynein, a 
microtubule motor protein involved in the positioning of cellular organelles and 
molecules indicates that polynucleotides and polypeptides corresponding to this gene 
are useful for detection/treatment of neurodegenerative diseases, such as Alzheimers, 
5 Huntigtons, Parkinsons diseases and shizophrenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 54 

The translation product of this gene shares sequence homology with ubiquitin- 
conjugation protein, an enzyme which is thought to be important in the processing of 
1 0 the Huntingtons Disease causing gene. 

This gene is expressed primarily in brain and to a lesser extent in activated 
macrophages. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of die tissue(s) or cell type(s) present in a 

15 biological sample and for diagnosis of diseases and conditions: neurodegenerative 
disease states including Huntington's disease. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of brain tissues. For a number of disorders of the above 
tissues or cells, particularly of the neurological systems expression of this gene at 

20 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and blood cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 

25 in healthy tissue or bodily fluid from an individual not having the disorder. 

The predominant tissue distribution of this gene in the brain and its homology to 
a Huntington interacting protein indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the regulation of the expression of the 
Huntington disease gene and other neurodegenerative diseases including 

30 spinocerebullar ataxia types 1 and HI, dentatorubropallidoluysian and spinal bulbar 

muscular atrophy. In addition, the existence of elevated levels of free ubiquitin pools in 
Alzheimer's disease, Parkinson's disease and amylotrophic lateral sclerosis indicates 
that the ubiquitin pathway of protein degradation plays a role in these disease states. 
Thus, considering the gene described here is homologous to a ubiquitin-conjugation 

35 protein it may play a general role in neurodegenarative conditions. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 56 

This gene is expressed primarily in T-cells (anergic T-cells, resting T-Cells, 
apoptotic T-cells) and lymph node (breast), as well as brain (hypothalamus, 
hippocampus, pituitary, infant brain, early-stage brain). 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune (e.g. 
immunodeficiencies, autoimmunities, inflammation, leukemias & lymphomas) and 
neurological (e.g. Alzheimer's disease, dementia, schizophrenia) disorders. Similarly. 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the central nervous, 
hematopoietic and immune systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 

1 5 cells, lymphoid tissue, and brain and other tissue of the nervous system, and cancerous 
and wounded tissues) or bodily fluids (e.g., seram, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to tiie standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

20 The tissue distribution indicates tiiat polynucleotides and polypeptides 

corresponding to this gene are useful in the intervention or detection of pathologies 
associated with the hematopoietic and immune systems, such as anemias (leukemias). 
In addition, the expression in brain (including fetal) might suggest a role in 
developmental brain defects, neuro-degenerative diseases or behavioral abnomalities 

25 (e.g. schizophrenia, Alzheimer's, dementia, depression, etc.). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 57 

This gene is expressed primarily in lung, and to a lesser extent in a variety of 
otiier hematological cell types (e.g. Raji cells, bone marrow cell line, activated 

30 monocytes). 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of Uie tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: pulmonary and/or 
hematological disfunction. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the vasculo-pulmonary and hematopoietic systems, expression of this 
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gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., lung and blood cells, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
5 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in the intervention and detection of pathologies 
associated with the vasculo-pulmonary system. In addition the expression of this gene 
10 in a variety of leukocytic cell types and a bone marrow cell line might suggest a role in 
hematopoietic and immune system disorders, such as leukemias & lymphomas, 
inflammation, immunodeficiencies and autoimmunities. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 58 

1 5 The translation product of this gene shares sequence homology with adenylate 

kinase isozyme 3 (gil 1 63528 GTP: AMP phosphotransferase (EC 2.7.4. 1 0) [Bos 
taurus]), which is thought to be important in catalyzing the phosphorylation of AMP to 
ADP in the presence of ATP or inorganic triphosphate. 

This gene is expressed primarily in fetal liver, heart and placenta, and to a lesser 

20 extent in many other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatic, cardiovascular 
or reproductive disorders. Similarly, polypeptides and antibodies directed to these 

25 polypeptides are useful in providing inmiunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic, cardiovascular and reproductive systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., liver, heart, and placenta, and cancerous and wounded tissues) or 

30 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

35 corresponding to this gene are useful for the treatment and diagnosis of conditions 
related to hepatic function and pathogenesis, in particular, those dealing with liver 
development and the differentiation of hepatocyte progenitor cells. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 59 

This gene is expressed primarily in CD34 positive cells (Cord Blood). 
Therefore, polynucleotides and polypeptides of the invention are useiui as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematopoietic 
differentiation and immune disorders. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 

10 tissues or cells, particularly of hematopoietic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., hematopoietic cells, and blood cells, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

15 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful in the detection and treatment of conditions 
associated with CD34-positive cells, and therefore as a marker for cell differentiation in 

20 hematapoiesis, as well as immunological disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 60 

The translation product of the predicted open reading frame of Uiis contig has 
sequence identity to the murine gene designated Insulin-Like Growth Factor-Binding 
25 Protein (IGFBP)-l as described by Lee and colleagues (Hepatology 19 (3), 656-665 
(1994)), 

This gene is expressed exclusively in hemangiopericytoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
30 biological sample and for diagnosis of hemangiopericytoma and other pericyte or 

endothelial cell proliferative disorders. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the circulatory and immune systems, expression of this 
35 gene at significantly higher or lower levels may routinely be detected in certain tissues 
and cell types (e.g., pericyte or endothelial cells, and liver, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
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another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

Polynucleotides and polypeptides corresponding to this gene are useful as cell 
5 growth regulators since IGFBP-l-Iike molecules function as modulators of insulin-like 
growth factor activity. In addition, since IGFBP-1 is expressed at high levels 
following hepatectomy and during fetal liver development, polynucleotides of the 
present invention may also be used for the diagnosis of developmental disorders. 
Further, polypeptides of the present invention may be used therapeutically to treat 
10 developmental liver disorders as well as to regulate hepatocyte and supporting cell 
growth following hepatectomy or to treat liver disorders. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of 
hemangiopericytoma and liver disorders. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 61 

This gene is expressed primarily in schizophrenic frontal cortex. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: nervous system and 
cognitive disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the frontal cortex and CNS expression of this gene at significantly higher 

25 or lower levels may be routinely detected in certain tissues and cell types (e.g., brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

30 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, treatment and diagnosis of frontal 
cortex, neuro-degenerative and CNS disorders 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 62 

This gene is expressed primarily in human adrenal gland tumor, and to a lesser 
extent in human kidney, medulla and adult pulmonary tissue. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: metabolic, endocrine 
disorders. Similarly, polypeptides and antibodies directed to these polypeptides are 

5 useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the endocrine and nervous system disorders and neoplasia, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., adrenal gland, kidney, brain and other tissue of the nervous system, 

1 0 pulmonary tissue, and cancerous and v/ounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid firom an individual not having the 
disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for study, treatment and diagnosis of neurological 
and endocrine disorders including neoplasia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 63 
20 This gene is expressed primarily in human adipocytes, and to a lesser extent m 

spleen, 12-week old human, and testes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differenUal identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune, metabolic and 
25 growth disorders. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., adipocytes, 
30 spleen, and testes and other reproductive tissue, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for study, diagnosis and treatment of immune, 
developmental and metabolic disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 64 
One translated product of this clone is homologous to the mouse zinc finger protein 
PZF. (See Accession No. 453376; see also Gene 152 (2), 233-238 (1995).) Preferred 
5 polypeptide fragments correspond to the highly conserved domains shared between 

mouse and man. For example, preferred polypeptide fragments comprise the amino acid 
sequence: LQCEICGFTCRQKASLNWHMKKHDADSFYQFSCNICGKKFEKKDSWAHKAKSH 

PEV (SEQ ID NO: 621); ITSTDILGTNPESLTQPSD (SEQ ID NO: 622); NSTSGECLLLEAEGM 
SKSY (SEQ ID NO; 623); CSGTERVSLMADGKIFVGSGSSGGTEGLVMNSDILGATTEVLIEDSD 

10 SAGP (SEQ ID NO: 624); IQYVRCEMEGCGTVLAHPRYLQHHIKYQHLLKKKYVCPHPSCGRLF 
RLQKQLLRHAKHHT (SEQ ID NO: 625); DQRDYICEYCARAFKSSHNLAVHRMIHTGEK (SEQ 
ID NO: 626); RSSRTSVSRHRD1ENTRSSRSKTGSLQLICKSEPNTDQLDY (SEQ ID NO: 627); 
PFKDDPRDETYKPHLERETPKPRRKSG (SEQ ID NO: 630); QYVRCEMEGCGTVLAHPRYLQ 
HHIKYQHLLKKKYVCPHPSCGRLFRLQKQLLRHAKHHTD (SEQ ID NO: 629); or residues 

15 151-182 of QRDYICEYCARAFKSSHNLAVHRMIHTGEKHY (SEQ ID NO: 628). Also preferred 
are polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in Rhabdomyosarcoma, melanocyte and colon 
cancer tissue and to a lesser extent in smooth muscle, pancreatic tumor, and apoptotic 
T-cells. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune and hemopoetic, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., striated muscle, 
melanocytes, colon, smooth muscle, pancreas, and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal 
30 nuid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study, diagnosis and treatment of cancer and 
35 hemopoetic disorders. 
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type(s). For a number of disorders of the above tissues or cells, particularly of the oral 
and intestinal mucosa as well as hemopoetic and immune systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., hematopoietic ceUs, and cancerous and wounded tissues) or bodily 

5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useftil for treatment of diseases of the immune system, 
including cancer, hemopoetic and infectious diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 68 

This gene is expressed primarily in kidney cortex and to a lesser extent in infant 

15 brain, heart, uterus, and blood. 

Therefore, polynucleotides and polypeptides of the invention are useftil as 
reagents for differential identification of kidney tissue present in a biological sample and 
for diagnosis of diseases and conditions: soft tissue cancer, inflammation, kidney 
fibrosis. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
20 in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
nervous and endocrines systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., kidney, 
brain, and other nervous tissue, heart, uterus, and blood cells, and cancerous and 
25 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates Uiat polynucleotides and polypeptides 
30 corresponding to tiiis gene are useful for study and treatment of cancer and fibroses. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 69 

The translation product of this gene shares sttong sequence homology with 
vertebrate and invertebrate protein tyrosine phosphatases. 
35 This gene is expressed primarily in endometrial tumors, melanocytes, myeloid 

progenitors and to a lesser extent in infant brain, adipocytes, and several hematopoietic 
stem cells. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of transformed hematopoietic and epitheUal cells 
present in a biological sample and for diagnosis of diseases and conditions which 
include, but are not limited to. of skin and endometrium, leukemia. SimUarly, 
5 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the nervous and 
hemopoietic systems, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., endometrium, melanocytes, 

10 bone marrow, adipocytes, hematopoietic cells, and brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., senim, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healthy tissue or bodily fluid from an individual not having the 

15 disorder. 

The tissue distribution and sequence similarity with tyrosine phosphatases 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
study and treatment of cancer and hematopoietic disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 70 

This gene is expressed primarily in osteoclastoma, breast, and infant brain and 

to a lesser extent in various fetal and transformed bone, ovarian, and neuronal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
25 biological sample and for diagnosis of diseases and conditions: degenerative conditions 
of the brain and skeleton. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous and skeletal system, expression of this gene at significantly 
30 higher or lower levels may be routinely detected in certain tissues and cell types (e.g.. 
bone, mammary tissue, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e.. the expression level 
35 in healthy tissue or bodily fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for study and treatment of degenerative, 
neurological and skeletal disorders. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 71 

This gene was originally cloned from tumor cell lines. RecenUy another group 
has also cloned this gene, calling it the human malignant melanoma metastasis- 
suppressor (KiSS-1) gene. (See Accession No. U43527.) Preferred polypeptide 
fragments comprise the amino acid sequence: lekvasvgnsrptgqqleslglla (SEQ id 

10 NO: 632); VHREEASCYCQAEPSGDL (SEQ ID NO: 633); RPALRQAGGGTREPRQKRWAGL 
(SEQ ID NO: 634); and AVNFRPQRSQSM (SEQ ID NO: 635). Any frame shifts can easily be 
resolved using known molecular biology techniques. 

This gene is expressed primarily in many types of carcinomas and to a lesser 
extent in many normal organs. 

|5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissues(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly melanomas, and other hyperproliferative disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of 
transformed organ tissue, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 

25 another Ussue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. As a tumor suppressor gene, increase 
amounts of the polypeptide can be used to treat patients having a particular cancer. 
The tissue distribution indicates that this gene and the translated product is 
30 useful for diagnosing and study of cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 72 

This gene is expressed primarily in striatum and to a lesser extent in adipocytes 
and hemangioperiocytoma. 
35 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of striatal cells present in a biological sample and 
for diagnosis of diseases and conditions: neurological, fat and lysosomal storage 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 74 

This gene is expressed primarily in brain, fetal liver and lung and to a lesser 
extent in retina, spinal chord, activated T-cells and endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

5 reagents for differenUal identification of brain and regenerating liver present in a 

biological sample and for diagnosis of diseases and conditions: CNS and spinal chord 
injuries, immune disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

10 particularly of the nervous and immune system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, liver, pulmonary tissue, blood cells, and 
endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

15 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleoUdes and polypeptides 
corresponding to this gene are useful for study and treatment of hematopoietic and 
20 neurological conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 75 

. The translation product of this gene shares sequence homology with GTP 
binding proteins (intracellular). 

25 This gene is expressed primarily in bone marrow, brain, and melanocytes and to 

a lesser extent in various endocrine and hematopoietic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hematopietic and 

30 nervous system conditions. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous and immune, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., bone 
35 marrow, melanocytes, brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder. 



wo 98/39448 



PCT/US98/04493 



50 



relative to the standard gene expression level, i.e.. the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to nucleoUde binding factors indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for study, 
5 diagnosis, and treatment of brain degenerative, skin and blood diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 76 

This gene is expressed primarily in activated T-cells and to a lesser extent in 

retina, brain, and fetal bone. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of activated T-cells and developing brain present 
in a biological sample and for diagnosis of diseases and conditions: immune 
deficiencies and skeletal and neuronal growth disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

15 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the nervous, immune, and skeietomuscular 
sustems, expression of this gene at significantiy higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g.. blood cells, brain and otiier tissue of die 
nervous system, retinal tissue, and bone, and cancerous and wounded tissues) or 

20 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healtiiy tissue or bodily 
fluid from an individual not having tiie disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for diagnosis, study and treatment of cancer, 
urogenital, and brain degenerative diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 77 

This gene is expressed primarily in fetal liver, activated monocytes, osteoblasts 
30 and to a lesser extent in synovial, brain, and lymphoid tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of myeloid and lymphoid present in a biological 
sample and for diagnosis of diseases and conditions: inflammation, immune 
deficiencies, cancer. Similarly, polypeptides and antibodies directed to ttiese 
35 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system and skeleton, expression of this gene at significantly 
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higher or lower levels may be routinely detected in certain tissues and cell types (e.g.. 

liver, blood cells, bone, synovial tissue, brain and other tissue of the nervous system, 

and lymphoid tissue, and cancerous and wounded tissues) or bodily fluids (e.g., 

serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
5 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for study, diagnosis, and treatment of lymphoid 
10 and mesenchymal cancers and nervous system diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 78 

The translation product of this gene shares sequence homology with polymerase 
polyprotein precursor which is thought to be important in DNA repair and replication 

15 This gene is expressed primarily in infant brain and to a lesser extent in tumors 

and tumor cell lines 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the lissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

20 not limited to, especially of the neural system and developing organs. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the neural system 
expression of this gene at significantly higher or lower levels may be routinely detected 

25 in certain (e.g., brain and other tissue of the nervous system, and cancerous and 

wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

30 The tissue distribution and homology to polymerase polyprotein precursor 

indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of cancers especially of the neural system and developing 
organs 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 79 

This gene is expressed primarily in muscle and endothelial cells and to a lesser 
extent in brain. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: vascular diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
5 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
vascular system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain (e.g., muscle, endothelial cells, brain and other tissue of the 
nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
10 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
15 corresponding to this gene are useful for treatment and diagnosis of disorders of the 
vascular and neural system including cardiovascular and endothelial. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 80 

This gene is expressed primarily in placenta and to a lesser extent in fetal liver 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: developmental disorders 
and disorder of the haemopoietic system, fetal liver and placenta. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

25 inmiunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of developmental 
disorders and disorder of the haemopoietic system, fetal liver and placenta, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., placenta and liver, and cancerous and wounded tissues) or 

30 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

35 coiresponding to this gene are useful for diagnosis and treatment of developmental 
disorders and disorders of the haemopoietic system, fetal liver and placenta. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 81 

This gene is expressed primarily in bone marrow, placenta and tissues and 
organs of the hematopoietic system. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disorders of the bone 
and haemopoietic system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

10 of the tissue(s) or cell type{s). For a number of disorders of the above tissues or cells, 
particularly of the immune, bone and hematopoietic system, expression of this gene at 
significantly higher or lower levels may be routinely delected in certain tissues and cell 
types (e.g., bone marrow, placenta, and hematopoietic cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

15 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of disorders of the 

20 immune, bone and hematopoietic system 

FEATURES OF PROTEIN ENCODED BY GENE NO: 82 

The translation product of this gene shares sequence homology with secretory 
carrier membrane protein which is thought to be important in protein transport and 

25 export. Any frame shifts in coding sequence can be easily resolved using standard 
molecular biology techniques. Another group recently cloned this gene, calling it 
SCAMP. (See Accession No. 2232243.) 

This gene is expressed primarily in prostate, breast and spleen, and to a lesser 
extent in several other tissues and organs. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disorders of the breast 
prostate and spleen. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

35 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly disorders of the breast prostate and spleen, expression of this gene at 
significantly higher or lower levels may be routinely delected in certain tissues and cell 
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types (e.g.. prostate, mammary tissue, and spleen, and cancerous and wounded 
tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
5 fluid from an individual not having the disorder. 

The tissue distribution and homology to secretory carrier membrane protein 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of disorders of the breast, prostate and spleen. 

10 FEATURES OF PROTEIN ENCODED BY GENE NO: 83 

This gene is expressed primarily in developing organs and tissue like placenta 
and infant brain and to a lesser extent in developed organs and tissue hke cerebeUum 
and heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
]5 reagents for differential identification of the ussue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: neurological diseases. 
Similarly, polypeptides and antibodies directed to these polypeptides are usefiil in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
20 neural system, expression of this gene at significanUy higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g.. placenta, heart, brain and other 
tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e g serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
25 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of diseases of the 
neural system including neurological disorders and cancer. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 84 

The translation product of tiiis gene shares sequence homology with ATPase 6 
in Trypanosoma brucei which is thought to be important in metabolism. 

This gene is expressed primarily in wmor and fetal tissues and to a lesser extent 
35 in melanocytes, kidney cortex, monocytes and ovary. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions: metabolism disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the fetal 

5 systems, expression of this gene at significandy higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., fetal tissues, melanocytes, kidney, blood 
cells, ovary and other tissue of the reproductive system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

10 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to ATPase indicate that polynucleotides 
and polypeptides corresponding to this gene are useful for treatment and diagnosis of 
metabolism disorders, especially in fetal and tumor tissue growth. 

15 

FEATURES OF PROTEIN ENCODED BY GENE NO: 85 

The translation product of this gene shares sequence homology with the 
immunoglobulin superfamily of proteins which are known to be important in immune 
response and immunity. 

20 This gene is expressed primarily in stromal cells, colon cancer, lung, amygdala, 

melanocyte and to a lesser extent in a variety of other tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects of stromal cell 

25 development and cancer. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the stromal cells, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., stromal cells, 

30 colon, lung, amygdala, and melanocytes, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

35 The tissue distribution and homology to immunoglobulin indicate that 

polynucleotides and polypeptides corresponding to this gene are useful for treatment 
and diagnosis of immune system disorders. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 86 

The translation product of this gene shares sequence homology with 
transcription iniation factor eIF-4 gamma which is thought to be important in gene 

5 transcription. 

This gene is expressed primarily in tumor tissues. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differenUal identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumorigenesis. 
10 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly in tumor 
tissues, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., endometrium and lung, and cancerous 
15 and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to transcription iniation factor eIF-4 
20 gamma indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for gene regulation in mmorigenesis. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 87 

The translation product of this gene shares sequence homology at low level in 
25 prolines with secreted basic proline-rich peptide U-2 which is thought to be important in 
protein structuie or inhibiting hydroxyapatite formation in vitro. 

This gene is expressed primarily in endometrial tumor and fetal lung. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
30 biological sample and for diagnosis of diseases and conditions: endometrial tumors. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
muscular/skeletal and reproductive systems, expression of this gene at significantly 
35 higher or lower levels may be routinely detected in certain tissues and cell types (e.g.. 
endometrium, and lung, and cancerous and wounded tissues) or bodily fluids (e.g.. 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
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taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue disUribution and homology to secreted basic proline-rich peptide 11-2 
5 indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
inhibiting hydroxyapatite formation or establishing cell/tissue structure. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 88 

This gene is expressed primarily in: amniotic cells inducted with TNF in culture; 
10 and to a lesser extent in colon tissue from a patient with Crohn's Disease; parathyroid 
tumor; activated T-cells; cells of the human Caco-2 cell line; adenocarcinoma; colon; 
corpus colosum; fetal kidney; pancreas tumor; fetal brain; early stage brain, and anergic 
T-cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system; 

20 e.g., tumors, expression of this gene at significantly higher or lower levels may be 

routinely detected in certain (e.g., amniotic cells, colon, kidney, pancreas, parathyroid, 
brain and other tissue of the nervous system, blood cells, hematopoietic cells, liver, 
spleen, bone, testes and other reproductive tissue, brain and other tissue of the nervous 
system, and epithelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., 

25 serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 

30 for modulating tumorigenesis and other immune system conditions such as disorders in 
immune response. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 89 

This gene is expressed primarily in fetal liver/spleen and hematopoietic cells and 
35 to a lesser extent in brain, osteosarcoma, and testis tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions: leukemia and 
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential idenuficauon 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells. 
5 particularly of the hematopoietic and immune systems, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic ceUs. liver, spleen, bone, testes, and other reproductive 
tissue brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or 
10 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder . 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to tiiis gene are usefiil for diagnosis and treamient of hematopoietic and 
15 immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 90 

The translation product of this gene shares weak sequence homology with 
mouse Gcapl protein which is developmentally regulated in brain. 
20 This gene is expressed primarily in infant and adult brain and fetal liver/spleen 

and to a lesser extent in smooth muscle. T cells, and a variety of other tissues. 

Therefore, polynucleotides and polypeptides of tiie invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological or 
25 hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 

polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous, hematopoietic, immune, and endocrine systems, expression 
of this gene at significantiy higher or lower levels may be routinely detected m certain 
30 tissues and cell types (e.g., brain and other tissue of the nervous system, blood cells, 
liver spleen .and smooth muscle, and cancerous and wounded tissues) or bodily fluids 
(e g serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily flmd from an 
35 individual not having the disorder. 
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The tissue distribution and its homology to Gcapl protein indicate that 
polynucleotides and polypeptides coiresponding to this gene are useful for treatubg and 
diagnosis of disorders in neuronal, hematopoietic, immune, and endocrine systems. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 91 

This gene is expressed primarily in brain and hematopoietic cells and to a lesser 
extent in tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions: disorder in nervous, 
hematopoietic, inunune systems and tumorigenesis. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the in nervous, hematopoietic, immune 

15 systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 

20 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of disorders in the nervous, hematopoietic, and inmiune 
systems. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 92 

The translation product of this gene shares sequence homology with 
neuroendocrine-specific protein A which is thought to be important in neurologic 
systems. 

30 This gene is expressed primarily in brain tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neural disorders and 
degeneration disease. Similarly, polypeptides and antibodies directed to these 

35 polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the central or peripheral nervous systems, expression of this gene at 
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significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., hematopoietic cells, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to neuroendocrine-specific protein A 
indicate that pol3mucleotides and polypeptides corresponding to this gene are useful for 
treatment or diagnosis of neural disorders and degeneration disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 93 

The translation product of this gene shares sequence homology with collagen- 
like protein and prolin-rich protein which are thought to be important in connective 
tissue function and tissue structure. 

This gene is expressed primarily in fetal liver/spleen and brain tissues. 

Therefore, polynucleotides and polypeptides of the invendon are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuronal or 
hematopoietic disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous and hematopoietic systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., liver, spleen, and brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to collagen-like protein and proline-rich 
proteins indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for supporting brain and hematopoietic tissue function and diagnosis and 
treatment of disorders in these functions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 94 

This gene is expressed primarily in embryonic tissues and tumor tissues. 
Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
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biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
5 the inmiune system (e.g., tumors), expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic 
tissue and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
10 the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancer. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 95 

This gene is expressed primarily in brain tumor, placenta, and melanoma. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: brain tumor or 

20 melanoma. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the brain or melanocytes, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 

25 the nervous system, placenta, and melanocytes, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

30 The tissue distribution indicates that the translation product of this gene is useful 

in the diagnosis and treatment of brain tumors and melanoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 96 

The translation product of this gene shares sequence homology with a yeast 
35 membrane protein, SUR4, which encodes for APA 1 that acts on a gluco.se-signaling 

pathway that controls the expression of several genes that are transcriptionally regulated 
by glucose. 
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This gene is repressed primarily in fetal liver, and to a lesser extent in placenta 
and breast tissue. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the Ussue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and condiUons: defects of fetal liver or 
defects of glucose-regulated ATPase activities in tissues. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the fetal imraune/hematopoietic system, 
10 expression of this gene at significandy higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, placenta, and mammary tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
1 5 in healthy tissue or bodily fluid from an individual not having the disorder. 

The ussue distribution and homology to yeast SUR4 membrane protein indicate 
that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of defects of fetal Uver or defects of glucose-regulated ATPase 
activities. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 97 

This gene is expressed primarily in fetal liver, brain, and amniotic fluid. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
25 biological sample and for diagnosis of diseases and conditions: defects of the fetal 

immune system and adult brain. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the fetal immune system and adult brain, expression of this gene at 
30 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., liver, and brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., amniotic fluid, serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
35 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for detecting defects of the fetal immune and hematopoietic systems since fetal liver is 
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the predominant organ responsible for hematopoiesis in the fetus. In addition, the gene 
product of this gene is thought to be useful for detecting certain neurological defects of 
the brain. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 98 

The translation product of this gene shares sequence homology with an yolk 
protein precursor. Vitellogenin which is thought to be important in binding lipids such 
as phosvitin. 

This gene is expressed primarily in amniotic cells and fetal liver. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in amniotic 
cells, fetal liver development and the fetal immune system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing inununological probes 

15 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the [insert system where a related disease 
state is likely, e.g., inunune], expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., amniotic cells, 
and liver, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

20 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 

individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to vitellogenin indicate that the protein 
25 product of this clone is useful for treatment and diagnosis of defects in amniotic cells, 
fetal liver development and the fetal immune system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 99 

This gene is expressed primarily in placenta, endometrial tumor, osteosarcoma 

30 and stromal cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumor of the 
endometrium or bone, and osteosarcoma. Similarly, polypeptides and antibodies 

35 directed to these polypeptides are useful in providing immunological probes for 

differential idenrification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the obstetric system (e.g. placenta. 
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endometrium) and the bones, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain ussues and cell types (e.g.. placenta, 
endometrium, bone, and stromal cells, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another Ussue or 

5 cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level. i.e.. the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors and 

10 abnormalities of the endometrium, and the bones because of its abundance in the 
aforementioned tissues.. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 100 

This gene is expressed primarily in hepatocellular tumor. 
15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hepatocellular tumor. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful m 
providing immunological probes for differential identification of the tissue(s) or cell 
20 type(s). For a number of disorders of the above tissues or cells, particularly of the hver. 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g.. Hver, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
25 standard gene expression level, i.e., the expression level in healtiiy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatinent of hepatocellular cancer because of its abundant expression 
in this tissue. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 101 

This gene is expressed primarily in Corpus Colosum, fetal lung and infant 

brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: defects of the Corpus 
Colosum or defects of the fetal lung. Similarly, polypeptides and antibodies directed to 



wo 98/39448 



65 



PCT/US98/04493 



these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the Corpus Colosum and brain in general, and fetal lung, 
expression of this gene at significantly higher or lower levels may be routinely detected 

5 in certain tissues and cell types (e.g., lung, and brain and other tissue of the nervous 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

10 disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of defects of the Corpus Colosum and brain in general, and 
defects of fetal lung. 

15 FEATURES OF PROTEIN ENCODED BY GENE NO: 102 

This gene is expressed primarily in T cells and stromal cells, and to a lesser 
extent in adrenal gland. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions: defects of T cell 

immunity and stromal cell development. Similarly, polypeptides and antibodies directed 
to these polypeptides are useful in providing inununological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the inmiune system, expression of this gene at 

25 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., blood cells, stromal cells, and adrenal gland, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of defects of T cell immunity and stromal cell development 
because of its abundant expression in these tissues. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 103 

This gene is expressed primarily in infant brain and placenta. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects of the brain and 
nervous system. Similarly, polypeptides and antibodies directed to these polypeptides 

5 are useful in providing immunological probes for differential identification of the 
Ussue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the nervous system, especially brain, expression of this gene at 
significantly higher or Iowa levels may be routinely detected in certain tissues and cell 
types (e.g., brain and other tissue of the nervous system, and placenta, cancerous and 

10 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 

1 5 for detecting defects of the brain, especially in young children. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 105 

This gene is expressed primarily in human osteoclastoma and to a lesser extent 
in human pancreas tumor. 
2.0 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly osteoclastoma and pancreatic tumor. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
25 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly in transformed tissues, 
expression of this gene at significanUy higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g.. bone and pancreas, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
30 another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e.. the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of some types of tumors, particularly pancreatic cancer and 
35 osteoclastoma. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 106 

This gene is expressed primarily in fetal liver/spleen, and to a lesser extent in 
activated T-Ceils. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inmiunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
10 immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., liver, spleen, and blood cells, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
1 5 expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of immune disorders. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 107 

This gene is expressed primarily in human embryo and to a lesser extent in 
spleen and chronic lymphocytic leukemia. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: leukemia. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune or 
hemopoietic systems, expression of this gene at significantly higher or lower levels may 

30 be routinely detected in certain tissues and cell types (e.g., embryonic tissue, spleen, 
and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
35 disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for the diagnosis and treatment of leukemia. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 108 

This gene is expressed primarily in placenta, and to a lesser extent in early stage 

human brain and in lung. 
5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: fetal developmental 
abnormalities. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

10 or cell type(s). For a number of disorders of the above tissues or cells, particularly in 
fetal and amniotic tissue, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., placenta, brain and 
other tissue of the nervous system, and lung, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

1 5 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that the protein product of this is useful for 
production of growth factor(s) associated with fetal development. Preferred 

20 polypeptides comprise the full-length polypeptide shown in the sequence hsting, 
truncated however, at the amino terminus and beginning with QTE. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 109 

This gene is expressed primarily in fetal spleen, and to a lesser extent in B-Cell 
25 lymphoma and T-Cell lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: lymphoma. Similarly, 
polypeptides and antibodies directed to tiiese polypeptides are usefiil in providing 
30 immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., spleen and blood cells, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
35 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e.. the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder . 
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The tissue distribution indicates that the protein product of this clone is useful 
for the treatment and diagnosis of human lymphomas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 110 
5 The translation product of this gene shares sequence homology with sarcoma 

amplified sequence (SAS), a tetraspan receptor which is thought to be important in 
malignant fibrous histiocytoma and liposarcoma. 

This gene is expressed primarily in human osteoclastoma, and to a lesser extent 
in pineal gland and infant brain. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential idenufication of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: malignant fibrous 
histiocytoma and liposarcoma. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differendal identification 

15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., bone, 
pineal gland, and brain and other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 

20 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to sarcoma amplified sequence (SAS) 
indicate that the protein product of this clone is useful for treatment of, osteosarcoma, 

25 malignant fibrous histiocytoma and liposarcoma and related cancers, particularly 
sarcomas. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 111 

The translation product of this gene shares sequence homology with 6.8K 
30 proteolipid protein, mitochondrial - bovine. 

This gene is expressed primarily in Wilm's tumor and to a lesser extent in 

cerebellum and placenta. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: Wilm's tumor. 

Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

providing immunological probes for differential identification of the tissue(s) or cell 



wo 98/39448 



70 



PCTAUS98/04493 



type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune or renal systems, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and placenta, and cancerous and wounded tissues) or bodily fluids 

5 (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to 6.8K proteolipid protein indicate that 

10 the protein product of this clone is useful for diagnostic and therapeutics associated with 
tumors, particularly Wilm's tumor disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO; 112 

This gene is expressed primarily in embryonic tissue and to a lesser extent in 

15 osteoblasts, endothelial cells, macrophages (GM-CSF treated), and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a iiumber of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., embryonic tissue, bone, 
endothelial cells, blood cells and bone marrow, and cancerous and wounded tissues) or 

25 bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for treatment or diagnosis of immune disorders. 
Preferred polypeptides encoded by this gene comprise the following amino acid 
sequence: MITDVQLAIFANMLGVSLFLLWLYHYVAVNNPKKQE (SEQ ID NO: 636). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 113 
35 This gene is expressed primarily in hepatocellular tumor, and to a lesser extent 

in fetal liver/spleen. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors, particularly 
hepatocellular tumors. Similarly, polypeptides and antibodies directed to these 

5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hepatic system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., liver, and 
spleen, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 

10 urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
15 for diagnosis and treatment of tumors, particularly hepatocellular tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 114 

The translation product of this gene exhibits a very high degree of sequence 
identity with the human Pig8 gene which is thought to be important in p53 mediated 
20 apoptosis. The sequence of this gene has since been published by Polyak and 

colleagues (Nature 389, 300-306 (1997)). In addition, the predicted translation product 
of this contig exhibits very high sequence homology with a murine gene denoted as 
EI24 which is also thought to be important in p53 mediated apoptosis. 

This gene is expressed primarily in infant brain and activated T-cells and to a 
25 lesser extent in bone marrow, fetal liver, and prostate. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, and tissue damage by radiation and anti-cancer drugs. Similarly, 
30 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the nervous and 
immune systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., brain and other tissue of the 
35 nervous system, blood cells, bone marrow, liver, and prostate, and cancerous and 

wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
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relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to human Pig8 and murine EI24 genes 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
5 preventing apoptosis in patients being treated with anti-oncogenic drugs such as 

etoposide, hydroperoxycyclophosphamide, and X-irradiation, since this protein product 
is upregulated in cells undergoing such treatment where pS3 was overexpressed. It may 
also be useful in the treatment of hematopoietic disorders and in boosting numbers of 
hematopoietic stem cells by interfering with the apoptosis of progenitor cells. The 

10 mature polypeptide is predicted to comprise the following amino acid sequence: 
EEMADSVKTFLQDLARGIKDSIWGICTISiaDARIQQKREEQRRRRASSVLAQR!^ 
EPRFVSRIFQCCAWNGGVFWFSLLLFYRVFIPVLQSVTARIIGDPSLHGDVWSWLEFFLTSIFSA 
LWVLPLFVLSKVVNAIWFQDIADLAFEVSGRKPHPFPSVSKIIADMLFNLLLQALFLIQGMFVSL 
FPIHLVGQLVSLLHMSLLYSLYCFEYRWFNKGIEMHQRLSNIERNWPYYFGFGLPLAFLTAMQ 

15 SSYnSGCLFSILFPLniSANEAKTPGKAYLFQLRLFSLWFLSNRLFHKTVYLQSALSSSTSAEK 
FPSPHPSPAKLKATAGH (SEQ ID NO: 637). Accordingly, polypeptides comprising the 
foregoing amino acid sequence are provided as are polynucleotides encoded such 
polypeptides. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 115 

This gene is expressed primarily in stromal cells and to a lesser extent in 
multiple sclerosis. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of diseases and conditions: affecting the nervous 
system. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
nervous system, expression of this gene at significantly higher or lower levels may be 

30 routinely detected in certain tissues and cell types (e.g., stromal cells and cancerous and 
wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

35 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treatment and diagnosis of multiple sclerosis 
and other autoimmune diseases. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 116 
This gene is expressed primarily in the gall bladder 
Therefore, polynucleotides and polypeptides of the invention are usefiil as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: gall stones or infection 
of the digestive system . Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

10 particularly of the digestive system or renal system, expression of this gene at 

significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., gall bladder and tissue of the digestive system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

15 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for possible prevention of digestive disorders 
where there may be a lack of digestive enzymes produced or in the detection and 

20 possible prevention of gall stones. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 117 

The translation product of this gene shares sequence homology with dystrophin 
gene which is thought to be important in building and maintenance of muscles. 

25 This gene is expressed primarily in placenta and to a lesser extent in fetal brain 

and fetal liver, and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: muscular dystropy, 

30 Duchenne and Becker's muscular dystropies. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the skeletal muscle system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 

35 and cell types (e.g., placenta, brain and other tissue of the nervous system, muscle, 
liver, and spleen, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
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an individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to the dystrophin gene indicate that 
5 polynucleotides and polypeptides corresponding to this gene are useful for diseases 

related the degenerative myopathies that are characterized by the weakness and atrophy 
of muscles without neural degradation; such as Duchenne and Becker's muscular 
dystropies. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 118 

This gene is expressed primarily in olfactory tissue and to a lesser extent m 

cartilage. , , 

Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions: connective Ussue 
diseases; chondrosarcoma. Similarly, polypeptides and antibodies directed to th^e 
polypeptides are useful in providing immunological probes for differenual identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the connective tissue, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., olfactory 
tissue and cartilage, and cancerous and wounded tissues) or bodily fluids (e.g semm. 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e.. 
the expression level in healthy tissue or bodily fluid from an individual not having the 
25 disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for tumors of connective tissues, osteoarthntis 
and the treatment and diagnosis of chondrosarcoma. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 119 

This gene is expressed primarily m Activated Neutrophils and to a lesser extent 
in fetal spleen, and CD34 positive cells from cord blood. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions: allergies, defects m 
hematopoiesis and inflammation. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
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identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the inunune system and hematopoiesis system the, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., blood cells, and spleen, and cancerous and 

5 v^ounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useful for reducing the allergic effects felt by allergy 
suffers by neutralizing the activity of the immune system, especially since neutrophils 
are abundant in persons suffering from allergies and other inflammatory conditions. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 120 
15 The translation product of this gene shares sequence homology with poly A 

binding protein II which is thought to be important in RNA binding for transcription of 
RNAtoDNA 

This gene is expressed primarily in colon and to a lesser extent in brain and 
immune system. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: colon cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 

25 number of disorders of the above tissues or cells, particularly of the immune and 

digestive system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., colon, tissue and cells of the 
immune system, and brain or other tissue of the nervous system, and cancerous and 
wounded tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal 

30 fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to poly A binding protein II indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for detection 

35 and treatment of colon cancer and other disorders of the digestive system.. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 121 

The translation product of this gene shares sequence homology with thymidine 
diphosphoglucose 4.6 dehydrase which is thought to be important in the metabolism of 
sugar. 

5 This gene is expressed primarily in fetal liver and spleen and to a lesser extent in 

infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diabetes. Similarly, 

10 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the endocrine system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, spleen, and brain and other tissue of the 

15 nervous system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder, 

20 The tissue distribution and homology to thymidine diphospoglucose 4.6 

dehydrase indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for treatment of persons with diabetes since it appears that this protein is needed 
in the metabolism of sugar in to its more basic components. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 122 

The translation product of this gene shares sequence homology with 
ceruloplasmin which is thought to be important in the metabolism and transport of iron 
and copper. Ceruloplasmin also contains domains with homology to clotting factors V 
and Vni. Defects in the circulating levels of ceruloplasmin (aceruloplasminemia) have 
30 been associated with certain disease conditions such as Wilson disease, and the 
accompanying hepatolenticular degeneration. 

This gene is expressed primarily in brain and retina and to a lesser extent in 
endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: diseases marked by 
defects in iron metabolism; aceruloplasminemia not characterized by defects in the 
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known ceruloplasmin gene locus; nonclassical Wilson disease; movement disorders; 
and tumors derived from a brain tissue origin. Similarly, polypeptides and antibodies 
dirccted to these polypepudes are useful in providing immunological probes for 
differential identification of the tissue{s) or cell type(s). For a number of disorders of 
5 the above tissues or cells, particularly of the brain, retina, and nervous system, 

expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, 
retinal tissue, and endothelial cells, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

10 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to ceruloplasmin indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for treatment of 

1 5 patients with aceruloplasminemia, or other defects in iron and/or copper metabolism. 
Mutations in this locus could also be diagnostic for patients currently experiencing or 
predicted to experience aceruloplasminemia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 123 

20 This gene is expressed primarily in brain and B cell lymphoma and to a lesser 

extent in fetal liver and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: B cell lymphoma; 

25 tumors and diseases of the brain and/or spleen; hematopoietic defects. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununological probes for differential identification of the tissue(s) or cell lype(s). For a 
number of disorders of the above tissues or cells, particularly of the brain and 
hematopoietic system, expression of this gene at significantly higher or lower levels 

30 may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, blood cells, liver, and spleen, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

35 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of disorders in neuronal, 



wo 98/39448 



78 



PCT/US98/04493 



hematopoietic, and inunune systems. It could potentially be useful for 
neurodegenerative disorders and neuronal and/or hematopoietic cell survival or 
proliferation. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 124 

This gene is expressed primarily in osteoclastoma, dermatofibrosarcoma, and B 
cell lymphoma and to a lesser extent in endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

10 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer in particular osteoclastoma, dermatofibrosarcoma, and B cell 
lymphoma. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

15 the bone, immune, and circulatory system, expression of this gene at significantly 

higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
bone, epidermis, blood cells, and endothelial cells, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 

20 the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of cancers and 
lymphoma; osteoporosis; and the control of cell proliferation and/or differentiation. 

25 

FEATURES OF PROTEIN ENCODED BY GENE NO: 125 

This gene is expressed primarily in immune tissues and hematopoietic cells, 
particularly in activated T cells and neutrophils, spleen, and fetal liver, and to a lesser 
extent in infant adrenal gland. 

30 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: defects in T cell 
activation; hematopoietic disorders; tumors of a hematopoietic and/or adrenal gland 
origin. Similarly, polypeptides and antibodies directed to these polypeptides are useful 

35 in providing inmiunological probes for differential identification of the tissue(s) or cell 
lype(s). For a number of disorders of the above tissues or cells, particularly of the 
hematopoietic and/or endocrine systems, expression of this gene at significantly higher 
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or lower levels may be routinely detected in certain tissues and cell types (e.g., cells and 
tissues of the immune system, hematopoietic cells, blood cells, liver, and adrenal gland, 
and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 
synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
5 having such a disorder, relative to the standard gene expression level, i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for immune and/or hematopoietic disorders; 
10 diseases related to proliferation and/or differentiation of hematopoietic cells; defects in T 
cell and neutrophil activation and responsiveness; and endocrine and/or metabolic 
disorders, particularly of early childhood. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 126 

15 This gene is expressed primarily in placenta and endothelial cells and to a lesser 

extent in melanocytes and embryonic tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial 

20 cell origin; angiogenesis associated with tumor development and metastasis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the vascular system 
and developing embryo, expression of this gene at significantly higher or lower levels 

25 may be routinely detected in certain tissues and cell types (e.g., placenta, endothelial 
cells, melanocytes, and embryonic tissues, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 

30 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of developmental disorders; 
inhibition of angiogenesis; and vascular patterning. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 127 

This gene is expressed primarily in endothelial cells and hematopoietic tissues, 
including spleen, tonsils, leukocytes, and both B- and T-cell lymphomas. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of an endothelial 
cell and/or hematopoietic origin; leukemias and lymphomas. Similarly, polypeptides 

5 and antibodies directed to these polypeptides are useful in providing inmiunological 
probes for differential identification of the tissue(s) or cell type(s). For a number of 
disorders of the above tissues or cells, particularly of the inunune and vascular systems, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., endothelial cells, hematopoietic cells, spleen, 

10 tonsils, and blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for the manipulation of angiogenesis; the 
differentiation and morphogenesis of endothelial cells; the proliferation and/or 
differentiation of hematopoietic cells; and the commitment of hematopoietic cells to 
distinct cell lineages. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 128 

This gene is expressed primarily in kidney medulla and to a lesser extent in 
spleen from chronic myelogenous leukemia patients, prostate cancer, and some other 
tissues. 

25 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of a kidney 
origin; chromic myelogenous.leukemia; prostate cancer. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 

30 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the kidney and spleen, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., kidney, spleen, and prostate, and cancerous and wounded tissues) 
or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

35 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are usefiil for the diagnosis and treatment of kidney 
disorders and cancer, particularly chromic myelogenous leukemia and prostate cancer. 
It may also be useful for the enhancement of kidney tubule regeneration in the treatment 
5 of acute renal failure. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 129 

This gene is expressed primarily in adult and infant brain and to a lesser extent 
in mesenchymal or fibroblast cells, as well as tissues with a mesenchymal origin. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors of a brain and/or 
mesenchymal origin; neurodegenerative disorders; cancer; fibrosis. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 

15 inmiunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the brain and of 
mesenchymal cells and tissues, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., brain and other 
tissue of the nervous system and cancerous and wounded tissues) or bodily fluids 

%0 (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

25 corresponding to this gene are useful for the diagnosis of tumors of a brain and/or 

mesenchymal origin; neurodegenerative disorders; cancer; and fibrosis, based upon the 
expression of this gene within those tissues. Fibrosis is considered as mesenchymal 
cells and fibroblasts are the primary cellular targets involved in this pathological 
condition. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 130 

This gene is expressed primarily in hepatocellular cancer and to a lesser extent in 

fetal tissues as well as testes tumor. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: liver cancer. Similarly, 

polypeptides and antibodies directed to these polypeptides are useful in providing 
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immunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the digestive system, 
expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., liver, fetal tissue, and testes and other 
5 reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

10 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for diagnosis and treatment of liver cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 131 
This gene is expressed only in infant early brain. 

15 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: development and 
diseases of the nervous system. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inmiunological probes for differential identification 

20 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the brain and nervous system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

25 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating diseases of the brain in children and in 

30 treating nervous system disorders such as Alzheimer's disease, schizophrenia, 
dementia, depression, etc. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 132 

This gene is expressed primarily in brain and to a lesser extent in glioblastoma. 
35 Therefore, polynucleotides and polypeptides of the invention are usefiil as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: Alzheimer's disease. 
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schizophrenia, depression, mania, and dementia. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing inununological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain and nervous system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating brain disorders such as Alzheimer's 
disease, schizophrenia, depression, mania, and dementia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 133 

The translation product of this gene shares sequence homology with ribitol 
dehydrogenase of bacteria which is thought to be important in metabolism of sugars. 

This gene is expressed primarily in macrophage and to a lesser extent in T-cell 
lymphoma and lung. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tissue destruction in 
inflammation. Similarly, polypeptides and antibodies directed to tiiese polypeptides are 
useful in providing inununological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the inunune system, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., blood cells and lung, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to ribitol dehydrogenase indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for altering 
macrophage metabolism in diseases such as inflanmiation where macrophages are 
causing excess tissue destruction. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 134 

This gene is expressed primarily in pancreatic tumor and to a lesser extent in 
synovial sarcoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
5 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to,. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
10 the endocrine and connective tissue systems, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
pancreas, and synovial tissue, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
15 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating and diagnosing various cancers. 

^0 FEATURES OF PROTEIN ENCODED BY GENE NO: 135 

This gene is expressed primarily in T cell lines such as Raji and to a lesser 
extent in infant brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
25 biological sample and for diagnosis of diseases and conditions: immune system 

disorders and inflammation. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
30 lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
35 fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treating and diagnosing inflammatory diseases 
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such as rheumatoid arthritis, sepsis, inflammatory bowel disease, and psoriasis, as well 
as neutropenia. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 136 

5 The translation product of this gene shares high sequence homology with SARI 

subfamily of QTP-binding proteins which is thought to be important in vesicular 
transport in manunalian cells. 

This gene is expressed primarily in serum-stimulated smooth muscle cells and to 
a lesser extent in a T-cell lymphoma. 

10 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: diseases affecting 
vesicular transport. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identification 

15 of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the muscular system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, and smooth muscle, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 

20 taken from an individual having such a disorder, relative to the standard gene 

expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to GTP-binding proteins indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for gene therapy 

25 in treating the large number of diseases involved in defective vesicular transport within 
cells.. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 137 

The translation product of this gene shares sequence homology with a protein 
30 found in C. elegans cosmid F25B5. 

This gene is expressed primarily in a fetal tissues and to a lesser extent in 
melanocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: abnormal fetal 
development, especially of the pulmonary system. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
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for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the fetal pulmonary system, expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., fetal tissue, pulmonary tissue, and melanocytes, and 

5 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
The tissue distribution indicates that polynucleotides and polypeptides 

10 corresponding to this gene are useful for treatment and diagnosis of diseases affecting 
the pulmonary system, such as emphysema. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 138 

This gene is expressed primarily in gall bladder and to a lesser extent in smooth 
1 5 muscle. 

Therefore, polynucleotides and polypeptides of the invention are useftil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: digestive system disease 
and gall bladder problems. Similarly, polypeptides and antibodies directed to these 

20 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the digestive system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., gall 
bladder and tissue of the digestive system, and smooth muscle, and cancerous and 

25 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not Jiaving the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

30 corresponding to this gene are useful for treating diseases of the digestive system. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 139 

This gene is expressed primarily in placenta and to a lesser extent in brain. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: abnormal fetal 
development. Similarly, polypeptides and antibodies directed to these polypeptides are 
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useful in providing immunological probes for differential identification of the tissue{s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
developing tissues, expression of this gene at significantly higher or lower levels may 
be routinely detected in certain tissues and cell types (e.g., placenta, and brain and other 
5 tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
10 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating and diagnosing abnormal fetal 
development. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 140 

15 This gene is expressed primarily in smooth muscle and to a lesser extent in 

ovary, prostate cancer, and activated monocytes. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: hypertension and 

20 atherosclerosis. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 
tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the circulatory system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., smooth 

25 muscle, ovary and other reproductive tissue, prostate, and blood cells, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

30 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating diseases of the circulatory system, 
such as hypertension, atherosclerosis, etc. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 141 

35 This gene is expressed primarily in fetal spleen and to a lesser extent in placenta 

and bone marrow. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: anemia and other 
diseases affecting blood cells. Similarly, polypeptides and antibodies directed to these 

5 polypeptides are useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the circulatory and pulmonary systems, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., spleen, placenta, bone marrow, and blood cells, and cancerous and 

10 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 

1 5 corresponding to this gene are useful for the generation of red and white blood cells and 
for the diagnosis of disease of these cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 142 

The predicted translation product of this contig is a human homolog of the 
20 murine tetracycline/sugar transporter molecule recently reported by Matsuo and 

colleagues (Biochem. Biophys. Res. Commun. 238 (1), 126-129 (1997)). 

This gene is expressed primarily in synovium and to a lesser extent in 

endothelial cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
25 reagents for differential identification of the tissue(s) or cell lype(s) present in a 

biological sample and for diagnosis of diseases and conditions: rheumatoid arthritis and 
inflammation. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 

30 the immune and lymphatic systems, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., synovial 
tissue, and endothelial cells, and cancerous and wounded tissues) or bodily fluids (e.g., 
serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample 
taken from an individual having such a disorder, relative to the standard gene 

35 expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment and diagnosis of inflammatory 
diseases, such as rheumatoid arthritis, leukemia, neutropenia, inflammatory bowel 
disease, psoriasis, sepsis, and the like. 

5 

FEATURES OF PROTEIN ENCODED BY GENE NO: 143 

This gene is expressed primarily in placenta and to a lesser extent in melanocyte, 
fetal liver and spleen, and bone marrow. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

10 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: abnormal early 
development. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, lower levels 

15 may be routinely detected in certain tissues and cell types (e.g., placenta, melanocytes, 
liver, spleen, and bone marrow, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or ceil 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

20 individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and diagnosis of abnormal early 
development phenomena and diseases. 

25 FEATURES OF PROTEIN ENCODED BY GENE NO: 144 

This gene is expressed primarily in fetal liver and spleen. 
Therefore, polynucleotides and polypeptides of the invention are useftil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: anemia and neutropenia 

30 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune and blood systems, expression of this gene at significantly higher or lower 
levels may be routinely detected in certain tissues and cell types (e.g., liver and spleen, 

35 and cancerous and wounded tissues) or bodily fiuids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e., the 
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expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution indicates that polynucleotides and polypepUdes 
corresponding to this gene are useful in hematopoeisis and bone mairow regeneration 
5 as it is most abundant in fetal tissues responsible for the generation of hematopoeitic 
cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 145 

The translation product of this gene shares sequence homology with protein 
10 tyrosine phosphatase which is thought to be important in transducing signal to activate 
cells such as T cell, B cell and other cell types. 

This gene is expressed primarily in T cells and tissues in early stages of 
development and to a lesser extent in cancers. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
15 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immuno-related 
diseases and cancer. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing immunological probes for differential identificauon 
of the tissue(s) or cell typc(s). For a number of disorders of the above tissues or cells, 
20 particularly of the immune system expression of this gene at significantiy higher or 

lower levels may be routinely detected in certain tissues and cell types (e.g., embryonic 
and fetal tissue, undifferentiated cells, and blood cells, and cancerous and wounded 
tissues) or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or 
anotiier tissue or cell sample taken from an individual having such a disorder, relative to 
25 the standard gene expression level. i.e.. tiie expression level in healthy tissue or bodily 
fluid from an individual not having tiie disorder. 

The tissue distribution and homology to the protein tyrosine phosphatase family 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
modulating the immune system. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 146 

This gene is expressed primarily in T cell and to a lesser extent in B cell, 
macrophages and tumor tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
35 reagents for differential identification of the tissue(s) or cell type{s) present in a 

biological sample and for diagnosis of diseases and conditions: immuno-disorders. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
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providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., blood cells, and cancerous and 
5 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
10 corresponding to this gene are useful for regulating the immune system therefore can be 
used in treating diseases such as autoimmune diseases and cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 147 

This gene is expressed primarily in placenta and to a lesser extent in endothelial 
1 5 cells, testis tumor, ovarian cancer, uterine cancer. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue{s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to cancer. Similarly, polypeptides and antibodies directed to these 
20 polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the immune system, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., placenta, 
endothelial cells, testis and ovary and other reproductive tissue, and cancerous and 
25 wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 
relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
30 corresponding to this gene are useful for diagnosis arid treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 148 

This sequence has significant homology to mouse torsin A. Recently, another 
group cloned the human Torsin A gene. (See, Accession No. 2358279; see also Nature 
35 Genet. 17, 40-48 (1997).) 

This gene is expressed primarily in osteoclastoma, T-cell, and placenta and to a 
lesser extent in fetal lung, fetal liver, fetal brain, adult brain and tumor tissues 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: disease conditions in 
hematopoiesis and cancers. Similarly, polypeptides and antibodies directed to these 
5 polypeptides are useful in providing immunological probes for differentia] identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the hematopoiesis system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, bone, placenta, lung, liver, and brain and other tissues of the nervous system, 
10 and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level, i.e.. the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

15 The tissue distribution indicates that polynucleotides and polypeptides 

corresponding to this gene are useful for treating blood related diseases such as 
deficiencies in red blood cell, white blood cell, platelet and other hematopoiesis cells. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 149 

20 This gene is expressed primarily in T cell, prostate and prostate cancer, 

endothelial cells and to a lesser extent in monocyte, dendritic cell, bone marrow, 
salivary gland, colon cancer, stomach cancer, pancreatic tumor, uterine cancer, fetal 
spleen and osteoclastoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immuno-related 
diseases and cancers. Siniilarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inununological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

30 particularly of the inmiune system, expression of this gene at significantiy higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., blood 
cells, prostate, endothelial cells, dendritic cells, bone marrow, salivary gland, colon, 
stomach, pancreas, utems, spleen and bone, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 

35 tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 150 

This gene was recently cloned by another group, calling it eIF3-p66. (See 
Accession No. 235 1378.) This gene plays a role in RNA binding and macromolecular 
assembly, and therefore, any mutations in this gene would likely result in a diseased 
phenotype. Preferred polypeptide fragments comprise the amino acid sequence: 

MAKFMTTVIQDNPSGWGPCAVPEQFRDMPYQPFSKGDRLGKVADWTGATYQDKRYTNKYSS 
QFGGGSQYAYFHEEDESSFQLVDTARTQKTAYQRNRMRFAQRNLRRDKDRRNMLQFNLQILP 
KSAKQKERERIW.QKKFQKQFGVRQKWDQKSQKPRDSSVEVRSDWEVKEEMDFPQLMKMRY 
LEVSEPQDECCGALEYYDKAFDRirraSEKPLRXXKRIFHTVTTroDPVIRiaAKTQ^ 

AILATLMSCrasVYSWDIVVQRVGSKUT=DKRDNSDFDLLTVSErANEPPQDEGNSFNSPRNL 
AMEATYINHNFSQQCLRMGKERYNFPNPNPFVEDDMDKNEIASVAYRYRSGKLGDDIDLIVRC 
EHDGVMTGANGEVSnNIKTLNEWDSRHCNGVDWRQKLDSQRGAVlATCLKNNSYKLARWTC 

CALLAGSEYLKLGYVSRYHVKDSSRHVILGTQQFKPNEFASQrNLSVENAWGILRCVIDICMKL 
EEGKYLILKDPNKQVIRVYSLPDGTFSS (SEQ ID NO: 638). as well as N-terminal and C- 
terminal deletions of this polypeptide fragment. 

This gene is expressed primarily in T cell, bone marrow, embryo and 
20 endothelial cells and to a lesser extent in testis tumor and endometrial tumor. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: immune diseases and 
tumors. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system and reproductive system, expression of this gene at significantly higher 
or lower levels may be routinely detected in certain tissues and cell types (e.g., 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
nuid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distfibution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for immune disorders and cancers. 



25 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 151 

This gene is expressed primarily in testis and to a lesser extent in T cell, spinal 
cord, placenta, neutrophil and monocyte. 

Therefore, polynucleotides and polypeptides of the invention are usefal as 
5 reagents for differential idenUfication of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and condiUons: male reproductive and 
endocrine disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides aie useful in providing immunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
10 particularly of the reproductive, immune and endocrine systems, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 
and cell types (e.g., testis and other reproductive tissue, blood cells, tissue of the 
nervous system, and placema, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
15 sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for regulating immune and reproductive functions. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 152 

The translation product of this gene shares sequence homology with tyrosyl- 

tRNA synthetase which is thought to be important in cell growth. 

This gene is expressed primarily in brain, liver, keratinocytes. tonsils, and 

25 heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer autoimmune diseases. Similarly, polypeptides and antibodies 

30 directed to these polypeptides are useful in providing immunological probes for 

differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, liver, keratinocytes, tonsils, heart 
expression of this gene at significanUy higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissues of the nervous system. 

35 liver, keratinocytes, tonsils and heart, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
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gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to tyrosyl4RN A synthetase indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for modulating 
5 cell grovifth. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 153 

This gene is homologous to the Drosophila transcriptional regulator dre4. (See 
Accession No. 251 1745.) Dre4 is a gene required for steroidogenesis in Drosophila 

1 0 melanogaster and encodes a developmentally expressed homologue of the yeast 

u^anscriptional regulator CDC68. Preferred polypeptide fragments comprise the amino 
acid sequence: KJOWTOVQFYTEVGEITroLGKHQHMHDRDDLYAEQMEREMRHKLKTAF 
FffiKVEALTKEELEFEVPFRDLGFNGAPYRSTCLLQPTSSALVNATEWPPFVVTLDEVELIHFXR 
VQFHLK^^FDMV]YYKDYSKKVTM^NAIPVASLDP1KEV^SCDLKYTEGVQSL^^ 

15 DPEGFFEQGGWSFL (SEQ ID NO: 639), as well as N-terminal and C-terminal deletions of 
this fragments. Also preferred are polynucleotide fragments encoding this polypeptide 
fragment. 

This gene is expressed primarily in fetal liver, spleen, placenta, lung, T cell, 
thyroid, testes. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: brain tumor, heart and 
liver diseases. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 

25 or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the fetal liver, spleen, placenta, lung, T cell, thyroid, testes expression of this gene at 
significandy higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., liver, spleen, placenta, lung, blood cells, thyroid, and testes and other 
reproductive tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 

30 plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

35 FEATURES OF PROTEIN ENCODED BY GENE NO: 154 

This gene is expressed primarily in brain and to a lesser extent in fetal heart, 
testis, spleen, lung. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissUe(5) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions: heart, liver and spleen 
diseases, immunological diseases. Similarly, polypeptides and antibodies directed to 
5 these polypeptides are useful in providing immunological probes for difTerenUal 
identification of the tissue(s) or ceU type(s). For a number of disorders of the above 
tissues or cells, particularly of the brain, fetal-lreart. testis, spleen, lung expression of 
this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g.. brain and other tissue of the nervous system, heart, testes 
10 and other reproductive tissue, spleen, and lung, and cancerous and wounded tissues) or 
bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having tiie disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 155 

Activation of T cells through the T cell antigen receptor (TCR) results in the 
rapid tyrosine phosphorylation of a number of cellular proteins, one of the earliest bemg 
a 100 kDa protein. This gene is the human equivalent of murine valosin containing 
20 protein (VCP). VCP is a member of a family of ATP binding, homo-oligomenc 

proteins, and the mammalian homolog of Saccharomyces cerevisiae cdc48p. a protem 
essential to the completion of mitosis in yeast. Both endogenous and expressed munne 
VCP are tyrosine phosphorylated in response toT cell activation. Thus we have 
identified a novel component of the TCR mediated tyrosine kinase activation pathway 
25 that may provide a link between TCR activation and cell cycle control. 

This gene is expressed primarily in brain, liver, spleen, placenta. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of ti^e tissue(s) or cell type(s) present m a 
biological sample and for diagnosis of diseases and conditions which include, but are 
30 not limited to, cancer immunological disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, liver, spleen, placenta expression of 
this gene at significantly higher or lower levels may be routinely detected m certain 
35 tissues and cell types (e.g., brain and other tissue of the nervous system, liver, spleen, 
and placenta, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
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an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to VCR indicate that polynucleotides and 
5 polypeptides corresponding to this gene are useful for treating cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 156 

The translation product of this gene shares sequence homology with rat growth 
response protein which is thought to be important in cell growth. A group recently 

10 cloned the human homolog of this gene, calling it insulin induced protein 1 . (See 
Accession No. 2358269, see also. Genomics 43 (3), 278-284 (1997).) Preferred 
polypeptide fragments comprise the amino acid sequence: RSGLGLGITIAFLATLITQF 
LVYNGVYQYTSPDFLYIRSWLPCIFFSGGVTVGNIGRQLAMGVPEKPHSD (SEQ ID NO: 640). 
as well as N-terminal and C-terminal deletions of this polypeptide fragment. Also 

1 5 preferred are polynucleotide fragments encoding these polypeptide fragments. 

This gene is expressed primarily in brain, liver, placenta, heart, spleen, 
lymphoma. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions which include, but arc 
not limited to, cancer immunological disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the brain, liver, placenta, heart, spleen. 

25 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., brain and other tissue of the nervous system, 
liver, placenta, heart, spleen, and lymphoid tissue, and cancerous and wounded tissues) 
or bodily fluids (e.g.. serum, plasma, urine, synovial fluid or spinal fluid) or another 
tissue or cell sample taken from an individual having such a disorder, relative to the 

30 standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to growth-response protein indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for modulating 
cell growth. 

35 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 157 

This g«« is e.p.«sed prinwUy in Glioblastoma, endomemal .v,mor, 

5 „a«n.sfordifferenMi<l=nUr«aUo„oftetiss„=(s)orcellW(s)pres».m^ 
rcalsamplea.^fordiagnosiso(dis^"^co«iiao„s:GUoblas.on«, 
El!^^altlr.lympho™a«.p.»«asm.«>r.Sinu.a,ly,po.yp«P«- 
^rLdirec«d»e«s.pol,p«P«desa,.useMi„proviai„g— ^^^^^ 

r«ere«i^i^>i«-^°''^'^^>°''''''^'^*'""T™^^^^ 

ir„s and wounded .issues) or bodily fluids (e.g.. serum, plasma, -nne 
:^ini:id . -cCl sam^e .a..n .„m an n«bv,dual 

■ 5 Lng such a disorder, relattve >o .he sundard gene express.on le«l. ..e.. .he 
erssionlevelinhcald,yUs»ueorbodi.yfl»id«.omanindividu.»o.h.v,ngd» 

disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 158 
»„ ™,ransla.ionpr«iuc.ofUusgenesha„sse,»encehomo,ogy«.hlOE 

,ecep.or which is .hough, .o be impor^n. m allergy and astoa. 

This gene is expressed primarily in T cell, and te«l hver. 

Therefore, polynucleottdes and polypepUdes of fl» mvenuon are useful as 
„agen.fordifferli.iden«f.ca«onof.d.dss„e(s^^^^^^^^^ 
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The tissue distribution and homology to IgE receptor indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for allergy and 
asthma. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 159 

The translation product of this gene shares sequence homology with 
immunoglobin heavy chain which is thought to be important in immune response to the 
antigen. 

This gene is expressed primarily in activated neutrophil and to a lesser extent in 

10 activated T cell, monocyte and heart. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: infection , inflammation 
and cancer. Similarly, polypeptides and antibodies directed to these polypeptides are 

IS useful in providing immunological probes for differential identification of the dssue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the immune, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., blood cells, and heart, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 

20 fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to inmiunoglobin heavy chain variable 
region indicate that polynucleotides and polypeptides corresponding to this gene are 

25 useful for making the ligand to block specific antigen which cause certain disease. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 160 

The translation product of this gene shares sequence homology with mouse X 
inactive specific transcript protein which is thought to be important in X chromosome 
30 inactivation. 

This gene is expressed primarily in HSA172 cell and to a lesser extent in normal 
ovary tissue, ovarian cancer, frontal cortex and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of diseases and conditions: ovarian tumor, 

schizophrenia and other neurological disorders. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 



wo 98/39448 



100 



PCT/US98/04493 



differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune and neural system, expression of 
this gene at significantiy higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., ovary and other reproductive tissue, and brain and other 

5 tissue of the nervous system, and cancerous and wounded tissues) or bodily fluids 
(e.g., semm, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 
sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

10 The tissue distribution and homology to X inactive specific tnmscript protein 

indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis and treatment of reproductive system tumors and CNS tumors. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 161 
15 This gene is (^pressed primarily in adipose cell and to a lesser extent in liver 

and prostate. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: obesity and liver 
20 disorder. Sunilarly, polypeptides and antibodies directed to these polypeptides are 

usefiil in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of tiie above tissues or cells, particularly of 
tiie adipose cell, expression of this gene at significantiy higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g.. adipose cells, liver, and 
25 prostate, and cancerous and wounded tissues) or bodily fluids (e.g.. serum, plasma, 
urine, synovial fluid or spinal fluid) or anotiier tissue or cell sample taken firom an 
individual having such a disorder, relative to die standard gene expression level. i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The tissue distribution indicates tiiat polynucleotides and polypeptides 

corresponding to this gene are useful for treatment of obesity and liver disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 162 

The translation product of this gene shares sequence homology with yeast 
35 ubiquitin activating enzyme homolog which is thought to be important in protein 
posttraslation processing. 
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This gene is expressed primarily in stromal cell and to a lesser extent in retina, 
H. Atrophic Endometrium, colon carcinoma and myeloid progenitor cell. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
5 biological sample and for diagnosis of diseases and conditions: defects of stromal cell 
development, neuronal growth disorders and tumors. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identificadon of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the imanune and neural system, expression 
10 of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., retinal cells, endometrium, colon, and bone manow, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
15 in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to ubiquitin-activating enzyme homolog 
indicate that polynucleotides and polypeptides corresponding to this gene are useful for 
diagnosis or treatment of some type of tumors , fucosidosis and neuronal growth 
disorders. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 163 

This gene is expressed primarily in primary breast cancer and 
hemangiopericytoma and to a lesser extent in adult brain and cerebellum. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

25 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer, leukemia 
and cerebellum disorders. Similarly, polypeptides and antibodies directed to these 
polypeptides are useful in providing inrniunological probes for differential identification 
of the tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 

30 particularly of the immune system and neural system , expression of this gene at 

significandy higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., manunary tissue, brain and other tissue of the nervous system, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 

35 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 
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The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis or treatment of various tumors and 
disease involved in neural system. 

5 FEATURES OF PROTEIN ENCODED BY GENE NO: 164 

The translation product of this gene shares sequence homology with proline rich 
proteins. Recently, another group has also cloned this gene, calling it CD84 leukocyte 
antigen, a new member of the Ig superfamily. (See Accession No. U82988, see also. 
Blood 90 (6), 2398-2405 (1997).) 
10 This gene is expressed primarily in Weizmann olfactory tissue and 

osteoclastoma and to a lesser extent in anergic T-cell. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: ostsis and immune 
15 disease. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
immune system, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., olfactory tissue, bone, and 
20 blood cells, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy Ussue or bodily fluid from an individual not having the 
disorder. 

25 The tissue distribution and homology to the Ig superfamily indicate that the 

protein product of this clone is useful for treatment of osteoporosis, autoimmune 
disease, and other immune disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 165 

30 This gene is expressed primarily in atrophic endometrium and colon cancer and 

to a lesser extent in some fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: tumors. Similarly, 

35 polypeptides and antibodies directed to these polypeptides are useful in providing 

immunological probes for differential identification of the tissue(s) or cell type(s). For 
number of disorders of the above tissues or cells, particularly of the immune system. 
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expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., endometrium, colon, and fetal tissue, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
5 such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis and treatment of tumors, specifically 
endometrium and colon tumors. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 166 

This gene is expressed primarily in human primary breast cancer and to a lesser extent 
in activated monocyte. Although the predicted signal sequence is identified in Table 1, 
other upstream sequences are also relevant. Preferred polypeptide fragments comprise 

15 the amino acid sequence: VTQPKHIJSASMGGSNmiPFSFYYPWElJVXXPXVRISWRRGHFHG 
QSFYSTRPPSIHKDYVNRLFLNWTEGQESGFLRISNLRKEDQSVYFCRVELDTRRSG (SEQ ID 
NO: 641), as well as N-terminal and C-terminal deletions. Also preferred are 
polynucleotide fragments encoding these polypeptide fragments. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

20 reagents for differential identification of the tissue(s) or cell type{s) present in a 

biological sample and for diagnosis of diseases and conditions: breast cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inununological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of tht above tissues or cells, particularly of the inunune system, 

25 expression of this gene at significantly higher or lower levels may be routinely detected 
in certain tissues and cell types (e.g., mammary tissue, and blood cells, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 
disorder, relative to the standard gene expression level, i.e., the expression level in 

30 healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for diagnosis of breast cancer. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 167 

35 This gene is expressed primarily in fetal tissues and to a lesser extent in adult 

lung. This gene has also been mapped to chromosomal location 9q34, and thus, can be 
used as a marker for linkage analysis for chromosome 9. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample. Similarly, polypeptides and antibodies directed to these polypeptides 
are useful in providing immunological probes for differential identification of the 

5 tissue(s) or cell type(s). For a number of disorders of the above tissues or cells, 
particularly of the embryo tissues, expression of this gene at significantly higher or 
lower levels may be routinely detected in certain tissues and cell types (e.g., fetal 
tissues, and lung, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 

10 an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 168 

15 The translation product of this gene shares sequence homology with Ig Heavy 

Chain which is thought to be important in immune response. 

This gene is expressed primarily in prostate cancer tissue specifically 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present m a 

20 biological sample and for diagnosis of diseases and conditions: prostate cancer. 
Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing inununological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
prostate, expression of this gene at significantly higher or lower levels may be routinely 

25 detected in certain tissues and cell types (e.g., prostate, tissue and cells of the immune 
system, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

30 disorder. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 169 

The translation product of this gene shares sequence homology with cytosolic 
acyl coenzyme-A hydrolase, which is thought to be important in neuron-specific fatty 
35 acid metabolism. The gene represented by this contig has since been published by Hajra 
and colleagues (GenBank Accession No. U91316). 
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This gene is expressed primarily in human pituitary gland and to a lesser extent 
in colorectal cancer tissue. This gene has also been observed in the LNCAP cell line. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

5 biological sample and for diagnosis of diseases and conditions: hyperlipidemias of 
familial and/or idiopathic origins. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing immunological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly blood, expression of this gene at significantiy higher or 

10 lower levels may be routinely detected in certain tissues and cell types (e.g., pituitary 
and colon, and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, 
urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from an 
individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

15 disorder. 

The tissue distribution and homology to rat cytosolic acyl coenzyme-A 
hydrolase indicate that polynucleotides and polypeptides corresponding to this gene are 
useful for the detection or treatment of hyperlipidemia disease states by vutue of the 
ability of specific drugs to activate the enzyme. 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 170 

The translation product of this gene shares sequence homology with a 
Caenorhabditis elegans gene which is thought to be important in organism 
development. 

25 This gene is expressed primarily in human synovial sarcoma tissue, bone 

marrow, and to a lesser extent in human brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 

30 not limited to, of bone, specifically synovial sarcoma. Similarly, polypeptides and 

antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the bone, connective tissues and possibly 
immune system, expression of this gene at significantly higher or lower levels may be 

35 routinely detected in certain tissues and cell types (e.g., synovial tissue, bone marrow, 
brain and other tissue of the nervous system, and cancerous and wounded tissues) or 
bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another 
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tissue or cell sample taken from an individual having such a disorder, relative to the 
standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to Caenorhabditis elegans indicate that 
5 polynucleotides and polypeptides corresponding to this gene are useful as a diagnostic 
and/or therapeutic modality directed at the detection and/or treattnent of connective 
tissue sarcomas or other related bone diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 171 

10 The translation product of this gene shares sequence homology with betal- 

6GlcN Ac transferase which is thought to be important in the transfer and metabolism of 
betal-6, N-acetylglucosamine. This gene product has previously been shown to 
suppress melanoma lung metastasis in both syngeneic and nude mice, decreased 
invasiveness into the matrigel, and inhibition of cell attachment to collagen and laminin 

15 without affecting cell growth. 

This gene is expressed primarily in human testes and prostate tissues, and to a 
lesser extent in kidney, medulla, and pancreas. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

20 biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancer particularly mtelanoma. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 
differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the immune system, expression of this gene at 

25 significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., testes and other reproductive tissue, prostate, kidney, pancreas, brain and 
other tissue of the nervous system, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 

30 gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to betal-6GlcNAc transferase indicate that 
the protein product of this clone is useful for the development of diagnostic and/or 
therapeutic modalities directed at the detection and/or treatment of cancer, the metastasis 

35 of malignant tissue or cells. Defects in this potentially secreted enzyme may play a role 
in metastasis. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 172 
This gene is expressed primarily in fetal spleen and liver. 
Therefore, polynucleotides and polypeptides of the mvention are useful as 

5 reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: inunune disorders, 
Wilm's tumor disease, hepatic disorders, and hematopoietic disorders. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
immunological probes for differential identification of the tissue(s) or cell type(s). For a 

10 number of disorders of the above tissues or cells, particularly of the hematopoiesis and 
immune systems, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., spleen and liver, and cancerous 
and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or 
spinal fluid) or another tissue or cell sample taken from an individual having such a 

15 disorder, relative to the standard gene expression level, i.e., the expression level in 
healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution indicates that polynucleotides and polypeptides 
corresponding to this gene are useful for the treatment and identification of fetal defects 
along with correcting diseases that affect hematopoiesis and the immune system* 

20 

FEATURES OF PROTEIN ENCODED BY GENE NO: 173 

The translation product of this gene shares sequence homology with ret II 
oncogene which is thought to be important in Hirschsprung disease and many types of 
cancers. 

25 This gene is expressed in multiple tissues including the lymphatic system, brain, 

and thyroid. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
and for diagnosis of diseases and conditions: Hirschsprung disease and multiple 

30 cancers. Similarly, polypeptides and antibodies directed to these polypeptides are useftil 
in providing immunological probes for identification of the tissue(s) or cell type(s). For 
a number of disorders of the above tissues or cells, particularly of the inunune and 
central nervous system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., lymphoid tissue, 

35 thyroid, and brain and other tissue of the nervous system, and cancerous and wounded 
tissues) or bodily fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
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the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 

The tissue distribution and homology to ret n oncogene indicate that 
polynucleotides and polypeptides corresponding to this gene are useful for diagnosis 

5 and treatment of various cancers. It would also be useful for the diagnosis and treatment 
of Hirschsprung disease. Preferred polypeptides of the invention comprise the amino 
acid sequence: MEA(X>VNEAESAREQLQXLHIXJIAGQKASKQE1JETELE^ 
RTKhnT.QSRIKDRDEEIQKl.RNQLTNKTLSNSSQSELENRLHQLTETLIQKQ™ 
VFQLERLE(XJMNSASGSSSNGSSINMSGIDNGEGTRUl>rVP\a-FNDTET^ 

10 SIDQFSIRLGIFLRRYPIARVFVIIYMALUILWVMIVIXTY^ HHDQPYGK (SEQ ID NO: 
642). 

FEATURES OF PROTEIN ENCODED BY GENE NO: 174 

The translation product of this gene shares sequence homology with testis 
1 5 enhanced gene transcript which is thought to be important in regulation of human 
development. 

This gene is expressed primarily in infant brain and to a lesser extent in a variety 
of other tissues and cell types, including the prostate, testes, monocytes, macrophages, 
dendritic cells, keratinocytes, and adipocytes. 

20 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neurological, 
developmental, immune and inflammation disorders. Similarly, polypeptides and 
antibodies directed to these polypeptides arc useful in providing inmiunological probes 

25 for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the brain and immune systems, expression 
of this gene at significantly higher or lower levels may be routinely detected in certain 
tissues and cell types (e.g., brain and other tissue of the nervous system, prostate, 
testes and other reproductive tissue, blood cells, keratinocytes, and adipocytes, and 

30 cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or another tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

The tissue distribution and homology to testis enhanced gene transcript indicate 
35 that the protein product of this clone is useful for diagnosis and treatment of disorders 
involving the developing brain and the immune system. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 175 

This gene is expressed primarily in prostate and to a lesser extent in various 
other tissues, including placenta. 

5 Therefore, polynucleotides and polypeptides of the invention are useful as 

reagents for differential identification of the tissue(s) or cell type($) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, cancers, especially of the prostate. Similarly, polypeptides and antibodies 
directed to these polypeptides are useful in providing immunological probes for 

10 differential identification of the tissue(s) or cell type(s). For a number of disorders of 
the above tissues or cells, particularly of the prostate, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 
types (e.g., prostate and placenta, and cancerous and wounded tissues) or bodily fluids 
(e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or cell 

15 sample taken from an individual having such a disorder, relative to the standard gene 
expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution indicates that the protein product of this clone is useful 
for diagnosis and treatment of prostate disorders and cancer. It may also be useful for 

20 the diagnosis and treatment of endocrine disorders. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 176 

The translation product of this gene shares sequence homology with 
Sacchromyces cerevisiae YNT20 gene which is thought to be important in 

25 mitochondrial function. 

This gene is expressed at a particularly high level in muscle tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identiflcation of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases related to such tissues and cell types 

30 including: muscle wasting diseases. Similarly, polypeptides and antibodies directed to 
these polypeptides are useful in providing inununological probes for differential 
identification of the tissue(s) or cell type(s). For a number of disorders of the above 
tissues or cells, particularly of the neuromuscular system, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 

35 types (e.g., muscle and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
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the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

The tissue distribution and homology to the YNT20 gene indicate that this 
protein is useful for treatment and detection of neuromuscular diseases caused by loss 
5 of mitochondrial function. For example this gene or its protein product could be used in 
replacement therapy for such diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 177 

This gene is expressed primarily in the brain and to a lesser extent in kidney, 

10 placenta, smooth muscle, heart and lung. 

Therefore, polynucleotides and polypeptides of the invention are usefiil as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions: neuromuscular 
diseases, degenerative diseases of the central nervous system, and heart disease. 

15 Similarly, polypeptides and antibodies directed to these polypeptides are useful in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
neuromuscular system, cenural nervous system, and heart, expression of this gene at 
significantly higher or lower levels may be routinely detected in certain tissues and cell 

20 types (e.g., brain and other tissue of the nervous system, kidney, placenta, muscle, 
heart and lung, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 

25 disorder. 

This gene or its protein product could also be used for replacement therapy for 
the above mentioned diseases. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 178 
30 The translation product of this gene shares sequence homology with caldesmon 

which is thought to be important in the cellular response to changes in glucose levels. 
This gene is expressed primarily in multiple tissues including brain and retina. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for identification of the tissue(s) or cell type(s) present in a biological sample 
35 and for diagnosis of diseases and conditions: central nervous system disorders and 
retinopathy. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the CNS 
disorders and retinopathy, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., brain and other tissue of 
the nervous system, and retinal tissue, and cancerous and wounded tissues) or bodily 

5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The tissue distribution and homology to caldesmon indicate that polynucleotides 

10 and polypeptides corresponding to this gene are useful for treatment of retinopathies. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 179 

The translation product of this gene shares sequence homology with mouse 
fibrosin protein, which is thought to be important in regulation of fibrinogenesis in 

15 certain chronic inflammatory diseases. 

This gene is expressed primarily in amniotic cells and breast tissue. 
Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of breast cancer and abnormal embryo 

20 development. Similarly, polypeptides and antibodies directed to these polypeptides are 
useful in providing immunological probes for differential identification of the tissue(s) 
or cell type(s). For a number of disorders of the above tissues or cells, particularly of 
the reproductive system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., amniotic cells, and 

25 mammary tissue, and cancerous and wounded tissues) or bodily fluids (e.g., serum, 
plasma, urine, synovial fluid or spinal fluid) or another tissue or cell sample taken from 
an individual having such a disorder, relative to the standard gene expression level, i.e., 
the expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The tissue distribution and homology to fibrosin indicate that the protein product 

of this clone is useful for treatment of breast cancer. This gene or its protein product 
could be used in replacement therapy for breast cancer. In addition the protein product 
of this gene is useful in the treatment of chronic inflanunatory diseases. 



35 



FEATURES OF PROTEIN ENCODED BY GENE NO: 180 

This gene is expressed several infant tissues including brain and liver and 
various adult tissues including brain, lung, liver, testes, and prostate. 
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Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of diseases and conditions which include, but are 
not limited to, brain cancer, lung cancer, liver cancer and cancers of the reproductive 

5 system. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue{s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, hepatic system, and reproductive system, expression of this 
gene at significantly higher or lower levels may be routinely detected in certain tissues 

10 and cell types (e.g., brain and other tissue of the nervous system, lung, liver, testes and 
other reproductive tissue, and prostate, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 

1 5 individual not having the disorder. 

The tissue distribution of this gene product indicates that the protein product of 
this clone is involved in growth regulation and could be used as a growth factor or 
growth blocker in a variety of settings including treatment of cancers. 

20 FEATURES OF PROTEIN ENCODED BY GENE NO: 181 

This gene is expressed primarily in activated monocytes and to a lesser extent in 
melanocytes and dendritic cells. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 

25 biological sample and for diagnosis of immune system diseases and cancer. Similarly, 
polypeptides and antibodies directed to these polypeptides are useful in providing 
inmiunological probes for differential identification of the tissue(s) or cell type(s). For a 
number of disorders of the above tissues or cells, particularly of the immune system, 
expression of this gene at significantly higher or lower levels may be routinely detected 

30 in certain tissues and cell types (e.g., blood cells, melanocytes, and dendritic cells, and 
cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, synovial 
fluid or spinal fluid) or anotter tissue or cell sample taken from an individual having 
such a disorder, relative to the standard gene expression level, i.e., the expression level 
in healthy tissue or bodily fluid from an individual not having the disorder. 

35 The tissue distribution indicates that the protein product of this clone could be 

involved in growth regulation and could be used as a growth factor or growth blocker 
in a variety of settings including treatment of cancers. 
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FEATURES OF PROTEIN ENCODED BY GENE NO: 182 

This gene is expressed primarily in placenta and several tumors of various tissue 
origin and to a lesser extent in normal tissues including liver, lung, brain, and skin, 

5 Therefore, polynucleotides and polypeptides of the invention are usefiil as 

reagents for differential identification of the tissue(s) or ceU type(s) present in a 
biological sample and for diagnosis of cancers of all kinds. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing unmunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 

10 of the above tissues or cells, particularly of the central nervous system, respiratory 
system and skin, expression of this gene at significantly higher or lower levels may be 
routinely detected in certain tissues and cell types (e.g., liver, lung, brain and other 
tissues of the nervous system, and skin, and cancerous and wounded tissues) or bodily 
fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 

1 5 cell sample taken from an individual having such a disorder, relative to the standard 

gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The high expression of this gene in multiple tumors indicates that the protein 
product of the clone may be involved in cell growth control and therefore would be 

20 useful for treatment of certain cancers. Likewise molecules developed to block the 
activity of the protein product of this clone could be used to block its potential role in 
tumor growth promotion. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 183 
25 The translation product of this gene shares sequence homology with the mouse 

Ndrl gene which is thought to be important in cancer progression. 

This gene is expressed multiple cell types and tissues including brain, lung, 
kidney, bone marrow, liver, and spleen. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
30 reagents for differential identification of die tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of all types of cancers. Similarly, polypeptides and 
antibodies directed to these polypeptides are useful in providing immunological probes 
for differential identification of the tissue(s) or cell type(s). For a number of disorders 
of the above tissues or cells, particularly of the nervous, inmiune, and endocrine 
35 systems, expression of this gene at significantly higher or lower levels may be routinely 
detected in certain tissues and cell types (e.g., brain and other tissue of the nervous 
system, lung, kidney, bone marrow, liver and spleen, and cancerous and wounded 
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tissues) or bodily fluids (e.g., seram, plasma, urine, synovial fluid or spinal fluid) or 
another tissue or cell sample taken from an individual having such a disorder, relative to 
the standard gene expression level, i.e., the expression level in healthy tissue or bodily 
fluid from an individual not having the disorder. 
5 The tissue disuribution and homology to Ndrl gene, which is thought to be 

involved in cancer progression, indicate that polynucleotides and polypeptides 
corresponding to this gene are useful for treatment of certain cancers. Likewise 
molecules developed to block the activity of the protein product of this clone could be 
used to block its potential role in tumor growth promotion. 

10 

FEATURES OF PROTEIN ENCODED BY GENE NO: 184 

This gene is expressed primarily in early stage human brain and liver and to a 
lesser extent in several other fetal tissues. 

Therefore, polynucleotides and polypeptides of the invention are useful as 

15 reagents for differential identification of the tissue(s) or cell type(s) present in a 

biological sample and for diagnosis of diseases and conditions: brain and liver cancers. 
Similarly, polypeptides and antibodies directed to these polypeptides are useftil in 
providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 

20 central nervous system and immune system, expression of this gene at significantly 
higher or lower levels may be routinely detected in certain tissues and cell types (e.g., 
brain and other tissue of the nervous system, liver, and fetal tissue, and cancerous and 
wounded tissues) or bodily fluids (e.g., semm, plasma, urine, synovial fluid or spinal 
fluid) or another tissue or cell sample taken from an individual having such a disorder, 

25 relative to the standard gene expression level, i.e., the expression level in healthy tissue 
or bodily fluid from an individual not having the disorder. 

The expression of this gene in embryonic tissues indicates that the protein could 
be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 

30 

FEATURES OF PROTEIN ENCODED BY GENE NO: 185 

This gene is expressed primarily in infant and embryonic brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
35 biological sample and for diagnosis of degenerative nervous system disorders and brain 
cancer. Similarly, polypeptides and antibodies directed to these polypeptides are useful 
in providing immunological probes for differential identification of the tissue(s) or cell 
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type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, expression of this gene at significantly higher or lower levels 
may be routinely detected in certain tissues and cell types (e.g., embryonic tissue, brain 
and other tissue of the nervous system, and cancerous and wounded tissues) or bodily 

5 fluids (e.g., serum, plasma, urine, synovial fluid or spinal fluid) or another tissue or 
cell sample taken from an individual having such a disorder, relative to the standard 
gene expression level, i.e., the expression level in healthy tissue or bodily fluid from an 
individual not having the disorder. 

The expression of this gene in embryonic tissues indicates that the protein could 

10 be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settings including treatment of cancers. 

FEATURES OF PROTEIN ENCODED BY GENE NO: 186 

This gene is expressed primarily in multiple tissues including placenta, fetal 

15 lung, fetal liver, and brain. 

Therefore, polynucleotides and polypeptides of the invention are useful as 
reagents for differential identification of the tissue(s) or cell type(s) present in a 
biological sample and for diagnosis of all types of cancers including liver, brain and 
lung. Similarly, polypeptides and antibodies directed to these polypeptides are useful in 

20 providing immunological probes for differential identification of the tissue(s) or cell 
type(s). For a number of disorders of the above tissues or cells, particularly of the 
central nervous system, pulmonary system, and hepatic system, expression of this gene 
at significantly higher or lower levels may be routinely detected in certain tissues and 
cell types (e.g., placenta, lung, liver, and brain and other tissue of the nervous system, 

25 and cancerous and wounded tissues) or bodily fluids (e.g., serum, plasma, urine, 

synovial fluid or spinal fluid) or another tissue or cell sample taken from an individual 
having such a disorder, relative to the standard gene expression level. i.e., the 
expression level in healthy tissue or bodily fluid from an individual not having the 
disorder. 

30 The expression of this gene in embryonic tissues indicates that the protein could 

be involved in growth regulation and could be used as a growth factor or growth 
blocker in a variety of settmgs including treatment of cancers. 
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Table 1 summarizes the information corresponding to each "Gene No." 
described above. The nucleotide sequence identified as "NT SEQ ID NO:X" was 
assembled fiom partially homologous ("overiapping") sequences obtained from the 
"cDNA clone ID" identified in Table 1 and, in some cases, from additional related DNA 
5 clones. The overiapping sequences were assembled into a single contiguous sequence 
of high redundancy (usually three to five overiapping sequences at each nucleotide 
position), resulting in a final sequence identified as SEQ ID NO:X. 

The cDN A Clone ID was deposited on the date and given the corresponding 
deposit number listed in "ATCC Deposit No:Z and Date." Some of the deposits contain 
10 multiple different clones corresponding to the same gene. "Vector" refers to tiie type of 
vector contained in the cDNA Clone ID. 

'Total NT Seq." refers to die total number of nucleotides in the contig identified 
by "Gene No." The deposited clone may contain all or most of these sequences, 
refiecied by the nucleotide position indicated as "5' NT of Clone Seq." and the "3' NT 
1 5 of Clone Seq." of SEQ ID NO:X. The nucleotide position of SEQ ID NO:X of the 
putative start codon (methionine) is identified as "5' NT of Start Codon." Similarly , 
the nucleotide position of SEQ ID NO:X of the predicted signal sequence is identified as 
"5' NT of First AA of Signal Pep " 

The translated amino acid sequence, beginning with the methionine, is identified 
20 as "AA SEQ ID NO:Y," although other reading frames can also be easily translated 
using known molecular biology techniques. The polypeptides produced by these 
alternative open reading frames'are specifically contemplated by the present invention. 

The first and last amino acid position of SEQ ID NO: Y of the predicted signal 
peptide is identified as "First AA of Sig Pep" and "Last A A of Sig Pep " The predicted 
25 first amino acid position of SEQ ID NO: Y of the secreted portion is identified as 

"Predicted First AA of Secreted Portion." Finally, the amino acid position of SEQ ID 
NO: Y of die last amino acid in the open reading frame is identified as "Last AA of 
ORF." 

SEQ ID NO:X and the u-anslated SEQ ID NO:Y are sufficienUy accurate and 
30 otherwise suitable for a variety of uses well known in die art and described further 

below. For instance, SEQ ID NO:X is usefiil for designing nucleic acid hybridization 
probes that will detect nucleic acid sequences contained in SEQ ID NO:X or the cDN A 
contained in the deposited clone. These probes will also hybridize to nucleic acid 
molecules in biological samples, thereby enabling a variety of forensic and diagnostic 
35 methods of die invention. Similarly, polypeptides identified from SEQ ID NO: Y may 
be used to generate antibodies which bind specifically to the secreted proteins encoded 
by the cDNA clones identified in Table 1. 
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Nevertheless, DNA sequences generated by sequencing reactions can contain 
sequencing errors. The errors exist as misidentified nucleotides, or as insertions or 
deletions of nucleotides in the generated DNA sequence. The erroneously inserted or 
deleted nucleotides cause frame shifts in the reading frames of the predicted amino acid 

5 sequence. In these cases, the predicted amino acid sequence diverges from the actual 
amino acid sequence, even though the generated DNA sequence may be greater than 
99.9% identical to the actual DNA sequence (for example, one base insertion or deletion 
in an open reading frame of over 1(XX) bases). 

Accordingly, for those applications requiring precision in the nucleotide 

10 sequence or the amino acid sequence, the present invention provides not only the 

generated nucleotide sequence identified as SEQ ID NO:X and the predicted translated 
amino acid sequence identified as SEQ ID NO: Y, but also a sample of plasmid DNA 
containing a human cDNA of the invention deposited with the ATCC, as set forth in 
Table 1 . The nucleotide sequence of each deposited clone can readily be determined by 

15 sequencing the deposited clone in accordance with known methods. The predicted 
amino acid sequence can then be verified from such deposits. Moreover, the amino 
acid sequence of the protein encoded by a particular clone can also be directly 
determined by peptide sequencing or by expressing the protein in a suitable host cell 
containing the deposited human cDNA, collecting the protein, and determining its 

20 sequence. 

The present invention also relates to the genes corresponding to SEQ ID NO:X, 
SEQ ID NO: Y, or the deposited clone. The corresponding gene can be isolated in 
accordance with known methods using the sequence information disclosed herein. 
Such methods include preparing probes or primers from the disclosed sequence and 
25 identifying or amplifying the corresponding gene from appropriate sources of genomic 
material. 

Also provided in the present invention are species homologs. Species 
homologs may be isolated and identified by making suitable probes or primers from the 
sequences provided herein and screening a suitable nucleic acid source for the desired 
30 homologue. 

The polypeptides of the invention can be prepared in any suitable manner. Such 
polypeptides include isolated naturally occurring polypeptides, recombinantly produced 
polypeptides, synthetically produced polypeptides, or polypeptides produced by a 
combination of these methods. Means for preparing such polypeptides are well 
35 understood in the art. 

The polypeptides may be in the form of the secreted protein, including the 
mature form, or may be a part of a larger protein, such as a fusion protein (see below). 
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It is often advantageous to include an additional amino acid sequence which contains 
secretory or leader sequences, pro-sequences, sequences which aid in purification , 
such as multiple histidine residues, or an additional sequence for stability during 
recombinant production. 

The polypeptides of the present invention are preferably provided in an isolated 
form, and preferably are substantially purified. A recombinantly produced version of a 
polypeptide, including the secreted polypeptide, can be substantially purified by the 
one-step method described in Smith and Johnson, Gene 67:31-40 (1988). 
Polypeptides of the invention also can be purified from natural or recombinant sources 
using antibodies of the invention raised against the secreted protein in methods which 
arc well known in the art. 



Signal Sequences 

Methods for predicting whether a protein has a signal sequence, as well as the 
cleavage point for that sequence, are available. For instance, the method of McGeoch. 
Virus Res. 3:271-286 (1985), uses the information from a short N-terminal charged 
region and a subsequent uncharged region of the complete (uncleaved) protein. The 
method of von Heinje, Nucleic Acids Res. 14:4683-4690 (1986) uses the information 
from the residues surrounding the cleavage site, typically residues -13 to +2, where +1 
indicates the amino terminus of the secreted protein. The accuracy of predicting the 
cleavage points of known mammalian secretory proteins for each of these methods is in 
the range of 75-80%. (von Heinje, supra.) However, the two methods do not always 
produce the same predicted cleavage point(s) for a given protein. 

In the present case, the deduced amino acid sequence of the secreted polypeptide 
was analyzed by a computer program called SignalP (Henrik Nielsen et al.. Protein 
Engineering 10: 1-6 ( 1997)), which predicts the cellular location of a protein based on 
the amino acid sequence. As part of this computational prediction of locaUzation. the 
methods of McGeoch and von Heinje are incorporated. The analysis of the amino acid 
sequences of the secreted proteins described herein by this program provided the results 
shown in Table I. 

As one of ordinary skill would appreciate, however, cleavage sites sometimes 
vary from organism to organism and cannot be predicted with absolute certainty. 
Accordingly, the present invention provides secreted polypeptides having a sequence 
shown in SEQ ID NO:Y which have an N-terminus beginning within 5 residues (i.e., + 
or - 5 residues) of the predicted cleavage point. Similarly, it is also recognized that in 
some cases, cleavage of the signal sequence from a secreted protein is not entirely 
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uniform, resulting in more than one secreted species. These polypeptides, and the 
polynucleotides encoding such polypeptides, are contemplated by the present invention. 

Moreover, the signal sequence identified by the above analysis may not 
necessarily predict the naturally occurring signal sequence. For example, the naturally 
5 occurring signal sequence may be further upstream from the predicted signal sequence. 
However, it is likely that the predicted signal sequence will be capable of directing the 
secreted protein to the ER. These polypeptides, and the polynucleotides encoding such 
polypeptides, are contemplated by the present invention. 

10 Polynucleotide and Polypeptide Variants 

"Variant" refers to a polynucleotide or polypeptide differing from the 
polynucleotide or polypeptide of the present invention, but retaining essential properties 
thereof. Generally, variants are overall closely similar, and, in many regions, identical 
to the polynucleotide or polypeptide of the present invention. 

15 "Identity" per se has an art-recognized meaning and can be calculated using 

published techniques. (See,e.g.: (COMPUTATIONAL MOLECULAR BIOLOGY, 
Lesk, A.M., ed., Oxford University Press, New York, (1988); BIOCOMPUTING: 
INFORMATICS AND GENOME PROJECTS. Smith, D.W., ed.. Academic Press, 
New York, ( 1993); COMPUTER ANALYSIS OF SEQUENCE DATA, PART I, 

20 Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, (1994); 

SEQUENCE ANALYSIS IN MOLECULAR BIOLOGY, von Heinje, G., Academic 
Press, (1987); and SEQUENCE ANALYSIS PRIMER, Gribskov, M. and Devereux, 
J., eds., M Stockton Press, New York, (1991).) While there exists a number of 
methods to measure identity between two polynucleotide or polypeptide sequences, the 

25 term "identity" is well known to skilled artisans. (Carillo, H., and Lipton, D., SIAM J 
Applied Math 48: 1073 (1988).) Methods commonly employed to determine identity or 
similarity between two sequences include, but are not limited to, those disclosed in 
"Guide to Huge Computers," Martin J. Bishop, ed.. Academic Press, San Diego, 
(1994), and Carillo, H., and Lipton, D., SIAM J Applied Math 48:1073 (1988). 

30 Methods for aligning polynucleotides or polypeptides are codified in computer 

programs, including the GCG program package (Devereux, J., et al.. Nucleic Acids 
Research (1984) 12(1):387 (1984)), BLASTP, BLASTN, PASTA (Atschul, S.F. el 
al., J. Molec. Biol. 215:403 (1990), Bestfit program (Wisconsin Sequence Analysis 
Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 

35 575 Science Drive, Madison, WI 5371 1 (using the local homology algorithm of Smith 
and Waterman, Advances in Applied Mathematics 2:482-489 (1981).) 
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When using any of the sequence alignment programs to determine whether a 
particular sequence is, for instance. 95% identical to a reference sequence, the 
parameters are set so that the percentage of identity is calculated over the full length of 
the reference polynucleotide and that gaps in idenUty of up to 5% of the total number of 
nucleotides in the reference polynucleotide are allowed. 

A preferred method for determing the best overall match between a query 
sequence (a sequence of the present invention) and a subject sequence, also referred to 
as a global sequence alignment, can be determined using the FASTDB computer 
program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245 
(1990).) The term "sequence" includes nucleotide and amino acid sequences. In a 
sequCTce alignment the query and subject sequences are either both nucleotide 
sequences or both amino acid sequences. The result of said global sequence alignment 
is in percent identity. Preferred parameters used in a FASTDB search of a DNA 
sequence to calculate percent identiy are: Mauix=Unitary, k-tuple=4. Mismatch 
Penalty=l. Joining Penalty=30, Randomization Group Length=0, and Cutoff Score=l, 
Gap Penalty=5, Gap Size Penalty 0.05, and Window Size=500 or query sequence 
length in nucleotide bases, whichever is shorter. Preferred parameters employed to 
calculate percent identity and similarity of an amino acid alignment are: Mattix=PAM 
150, k-tuple=2. Mismatch Penalty=l, Joining Penalty=20. Randomization Group 
Length=0, Cutoff Score=l, Gap Penalty=5, Gap Size Penalty=0.05, and Window 
Size=500 or query sequence length in amino acid residues, whichever is shorter. 

As an iUustraUon. a polynucleotide having a nucleotide sequence of at least 95% 
"identity" to a sequence contained in SEQ ID NO:X or the cDN A contained in the 
deposited clone, means that the polynucleotide is identical to a sequence contained in 
SEQ ID NO:X or the cDNA except that the polynucleotide sequence may include up to 
five point mutations per each 100 nucleotides of the total length (not just within a given 
100 nucleotide stretch). In other words, to obtain a polynucleotide having a nucleotide 
sequence at least 95% identical to SEQ ID NO:X or the deposited clone, up to 5% of the 
nucleotides in the sequence contained in SEQ ID NO:X or the cDNA can be deleted, 
inserted, or substimted with other nucleotides. These changes may occur anywhere 
throughout the polynucleotide. 

Further embodiments of the presem invention include polynucleotides having at 
least 85% identity, more preferably at least 90% identity, and most preferably al least 
95%, 96%, 97%, 98% or 99% identity to a sequence contained in SEQ ID NO:X or tiie 
cDNA contained in the deposited clone. Of course, due to the degeneracy of the genetic 
code, one of ordinary skill in the art will immediately recognize that a large number of 
the polynucleotides having at least 85%. 90%. 95%. 96%, 97%. 98%, or 99% identity 
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will encode a polypeptide identical to an amino acid sequence contained in SEQ ID 
NO:Y or the expressed protein produced by the deposited clone. 

Similarly, by a polypeptide having an amino acid sequence having at least, for 
example, 95% "identity" to a reference polypeptide, is intended that the amino acid 
5 sequence of the polypeptide is identical to the reference polypeptide except that the 
polypeptide sequence may include up to five amino acid alterations per each 100 amino 
acids of the total length of the reference polypeptide. In other words, to obtain a 
polypeptide having an amino acid sequence at least 95% identical to a reference amino 
acid sequence, up to 5% of the amino acid residues in the reference sequence may be 

10 deleted or substituted with another amino acid, or a number of amino acids up to 5% of 
the total amino acid residues in the reference sequence may be inserted into the reference 
sequence. These alterations of the reference sequence may occur at the amino or 
carboxy terminal positions of the reference amino acid sequence or anywhere between 
those terminal positions, interspersed either individually among residues in the 

15 reference sequence or in one or more contiguous groups within the reference sequence. 

Further embodiments of the present invention include polypeptides having at 
least 80% identity, more preferably at least 85% identity, more preferably at least 90% 
identity, and most preferably at least 95%, 96%, 97%, 98% or 99% identity to an 
amino acid sequence contained in SEQ ID NO: Y or the expressed protein produced by 

20 the deposited clone. Preferably, the above polypeptides should exhibit at least one 
biological activity of the protein. 

In a preferred embodiment, polypeptides of the present invention include 
polypeptides having at least 90% similarity, more preferably at least 95% similarity, and 
still more preferably at least 96%, 97%, 98%, or 99% similarity to an amino acid 

25 sequence contained in SEQ ID NO: Y or the expressed protein produced by the 
deposited clone. 

The variants may contain alterations in the coding regions, non-coding regions, 
or both. Especially preferred are polynucleotide variants containing alterations which 
produce silent substitutions, additions, or deletions, but do not alter the properties or 

30 activities of the encoded polypeptide. Nucleotide variants produced by silent 
substitutions due to the degeneracy of the genetic code are preferred. Moreover, 
variants in which 5-10, 1-5, or 1-2 amino acids are substituted, deleted, or added in any 
combination are also preferred. Polynucleotide variants can be produced for a variety 
of reasons, e.g., to optimize codon expression for a particular host (change codons in 

35 the human mRNA to those preferred by a bacterial host such as E. coli). 

Naturally occurring variants are called "allelic variants," and refer to one of 
several alternate forms of a gene occupying a given locus on a chromosome of an 
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organism. (Genes II, Lewin, B., ed., John Wiley & Sons, New York (1985).) These 
allelic variants can vary at either the polynucleotide and/or polypeptide level. 
Alternatively, non-naturally occurring variants may be produced by mutagenesis 
techniques or by direct synthesis. 

5 Using known methods of protein engineering and recombinant DNA 

technology, variants may be generated to improve or alter the characteristics of the 
polypeptides of the present invention. For instance, one or more amino acids can be 
deleted from the N-terminus or C-terminus of the secreted protein without substantial 
loss of biological function. The authors of Ron et al., J. Biol. Chem. 268: 2984-2988 

10 (1993), reported variant KGF proteins having heparin binding activity even after 

deleting 3, 8, or 27 amino-terminal amino acid residues. Similarly, Interferon ganuna 
exhibited up to ten times higher activity after deleting 8-10 amino acid residues from the 
carboxy terminus of this protein. (Dobeli et al., J. Biotechnology 7: 1 99-2 1 6 (1988).) 
Moreover, ample evidence demonstrates that variants often retain a biological 

15 activity similar to that of the naturally occurring protein. For example. Gay le and 

coworkers (J. BioL Chem 268:22105-221 1 1 (1993)) conducted extensive mutational 
analysis of human cytokine XL- la. They used random mutagenesis to generate over 
3,500 individual IL-la mutants that averaged 2.5 amino acid changes per variant over 
the entire length of the molecule. Multiple mutations were examined at every possible 

20 amino acid position. The investigators found that "[mjost of the molecule could be 
altered with littie effect on either [binding or biological activity]." (See, Abstract.) In 
fact, only 23 unique amino acid sequences, out of more than 3,500 nucleotide 
sequences examined, produced a protein that significantly differed in activity from wild- 
type. 

25 Furthermore, even if deleting one or more amino acids from the N-terminus or 

C-terminus of a polypeptide results in modification or loss of one or more biological 
functions, other biological activities may still be retained. For example, the ability of a 
deletion variant to induce and/or to bind antibodies which recognize the secreted form 
will likely be retained when less than the majority of the residues of the secreted form 

30 are removed from the N-terminus or C-terminus. Whether a particular polypeptide 
lacking N- or C-terminal residues of a protein retains such inmiunogenic activities can 
readily be determined by routine methods described herein and otherwise known in the 
art. 

Thus, the invention further includes polypeptide variants which show 
35 substantial biological activity. Such variants include deletions, insertions, inversions, 
repeats, and substitutions selected according to general rules known in the art so as 
have little effect on activity. For example, guidance concerning how to make 
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phenotypically silent amino acid substitutions is provided in Bowie, J. U. et al.. 
Science 247: 1306-1310 ( 1990), wherein the authors indicate that there are two main 
strategies for studying the tolerance of an amino acid sequence to change. 

The first strategy exploits the tolerance of amino acid substitutions by natural 
5 selection during the process of evolution. By comparing amino acid sequences in 
different species, conserved amino acids can be identified. These conserved amino 
acids are likely important for protein function. In contrast, the amino acid positions 
where substitutions have been tolerated by natural selection indicates that these 
positions are not critical for protein function. Thus, positions tolerating amino acid 

10 substitution could be modified while still maintaining biological activity of the protein. 

The second strategy uses genetic engineering to introduce amino acid changes at 
specific positions of a cloned gene to identify regions critical for protein function. For 
example, site directed mutagenesis or alanine-scanning mutagenesis (introduction of 
single alanine mutations at every residue in the molecule) can be used. (Cunningham 

15 and Wells, Science 244: 1081- 1085 (1989).) The resulting mutant molecules can then 
be tested for biological activity. 

As the authors state, these two strategies have revealed that proteins are 
surprisuigly tolerant of anuno acid substitutions. The authors further indicate which 
amino acid changes are likely to be permissive at certain amino acid positions in the 

20 protein. For example, most buried (within the tertiary structure of the protein) amino 
acid residues require nonpolar side chains, whereas few features of surface side chains 
are generally conserved. Moreover, tolerated conservative amino acid substitutions 
involve replacement of the aliphatic or hydrophobic amino acids Ala, Val, Leu and He; 
replacement of the hydroxyl residues Ser and Thr; replacement of the acidic residues 

25 Asp and Glu; replacement of the amide residues Asn and Gin, replacement of the basic 
residues Lys, Arg, and His; replacement of the aromatic residues Phe, Tyr, and Trp, 
and replacement of the small-sized amino acids Ala, Ser, Thr, Met, and Gly. 

Besides conservative amino acid substitution, variants of the present invention 
include (i) substitutions with one or more of the non-conserved amino acid residues, 

30 where the substituted amino acid residues may or may not be one encoded by the 
genetic code, or (ii) substitution with one or more of amino acid residues having a 
substituent group, or (iii) fusion of the mature polypeptide with another compound, 
such as a compound to increase the stability and/or solubility of the polypeptide (for 
example, polyethylene glycol), or (iv) fusion of the polypeptide with additional amino 

35 acids, such as an IgG Fc fusion region peptide, or leader or secretory sequence, or a 
sequence facilitating purification. Such variant polypeptides are deemed to be within 
the scope of those skilled in the art from the teachings herein. 
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For example, polypeptide variants containing amino acid substitutions of 
charged amino acids with other charged or neutral amino acids may produce proteins 
with improved characteristics, such as less aggregation. Aggregation of pharmaceutical 
formulations both reduces activity and increases clearance due to the aggregate's 
5 immunogenic activity. (Pinckard et al., Clin. Exp. Immunol. 2:331-340 (1967); 
Robbins et al.. Diabetes 36: 838-845 (1987); Cleland et al., Crit. Rev. Therapeutic 
Drug Carrier Systems 10:307-377 (1993).) 

p«^|| Yniicleofirie and Po lypeptide Fragments 
10 In the present invention, a "polynucleotide fragment" refers to a short 

polynucleotide having a nucleic acid sequence contained in the deposited clone or 
shown in SEQ ID NO:X. The short nucleotide fragments are preferably at least about 
15 nt, and more preferably at least about 20 nt, still more preferably at least about 30 nt. 
and even more preferably, at least about 40 nt in length. A fragment "at least 20 nt in 
15 length." for example, is intended to include 20 or more contiguous bases from the 
cDNA sequence contained in the deposited clone or the nucleotide sequence shown in 
SEQ ID NO:X. These nucleotide fragments are useful as diagnostic probes and primers 
as discussed herein. Of course, larger fragments (e.g., 50, 150, 500, 600, 2000 
nucleotides) are preferred. 
20 Moreover, representative examples of polynucleotide fragments of the 

invention, include, for example, fragments having a sequence from about nucleotide 
number 1-50, 51-100, 101-150, 151-200, 201-250, 251-300, 301-350, 351-400, 401- 
450, 451-500. 501-550, 551-600, 651-700, and 701 to the end of SEQ ID NO:X or the 
cDNA contained in die deposited clone. In this context "about" includes the particularly 
25 recited ranges, larger or smaller by several (5, 4, 3. 2. or 1 ) nucleotides, at either 
terminus or at both termini. Preferably, these fragments encode a polypeptide which 

has biological activity. 

In the present invention, a "polypeptide fragment" refers to a short amino acid 
sequence contained in SEQ ID NO: Y or encoded by the cDNA contained in the 

30 deposited clone. Protein fragments may be "free-standing," or comprised witiiin a 
larger polypeptide of which the fragment forms a part or region, most preferably as a 
single continuous region. Representative examples of polypeptide fragments of the 
invention, include, for example, fragments from about amino acid number 1-20, 21-40, 
41-60, 61-80, 81-100, 102-120, 121-140, 141-160, and 161 to the end of the coding 

35 region. Moreover, polypeptide fragments can be about 20, 30, 40, 50, 60, 70, 80, 90, 
100, 1 10, 120, 130, 140, or 150 amino acids in length. In this context "about" 
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includes the particularly recited ranges^ larger or smaller by several (5, 4, 3, 2, or 1) 
amino acids, at either extreme or at both extremes. 

Preferred polypeptide fragments include the secreted protein as well as the 
mature form. Further preferred polypeptide fragments include the secreted protein or 
5 the mature foim having a continuous series of deleted residues from the amino or the 
carboxy terminus, or both. For example, any number of amino acids, ranging from 1- 
60, can be deleted from the amino terminus of either the secreted polypeptide or the 
mature form. Similarly, any number of amino acids, ranging from 1-30, can be deleted 
from the carboxy terminus of the secreted protein or mature form. Furthermore, any 
10 combination of the above amino and carboxy terminus deletions are preferred. 

Similarly, polynucleotide fragments encoding these polypeptide fragments are also 
preferred. 

Also preferred are polypeptide and polynucleotide fragments characterized by 
suiictural or functional domains, such as fragments that comprise alpha-helix and alpha- 

15 helix forming regions, beta-sheet and beta-sheet-forming regions, turn and turn- 
forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic 
regions, alpha amphipathic regions, beta amphipathic regions, flexible regions, surface- 
forming regions, substrate binding region, and high antigenic index regions. 
Polypeptide fragments of SEQ ID NO: Y falling within conserved domains are 

20 specifically contemplated by the present invention. Moreover, polynucleotide 
fragments encoding these domains are also contemplated. 

Other preferred fragments are biologically active fragments. Biologically active 
fragments are those exhibiting activity similar, but not necessarily identical, to an 
activity of the polypeptide of the present invention. The biological activity of the 

25 fragments may include an improved desired activity, or a decreased undesirable activity. 

Epitopes & Antibodies 

In the present invention, "epitopes" refer to polypeptide fragments having 
antigenic or immunogenic activity in an animal, especially in a human. A preferred 

30 embodiment of the present invention relates to a polypeptide fragment comprising an 
epitope, as well as the polynucleotide encoding this fragment. A region of a protein 
molecule to which an antibody can bind is defined as an "antigenic epitope." In 
contrast, an "immunogenic epitope" is defined as a part of a protein that elicits an 
antibody response. (See, for instance, Geysen et al., Proc. Natl. Acad. Sci. USA 

35 81:3998-4002 (1983).) 



wo 98/39448 



176 



PCTAJS98/04493 



10 



Fragments which function as epitopes may be produced by any conventional 
means. (See. e.g., Houghten. R. A.. Proc. Nad. Acad. Sci. USA 82:5131-5135 
(1985) further described in U.S. Patent No. 4,631,21 1.) 

In the present invention, antigenic epitopes preferably contain a sequence of at 
least seven, more preferably at least nine, and most preferably between about 15 to 
about 30 amino acids. Antigenic epitopes are useful to raise antibodies, including 
monoclonal antibodies, that specifically bind the epitope. (See, for instance. Wilson et 
al.. Cell 37:767-778 (1984); Sutcliffe, J. G. et al.. Science 219:660-666 (1983).) 

Similarly, immunogenic epitopes can be used to induce antibodies according to 
methods well known in the art. (See, for instance, Sutcliffe et al., supra; Wilson et al., 
supra; Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et 
al., J. Gen. Virol. 66:2347-2354 (1985).) A preferred immunogenic epitope includes 
the secreted protein. The immunogenic epitopes may be presented together with a 
carrier protein, such as an albumin, to an animal system (such as rabbit or mouse) or, if 
15 it is long enough (at least about 25 amino acids), without a carrier. However, 

immunogenic epitopes comprising as few as 8 to 10 amino acids have been shown to be 
sufficient to raise antibodies capable of binding to. at the very least, linear epitopes in a 
denatured polypeptide (e.g., in Western blotting.) 

As used herein, the term "antibody" (Ab) or "monoclonal antibody" (Mab) is 
20 meant to include intact molecules as well as antibody fragments (such as, for example. 
Fab and F(ab')2 fragments) which are capable of specifically binding to protein. Fab 
and F(ab')2 fragments lack the Fc fragment of intact antibody, clear more rapidly from 
the circulation, and may have less non-specific tissue binding than an intact antibody. 
(Wahl et al.. J. Nucl. Med. 24:316-325 (1983).) Thus, these fragments are preferred, 
25 as well as the products of a FAB or other immunoglobulin expression library. 
Moreover, antibodies of the present invention include chimeric, single chain, and 
humanized antibodies. 



Fusion Proteins 

30 Any polypeptide of the present invention can be used to generate fusion 

proteins. For example, the polypeptide of the present invention, when fused to a 
second protein, can be used as an antigenic tag. Antibodies raised against the 
polypeptide of the present invention can be used to indirectly detect the second protein 
by binding to the polypeptide. Moreover, because secreted proteins target cellular 

35 locations based on trafficking signals, the polypeptides of the present invention can be 
used as targeting molecules once fused to other proteins. 
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Examples of domains thai can be fused to polypeptides of the present invention 
include not only heterologous signal sequences, but also other heterologous functional 
regions. The fusion does not necessarily need to be direct, but may occur through 
linker sequences. 

5 Moreover, fusion proteins may also be engineered to improve characteristics of 

the polypeptide of the present invention. For instance, a region of additional amino 
acids, particularly charged amino acids, may be added to the N-terminus of the 
polypeptide to improve stability and persistence during purification from the host cell or 
subsequent handling and storage. Also, peptide moieties may be added to the 

10 polypeptide to facilitate purification. Such regions may be removed prior to final 

preparation of the polypeptide. The addition of peptide moieties to facilitate handling of 
polypeptides are familiar and routine techniques in the art. 

Moreover, polypeptides of the present invention, including fragments, and 
specifically epitopes, can be combined with parts of the constant domain of 

15 inununoglobulins (IgG), resulting in chimeric polypeptides. These fusion proteins 
facilitate purification and show an increased half-life in vivo. One reported example 
describes chimeric proteins consisting of the first two domains of the human CD4- 
polypeptide and various domains of the constant regions of the heavy or light chains of 
manunalian immunoglobulins. (EP A 394,827; Traunecker et al., Nanire 33 1 :84-86 

20 (1988).) Fusion proteins having disulfide-linked dimeric structures (due to the IgG) 
can also be more efficient in binding and neutralizing other molecules, than the 
monomeric secreted protein or protein fragment alone. (Fountoulakis et al., J. 
Biochem. 270:3958-3964 (1995).) 

Similarly, EP-A-O 464 533 (Canadian counterpart 2045869) discloses fusion 

25 proteins comprising various portions of constant region of immunoglobulin molecules 
together with another human protein or part thereof. In many cases, the Fc part in a 
fusion protein is beneficial in therapy and diagnosis, and thus can result in, for 
example, improved pharmacokinetic properties. (EP-A 0232 262.) Alternatively, 
deleting the Fc part after the fusion protein has been expressed, detected, and purified, 

30 would be desired. For example, the Fc portion may hinder therapy and diagnosis if the 
fusion protein is used as an antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5, have been fused with Fc portions for the 
purpose of high-throughput screening assays to identify antagonists of hIL-5. (See, D. 
Bennett et al., J. Molecular Recognition 8:52-58 (1995); K. Johanson et al., J. Biol. 

35 Chem. 270:9459-947 1 ( 1 995).) 

Moreover, the polypeptides of the present invention can be fused to marker 
sequences, such as a peptide which facilitates purification of the fused polypeptide. In 
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preferred embodiments, the marker amino acid sequence is a hexa-histidine peptide, 
such as the tag provided in a pQE vector (QIAGEN, Inc., 9259 Eton Avenue. 
Chatsworth, CA, 9131 1), among others, many of which are commercially available. 
As described in Gentz et al., Proc. Natl. Acad. Sci. USA 86:821-824 (1989), for 
instance, hexa-histidine provides for convenient purification of the fusion protein. 
Another peptide tag useful for purification, the "HA" tag, corresponds to an epitope 
derived from the influenza hemagglutinin protein. (Wilson et al., Cell 37:767 (1984).) 

Thus, any of these above fusions can be engineered using the polynucleotides 
or the polypeptides of the claimed invention. 



10 



yi.rtftrs. Host Cells , and Protein Production 

The present invention also relates to vectors containing the polynucleotide of the 
present invention, host cells, and the production of polypeptides by recombinant 
techniques. The vector may be, for example, a phage, plasmid, viral, or retroviral 
15 vector. Retroviral vectors may be replication competent or replication defective. In the 
latter case, viral propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable marker fw 
propagation in a host. Generally, a plasmid vector is introduced in a precipitate, such 
as a calcium phosphate precipitate, or in a complex with a charged lipid. If the vector is 
20 a virus, it may be packaged in vitro using an appropriate packaging ceU line and then 
transduced into host cells. 

The polynucleotide insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp, phoA and tac 
promoters, the S V40 early and late promoters and promoters of rettoviral LTRs, to 
25 name a few. Other suitable promoters will be known to the skilled artisan. The 

expression constmcts will further contain sites for transcription initiation, termination, 
and, in the transcribed region, a ribosome binding site for translation. The coding 
portion of the transcripts expressed by the constructs wiU preferably include a 
translation initiating codon at the beginning and a termination codon (UAA. UGA or 
30 UAG) appropriately positioned at the end of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase. G418 or neomycin 
resistance for eukaryotic cell culture and tetracycline, kanamycin or ampicillin resistance 
genes for culturing in E. coli and other bacteria. Representative examples of 
35 appropriate hosts include, but are not limited to, bacterial cells, such as E. coli, 

Streptomyces and Salmonella typhimurium cells; fungal cells, such as yeast cells; insect 
cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cells such as CHO. COS. 
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293, and Bowes melanoma cells; and plant cells. Appropriate culture mediums and 
conditions for the above-described host cells are known in the art. 

Among vectors preferred for use in bacteria include pQE70, pQE60 and pQE-9, 
available from QIAGEN, Inc.; pBluescript vectors, Phagescript vectors, pNH8A, 
5 pNH16a, pNHlSA, pNH46A, available from Stratagene Cloning Systems, Inc.; and 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRlT5 available from Pharmacia Biotech. 
Inc. Among preferred eukaryotic vectors are pWLNEO. pSV2CAT, pOG44, pXTl 
and pSG available from Stratagene; and pSVK3, pBPV, pMSG and pSVL available 
from Pharmacia. Other suitable vectors will be readily apparent to the skilled artisan. 

10 Introduction of the construct into the host cell can be effected by calcium 

phosphate transfection, DEAE-dextran mediated transfection, canonic lipid-mediated 
transfection, electroporation, transduction, infection, or other methods. Such methods 
are described in many standard laboratory manuals, such as Davis et al., Basic Methods 
In Molecular Biology (1986). It is specifically contemplated that the polypeptides of the 

15 present invention may in fact be expressed by a host cell lacking a recombinant vector. 

A polypeptide of this invention can be recovered and purified from recombinant 
cell cultures by well-known methods including anunonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 

20 chromatography, hydroxylapatite chromatography and lectin chromatography. Most 
preferably, high performance liquid chromatography ("HPLC") is employed for 
purification. 

Polypeptides of the present invention, and preferably the secreted form, can also 
be recovered from: products purified from natural sources, including bodily fluids, 

25 tissues and cells, whether direcUy isolated or cultured; products of chemical synthetic 
procedures; and products produced by recombinant techniques from a prokaryotic or 
eukaryotic host, including, for example, bacterial, yeast, higher plant, insect, and 
manunalian cells. Depending upon the host employed in a recombinant production 
procedure, the polypeptides of the present invention may be glycosylated or may be 

30 non-glycosylated. In addition, polypeptides of the invention may also include an initial 
modified methionine residue, in some cases as a result of host-mediated processes. 
Thus, it is well known in the art that the N-terminal methionine encoded by the 
translation initiation codon generally is removed with high efficiency from any protein 
after translation in all eukaryotic cells. While the N-terminal methionine on most 

35 proteins also is efficiently removed in most prokaryotes, for some proteins, this 

prokaryotic removal process is inefficient, depending on the nature of the amino acid to 
which the N-terminal methionine is covalentiy hnked. 
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Uses of the Polynucleotides 

Each of the polynucleotides identified herein can be used in numerous ways as 
reagents. The following description should be considered exemplary and utilizes 

5 known techniques. 

The polynucleotides of the present invention are useful for chromosome 
identification. There exists an ongoing need to identify new chromosome markers, 
since few chromosome marking reagents, based on actual sequence data (repeat 
polymorphisms), are presentiy available. Each polynucleotide of the present invention 

10 can be used as a chromosome marker. 

Briefly, sequences can be mapped to chromosomes by preparing PCR primers 
(preferably 15-25 bp) from the sequences shown in SEQ ID NO:X. Primers can be 
selected using computer analysis so that primers do not span more than one predicted 
exon in the genomic DNA. These primers are then used for PCR screening of somatic 

15 cell hybrids containing individual human chromosomes. Only those hybrids containing 
the human gene corresponding to the SEQ ID NO:X will yield an amplified fragment. 

Similarly, somatic hybrids provide a rapid method of PCR mapping the 
polynucleotides to particular chromosomes. Three or more clones can be assigned per 
day using a single thermal cycler. Moreover, sublocalization of the polynucleotides can 

20 be achieved with panels of specific chromosome fragments. Other gene mapping 

strategies that can be used include in situ hybridization, prescreening with labeled flow- 
sorted chromosomes, and preselection by hybridization to construct chromosome 
specific-cDNA libraries. 

Precise chromosomal location of the polynucleotides can also be achieved using 

25 fluorescence in situ hybridization (FISH) of a metaphase chromosomal spread. This 
technique uses polynucleotides as short as 500 or 600 bases; however, polynucleotides 
2,000-4,000 bp are preferred. For a review of this technique, see Verma et al., 
"Human Chromosomes: a Manual of Basic Techniques," Pergamon Press, New York 
(1988). 

30 For chromosome mapping, the polynucleotides can be used individually (to 

mark a single chromosome or a single site on that chromosome) or in panels (for 
marking multiple sites and/or multiple chromosomes). Preferred polynucleotides 
correspond to the noncoding regions of the cDNAs because the coding sequences are 
more likely conserved within gene families, thus increasing the chance of cross 

35 hybridization during chromosomal mapping. 

Once a polynucleotide has been mapped to a precise chromosomal location, the 
physical position of the polynucleotide can be used in linkage analysis. Linkage 
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analysis establishes coinheritance between a chromosomal location and presentation of a 
particular disease. (Disease mapping data are found, for example, in V. McKusick, 
Mendelian Inheritance in Man (available on line through Johns Hopkins University 
Welch Medical Library) .) Assuming 1 megabase mapping resolution and one gene per 
5 20 kb, a cDNA precisely localized to a chromosomal region associated with the disease 
could be one of 50-500 potential causative genes. 

Thus, once coinheritance is established, differences in the polynucleotide and 
the corresponding gene between affected and unaffected individuals can be examined. 
First, visible structural alterations in the chromosomes, such as deletions or 

10 translocations, are examined in chromosome spreads or by PCR. If no structural 

alterations exist, the presence of point mutations are ascertained. Mutations observed in 
some or all affected individuals, but not in normal individuals, indicates that the 
mutation may cause the disease. However, complete sequencing of the polypeptide and 
the corresponding gene from several normal individuals is required to distinguish the 

15 mutation from a polymorphism. If a new polymorphism is identified, this polymorphic 
polypeptide can be used for further linkage analysis. 

Furthermore, increased or decreased expression of the gene in affected 
individuals as compared to unaffected individuals can be assessed using 
polynucleotides of the present invention. Any of these alterations (altered expression, 

20 chromosomal rearrangement, or mutation) can be used as a diagnostic or prognostic 
marker. 

In addition to the foregoing, a polynucleotide can be used to control gene 
expression through triple helix formation or antisense DN A or RNA. Both methods 
rely on binding of the polynucleotide to DNA or RNA. For these techniques, preferred 

25 polynucleotides are usually 20 to 40 bases in length and complementary to either the 
region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids 
Res. 6:3073 (1979); Cooney et al.. Science 241:456 (1988); and Dervan et al.. Science 
251:1360 (1991) ) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxy-nucleotides as Antisense Inhibitors of Gene Expression, CRC 

30 Press, Boca Raton, FL (1988).) Triple helix formation optimally results in a shut-off 
of RNA transcription from DNA, while antisense RNA hybridization blocks translation 
of an mRNA molecule into polypeptide. Both techniques are effective in model 
systems, and the information disclosed herein can be used to design antisense or triple 
helix polynucleotides in an effort to treat disease. 

35 Polynucleotides of the present invention are also useful in gene therapy. One 

goal of gene therapy is to insert a normal gene into an organism having a defective 
gene, in an effort to correct the genetic defect. The polynucleotides disclosed in the 
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present invention offer a means of targeting such genetic defects in a highly accurate 
manner. Another goal is to insert a new gene that was not present in the host genome, 
thereby producing a new trait in the host cell. 

The polynucleotides are also useful for identifying individuals from minute 

5 biological samples. The United States military, for example, is considering the use of 
restriction fragment length polymorphism (RFLP) for identification of its personnel. In 
this technique, an individual's genomic DNA is digested with one or more restriction 
enzymes, and probed on a Southern blot to yield unique bands for identifying 
personnel. This method does not suffer from the current limitations of "Dog Tags" 

10 which can be lost, switched, or stolen, making positive identification difficult. The 
polynucleotides of the present invention can be used as additional DNA markers for 
RFLP. 

The polynucleotides of the present invention can also be used as an alternative to 
RFLP, by determining the actual base-by-base DNA sequence of selected portions of an 

15 individual's genome. These sequences can be used to prepare PCR primers for 

amplifying and isolating such selected DNA, which can then be sequenced. Using this 
technique, individuals can be identified because each individual will have a unique set 
of DNA sequences. Once an unique ID database is established for an individual, 
positive identification of that individual, living or dead, can be made from extremely 

20 small tissue samples. 

Forensic biology also benefits from using DNA-based identification techniques 
as disclosed herein. DNA sequences taken from very small biological samples such as 
tissues, e.g., hair or skin, or body fluids, e.g., blood, saliva, semen, etc., can be 
amplified using PCR. In one prior art technique, gene sequences amplified from 

25 polymorphic loci, such as DQa class II HLA gene, are used in forensic biology to 

identify individuals. (Erlich, H., PCR Technology, Freeman and Co. (1992).) Once 
these specific polymorphic loci are amplified, they are digested with one or more 
restriction enzymes, yielding an identifying set of bands on a Southern blot probed with 
DNA corresponding to the DQa class 11 HLA gene. Similarly, polynucleotides of the 

30 present invention can be used as polymorphic markers for forensic purposes. 

There is also a need for reagents capable of identifying the source of a particular 
tissue. Such need arises, for example, in forensics when presented with tissue of 
unknown origin. Appropriate reagents can comprise, for example, DNA probes or 
primers specific to particular tissue prepared from the sequences of the present 

35 invention. Panels of such reagents can identify tissue by species and/or by organ type. 
In a similar fashion, these reagents can be used to screen tissue cultures for 
contamination. 
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In the very least, the polynucleotides of the present invention can be used as 
molecular weight markers on Southern gels, as diagnostic probes for the presence of a 
specific mRNA in a particular cell type, as a probe to "subtract-out" known sequences 
in the process of discovering novel polynucleotides, for selecting and making oligomers 
5 for attachment to a "gene chip" or other support, to raise anti-DNA antibodies using 
DNA immunization techniques, and as an antigen to elicit an immune response. 

Uses of the PolvDeptides 

Each of the polypeptides identified herein can be used in numerous ways. The 

10 following description should be considered exemplary and utilizes known techniques. 

A polypeptide of the present invention can be used to assay protein levels in a 
biological sample using antibody-based techniques. For example, protein expression in 
tissues can be studied with classical immunohistological methods. (Jalkanen, M., et 
al., J, Cell. Biol. 101:976-985 (1985); Jalkanen, M., et al., J. Cell . Biol. 105:3087- 

15 3096 (1987).) Other antibody-based methods useful for detecting protein gene 

expression include immunoassays, such as the enzyme linked inmiunosorbent assay 
(ELISA) and the radioimmunoassay (RIA). Suitable antibody assay labels are known 
in the art and include enzyme labels, such as, glucose oxidase, and radioisotopes, such 
as iodine (1251, 1211), carbon (14C), sulfur (35S), tritium (3H), indium (1 12In), and 

20 technetium (99mTc), and fluorescent labels, such as fluorescein and rhodamine, and 
biotin. 

In addition to assaying secreted protein levels in a biological sample, proteins 
can also be detected in vivo by imaging. Antibody labels or markers for in vivo 
imaging of protein include those detectable by X-radiography, NMR or ESR. For X- 
25 radiography, suitable labels include radioisotopes such as barium or cesium, which emit 
detectable radiation but are not overtly harmful to the subject. Suitable markers for 
NMR and ESR include those with a detectable characteristic spin, such as deuterium, 
which may be incorporated into the antibody by labeling of nutrients for the relevant 
hybridoma. 

30 A protein-specific antibody or antibody fragment which has been labeled with 

an appropriate detectable imaging moiety, such as a radioisotope (for example, 1311, 
1 121n, 99mTc), a radio-opaque substance, or a material detectable by nuclear magnetic 
resonance, is introduced (for example, parenterally, subcutaneously, or 
intraperitoneally) into the mammal. It will be understood in the art that the size of the 

35 subject and the imaging system used will determine the quantity of imaging moiety 

needed to produce diagnostic images. In the case of a radioisotope moiety, for a human 
subject, the quantity of radioactivity injected will normally range from about 5 to 20 
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millicuries of 99mTc. The labeled antibody or antibody fragment will then 
preferentially accumulate at the locaUon of cells which contain the specific protein. In 
vivo tumor imaging is described in S.W. Burchiel et al., "Immunopharmacokinetics of 
Radiolabeled Antibodies and Their Fragments." (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer, S.W. Burchiel and B. A. Rhodes, eds., Masson 
Publishing Inc. (1982).) 

Thus, the invention provides a diagnostic method of a disorder, which involves 
(a) assaying the expression of a polypeptide of the present invention in cells or body 
fluid of an individual; (b) comparing the level of gene expression with a standard gene 
expression level, whereby an increase or decrease in the assayed polypeptide gene 
expression level compared to the standard expression level is indicative of a disorder. 

Moreover, polypeptides of the present invention can be used to treat disease. 
For example, patients can be administered a polypeptide of the present invention in an 
effort to replace absent or decreased levels of the polypeptide (e.g., insulin), to 
supplement absent or decreased levels of a different polypeptide (e.g., hemoglobin S 
for hemoglobin B), to inhibit the activity of a polypeptide (e.g., an oncogene), to 
activate the activity of a polypeptide (e.g., by binding to a receptor), to reduce the 
activity of a membrane bound receptor by competing with it for free ligand (e.g., 
soluble TNF receptors used in reducing inflammation), or to bring about a desired 
response (e.g., blood vessel growth). 

Similarly, antibodies directed to a polypeptide of the present invention can also 
be used to treat disease. For example, administration of an antibody directed to a 
polypeptide of the present invention can bind and reduce overproduction of the 
polypeptide. Similarly, administration of an antibody can activate the polypeptide, such 
as by binding to a polypeptide bound to a membrane (receptor). 

At the very least, the polypeptides of the present invention could be used as 
molecular weight markers on SDS-PAGE gels or on molecular sieve gel filtration 
columns using methods well known to those of skill in the art. Polypeptides can also 
be used to raise antibodies, which in turn are used to measure protein expression from a 
recombinant cell, as a way of assessing transformation of the host cell. Moreover, tiie 
polypeptides of the present invention can be used to test the following biological 
activities. 

Rinlnpical Activities 

The polynucleotides and polypeptides of the present invention can be used in 
assays to test for one or more biological activities. If these polynucleotides and 
polypeptides do exhibit activity in a particular assay, it is likely that these molecules 
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may be involved in the diseases associated with the biological activity. Thus, the 
polynucleotides and polypeptides could be used to treat the associated disease. 

Immune Activity 

S A polypeptide or polynucleotide of the present invention may be useful in 

treating deficiencies or disorders of the immune system, by activating or inhibiting the 
proliferation, differentiation, or mobilization (chemotaxis) of immune cells. Immune 
ceils develop through a process called hematopoiesis, producing myeloid (platelets, red 
blood cells, neutrophils, and macrophages) and lymphoid (B and T lymphocytes) cells 

10 from pluripotent stem cells. The etiology of these immune deficiencies or disorders 

may be genetic, somatic, such as cancer or some autoimmune disorders, acquired (e.g., 
by chemotherapy or toxins), or infectious. Moreover, a polynucleotide or polypeptide 
of the present invention can be used as a marker or detector of a particular immune 
system disease or disorder. 

15 A polynucleotide or polypeptide of the present invention may be useful in 

treating or detecting deficiencies or disorders of hematopoietic cells. A polypeptide or 
polynucleotide of the present invention could be used to increase differentiation and 
proliferation of hematopoietic cells, including the pluripotent stem cells, in an effort to 
treat those disorders associated with a decrease in certain (or many) types hematopoietic 

20 cells. Examples of immunologic deficiency syndromes include, but are not limited to: 
blood protein disorders (e.g. aganmiaglobulinemia, dysgammaglobulinemia), ataxia 
telangiectasia, common variable inmiunodeficiency, Digeorge Syndrome, HIV 
infection. HTLV-BLV infection, leukocyte adhesion deficiency syndrome, 
lymphopenia, phagocyte bactericidal dysfunction, severe combined immunodeficiency 

25 (SCIDs), Wiskott-Aldrich Disorder, anemia, thrombocytopenia, or hemoglobinuria. 

Moreover, a polypeptide or polynucleotide of the present invention could also 
be used to modulate hemostatic (the stopping of bleeding) or thrombolytic activity (clot 
formation). For example, by increasing hemostatic or thrombolytic activity, a 
polynucleotide or polypeptide of the present invention could be used to treat blood 

30 coagulation disorders (e.g., afibrinogenemia, factor deficiencies), blood platelet 

disorders (e.g. thrombocytopenia), or wounds resulting from trauma, surgery, or other 
causes. Alternatively, a polynucleotide or polypeptide of the present invention that can 
decrease hemostatic or thrombolytic activity could be used to inhibit or dissolve 
clotting. These molecules could be important in the treatment of heart auacks 

35 (infarction), strokes, or scarring. 

A polynucleotide or polypeptide of the present invention may also be useful in 
treating or detecting autoimmune disorders. Many autoimmune disorders result from 
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inappropriate recognition of self as foreign material by immune cells. This 
inappropriate recognition results in an immune response leading to the destruction of the 
host tissue. Therefore, the administration of a polypeptide or polynucleotide of the 
present invention that inhibits an immune response, particularly the proliferation. 

5 differentiation, or chemotaxis of T-cells, may be an effective therapy in preventing 
autoimmune disorders. 

Examples of autoimmune disorders that can be treated or detected by the present 
invention include, but are not limited to: Addison*s Disease, hemolytic anemia, 
antiphospholipid syndrome, rheumatoid arthritis, dermatitis, allergic encephalomyelitis, 

10 glomerulonephritis, Goodpasture's Syndrome, Graves' Disease, Multiple Sclerosis, 
Myasthenia Gravis, Neuritis, Ophthalmia, Bullous Pemphigoid, Pemphigus, 
Polyendocrinopathies, Purpura, Reiter*s Disease, Stiff-Man Syndrome, Autoimmune 
Thyroiditis, Systemic Lupus Erythematosus, Autoimmune Pulmonary Inflammation, 
Guillain-Barre Syndrome, insulin dependent diabetes mellitis, and autoimmune 

15 inflammatory eye disease. 

Similarly, allergic reactions and conditions, such as asthma (particularly allergic 
asthma) or other respiratory problems, may also be treated by a polypeptide or 
polynucleotide of the present invention. Moreover, these molecules can be used to treat 
anaphylaxis, hypersensitivity to an antigenic molecule, or blood group incompatibility. 

20 A polynucleotide or polypeptide of the present invention may also be used to 

treat and/or prevent organ rejection or graft-versus-host disease (GVHD). Organ 
rejection occurs by host immune cell destruction of the transplanted tissue through an 
immune response. Similarly, an immune response is also involved in GVHD, but, in 
this case, the foreign transplanted immune cells destroy the host tissues. The 

25 administration of a polypeptide or polynucleotide of the present invention that inhibits 
an immune response, particularly the proliferation, differentiation, or chemotaxis of T- 
cells, may be an effective therapy in preventing organ rejection or GVHD. 

Similarly, a polypeptide or polynucleotide of the present invention may also be 
used to modulate inflanmiation. For example, the polypeptide or polynucleotide may 

30 inhibit the proliferation and differentiation of cells involved in an inflammatory 

response. These molecules can be used to treat inflammatory conditions, both chronic 
and acute conditions, including inflanmiation associated with infection (e.g., septic 
shock, sepsis, or systemic inflammatory response syndrome (SIRS)), ischemia- 
reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute 

35 rejection, nephritis, cytokine or chemokine induced lung injury, inflammatory bowel 
disease, Crohn's disease, or resulting from over production of cytokines (e.g., TNF or 
IL-1.) 
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Hvperproliferative Disorders 

A polypeptide or polynucleotide can be used to treat or detect hyperproliferative 
disorders, including neoplasms. A polypeptide or polynucleotide of the present 
5 invention may inhibit the proliferation of the disorder through direct or indirect 
interactions. Alternatively, a polypeptide or polynucleotide of the present invention 
may proliferate other cells which can inhibit the hyperproliferative disorder. 

For example, by increasing an immune response, particularly increasing 
antigenic qualities of the hyperproliferative disorder or by proliferating, differentiating, 
10 or mobilizing T-cells, hyperproliferative disorders can be treated. This immune 
response may be increased by either enhancing an existing immune response, or by 
initiating a new immune response. Alternatively, decreasing an immune response may 
also be a method of treating hyperproliferative disorders, such as a chemotherapeutic 
agent. 

15 Examples of hyperproliferative disorders that can be treated or detected by a 

polynucleotide or polypeptide of the present invention include, but are not limited to 
neoplasms located in the: abdomen, bone, breast, digestive system, liver, pancreas, 
peritoneum, endocrine glands (adrenal, parathyroid, pituitary, testicles, ovary, thymus, 
thyroid), eye, head and neck, nervous (central and peripheral), lymphatic system, 

20 pelvic, sldn, soft tissue, spleen, thoracic, and urogenital. 

Similarly, other hyperproliferative disorders can also be treated or detected by a 
polynucleotide or polypeptide of the present invention. Examples of such 
hyperproliferative disorders include, but are not limited to: hyperganunaglobulinemia, 
lymphoproliferative disorders, paraproteinemias, purpura, sarcoidosis, Sezary 

25 Syndrome, Waldenstron's Macroglobulinemia, Gaucher* s Disease, histiocytosis, and 
any other hyperproliferative disease, besides neoplasia, located in an organ system 
listed above. 

Infectious Disease 

30 A polypeptide or polynucleotide of the present invention can be used to treat or 

detect infectious agents. For example, by increasing the immune response, particularly 
increasing the proliferation and differentiation of B and/or T cells, infectious diseases 
may be treated. The immune response may be increased by either enhancing an existing 
immune response, or by initiating a new immune response. Alternatively, the 

35 polypeptide or polynucleotide of the present invention may also directly inhibit the 
infectious agent, without necessarily eliciting an immune response. 
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Viruses are one example of an infectious agent that can cause disease or 
symptoms that can be treated or detected by a polynucleotide or polypeptide of the 
present invention. Examples of viruses, include, but are not limited to the following 
DNA and RNA viral families: Arbovirus. Adenoviridae, Arenaviridae, Arterivirus, 

5 Bimaviridae, Bunyaviridae, Caliciviridae, Circoviridae, Coronaviridae, Flaviviridae, 
Hepadnaviridae (Hepatitis), Herpesviridae (such as. Cytomegalovirus, Herpes 
Simplex, Herpes Zoster), Mononegavirus (e.g., Paramyxoviridae, Morbillivirus, 
Rhabdoviridae), Orthomyxoviridae (e.g.. Influenza), Papovaviridae. Parvoviridae, 
Picomaviridae, Poxviridae (such as Smallpox or Vaccinia), Reoviridae (e.g., 

10 Rotavirus), Retroviridae (HTLV-I, HTLV-II, Lentivirus), and Togaviridae (e.g., 
Rubivirus). Viruses falling within these families can cause a variety of diseases or 
symptoms, including, but not limited to: arthritis, bronchiollitis, encephalitis, eye 
infections (e.g., conjunctivitis, keratitis), chronic fatigue syndrome, hepatitis (A, B, C, 
E, Chronic Active, Delta), meningitis, opportunistic infections (e.g., AIDS), 

15 pneumonia, Burkitt's Lymphoma, chickenpox , hemorrhagic fever. Measles, Mumps, 
Parainfluenza, Rabies, the conmion cold. Polio, leukemia, Rubella, sexually 
transmitted diseases, skin diseases (e.g., Kaposi's, warts), and viremia. A polypeptide 
or polynucleotide of the present invention can be used to treat or detect any of these 
symptoms or diseases. 

20 Similarly, bacterial or fungal agents that can cause disease or symptoms and that 

can be treated or detected by a polynucleotide or polypeptide of the present invention 
include, but not limited to, the following Gram-Negative and Gram-positive bacterial 
families and fungi: Actinomycetales (e.g., Corynebacterium, Mycobacterium, 
Norcardia), Aspergillosis, Bacillaceae (e.g.. Anthrax, Clostridium), Bacteroidaceae, 

25 Blastomycosis, Bordetella, Borrelia, Brucellosis, Candidiasis, Campylobacter, 

Coccidioidomycosis, Cryptococcosis, Dermatocycoses, Enterobacteriaceae (Klebsiella, 
Salmonella, Serratia, Yersinia), Erysipelothrix, Helicobacter, Legionellosis, 
Leptospirosis, Listeria, Mycoplasmatales, Neisseriaceae (e.g., Acinetobacter, 
Gonorriiea, Menigococcal), Pasteurellacea Infections (e.g., Actinobacillus, 

30 Heamophilus, Pasteurella), Pseudomonas, Rickettsiaceae, Chlamydiaceae, Syphilis, 
and Staphylococcal. These bacterial or fungal families can cause the following diseases 
or symptoms, including, but not limited to: bacteremia, endocarditis, eye infections 
(conjunctivitis, tuberculosis, uveitis), gingivitis, opportunistic infections (e.g., AIDS 
related infections), paronychia, prosthesis-related infections, Reiter's Disease, 

35 respiratory tract infections, such as Whooping Cough or Empyema, sepsis, Lyme 
Disease, Cat-Scratch Disease, Dysentery, Paratyphoid Fever, food poisoning, 
Typhoid, pneumonia. Gonorrhea, meningitis. Chlamydia, Syphilis, Diphtheria, 
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Leprosy, Paratuberculosis, Tuberculosis, Lupus, Botulism, gangrene, tetanus, 
impetigo. Rheumatic Fever, Scarlet Fever, sexually transmitted diseases, skin diseases 
(e.g., cellulitis, dermatocycoses), toxemia, urinary tract infections, wound infections. 
A polypeptide or polynucleotide of the present invention can be used to treat or detect 
5 any of these symptoms or diseases. 

Moreover, parasitic agents causing disease or symptoms that can be treated or 
detected by a polynucleotide or polypeptide of the present invention include, but not 
limited to, the following families: Amebiasis, Babesiosis, Coccidiosis, 
Cryptosporidiosis, Dientamoebiasis, Dourine, Ectoparasitic, Giardiasis, Helminthiasis, 

1 0 Leishmaniasis, Theileriasis. Toxoplasmosis, Trypanosomiasis, and Trichomonas. 

These parasites can cause a variety of diseases or symptoms, including, but not limited 
to: Scabies, Trombiculiasis, eye infections, intestinal disease (e.g., dysentery, 
giardiasis), liver disease, lung disease, opportunistic infections (e.g., AIDS related). 
Malaria, pregnancy complications, and toxoplasmosis. A polypeptide or polynucleotide 

15 of the present invention can be used to treat or detect any of these symptoms or 
diseases. 

Preferably, treatment using a polypeptide or polynucleotide of the present 
invention could either be by administering an effective amount of a polypeptide to the 
patient, or by removing cells from the patient, supplying the cells with a polynucleotide 
20 of the present invention, and reluming the engineered cells to the patient (ex vivo 

therapy). Moreover, the polypeptide or polynucleotide of the present invention can be 
used as an antigen in a vaccine to raise an immune response against infectious disease. 

Re generation 

25 A polynucleotide or polypeptide of the present invention can be used to 

differentiate, proliferate, and attract cells, leading to the regeneration of tissues. (See, 
Science 276:59-87 (1997).) The regeneration of tissues could be used to repair, 
replace, or protect tissue damaged by congenital defects, trauma (wounds, bums, 
incisions, or ulcers), age, disease (e.g. osteoporosis, osteocarthritis, periodontal 

30 disease, liver failure), surgery, including cosmetic plastic surgery, fibrosis, reperfusion 
injury, or systemic cytokine damage. 

Tissues that could be regenerated using the present invention include organs 
(e.g., pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal 
or cardiac), vascular (including vascular endothelium), nervous, hematopoietic, and 

35 skeletal (bone, cartilage, tendon, and ligament) tissue. Preferably, regeneration occurs 
without or decreased scarring. Regeneration also may include angiogenesis. 
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Moreover, a polynucleotide or polypeptide of the present invention may increase 
regeneration of tissues difficult to heal. For example, increased tendon/ligament 
regeneration would quicken recovery time after damage. A polynucleotide or 
polypeptide of the present invention could also be used prophylactically in an effort to 
5 avoid damage. Specific diseases that could be treated include of tendinitis, carpal tunnel 
syndrome, and other tendon or ligament defects. A further example of tissue 
regeneration of non-healing wounds includes pressure ulcers, ulcers associated with 
vascular insufficiency, surgical, and traumatic wounds. 

Similarly, nerve and brain tissue could also be regenerated by using a 

10 polynucleotide or polypeptide of the present invention to proliferate and differentiate 
nerve cells. Diseases that could be treated using this method include central and 
peripheral nervous system diseases, neuropathies, or mechanical and traumatic 
disorders (e.g., spinal cord disorders, head trauma, cerebrovascular disease, and 
stoke). Specifically, diseases associated with peripheral nerve injuries, peripheral 

15 neuropathy (e.g., resulting from chemotherapy or other medical therapies), localized 
neuropathies, and central nervous system diseases (e.g., Alzheimer's disease, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy- 
Drager syndrome), could all be Ureated using the polynucleotide or polypeptide of the 
present invention. 

20 

Chemotaxis 

A polynucleotide or polypeptide of the present invention may have chemotaxis 
activity. A chemoiaxic molecule attracts or mobilizes cells (e.g., monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial 

25 cells) to a particular site in the body, such as inflammation, infection, or site of 

hyperproliferation. The mobilized cells can then fight off and/or heal the particular 
trauma or abnormality. 

A polynucleotide or polypeptide of the present invention may increase 
chemotaxic activity of particular cells. These chemotactic molecules can then be used to 

30 treat inflanunation, infection, hyperproliferative disorders, or any inmiune system 

disorder by increasing the number of cells targeted to a particular location in the body. 
For example, chemotaxic molecules can be used to treat wounds and other trauma to 
tissues by attracting immune cells to the injured location. Chemotactic molecules of the 
present invention can also attract fibroblasts, which can be used to treat wounds. 

35 It is also contemplated that a polynucleotide or polypeptide of the present 

invention may inhibit chemotactic activity. These molecules could also be used to treat 
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disorders. Thus, a polynucleotide or polypeptide of the present invention could be used 
as an inhibitor of chemotaxis. 

Binding Activity 

S A polypeptide of the present invention may be used to screen for molecules that 

bind to the polypeptide or for molecules to which the polypeptide binds. The binding 
of the polypeptide and the molecule may activate (agonist), increase, inhibit 
(antagonist), or decrease activity of the polypeptide or the molecule bound. Examples 
of such molecules include antibodies, oligonucleotides, proteins (e.g., receptors),or 

10 small molecules. 

Preferably, the molecule is closely related to the natural ligand of the 
polypeptide, e.g., a fragment of the ligand, or a natural substrate, a ligand, a structural 
or functional mimetic. (See, Coligan et al.. Current Protocols in Immunology 
1 (2):Chapter 5 ( 199 1 ).) Similarly, the molecule can be closely related to the natural 

15 receptor to which the polypeptide binds, or at least, a fragment of the receptor capable 
of being bound by the polypeptide (e.g., active site). In eitiier case, the molecule can 
be rationally designed using known techniques. 

Preferably, the screening for these molecules involves producing appropriate 
cells which express the polypeptide, either as a secreted protein or on the cell 

20 membrane. Preferred cells include cells from manmials, yeast, Drosophila, or £. colL 
Cells expressing the polypeptide (or cell membrane containing the expressed 
polypeptide) are then preferably contacted with a test compound potentially containing 
the molecule to observe binding, stimulation, or inhibition of activity of either the 
polypeptide or the molecule. 

25 The assay may simply test binding of a candidate compound to the polypeptide, 

wherein binding is detected by a label, or in an assay involving competition with a 
labeled competitor. Further, the assay may test whether the candidate compound results 
in a signal generated by binding to the polypeptide. 

Alternatively, the assay can be carried out using cell-free preparations, 

30 polypeptide/molecule affixed to a solid support, chemical libraries, or natural product 
mixtures. The assay may also simply comprise the steps of mixing a candidate 
compound with a solution containing a polypeptide, measuring polypeptide/molecule 
activity or binding, and comparing the polypeptide/molecule activity or binding to a 
standard. 

35 Preferably, an ELISA assay can measure polypeptide level or activity in a 

sample (e.g., biological sample) using a monoclonal or polyclonal antibody. The 
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antibody can measure polypeptide level or activity by either binding, directly or 
indirectly, to the polypeptide or by competing with the polypeptide for a substrate. 

All of these above assays can be used as diagnostic or prognostic markers. The 
molecules discovered using these assays can be used to treat disease or to bring about a 
5 particular result in a patient (e.g., blood vessel growth) by activating or inhibiting the 
polypeptide/molecule. Moreover, the assays can discover agents which may inhibit or 
enhance the production of the polypeptide from suitably manipulated cells or tissues. 

Therefore, the invention includes a method of identifying compounds which 
bind to a polypeptide of the invention comprising the steps of: (a) incubating a 
10 candidate binding compound with a polypeptide of the invention; and (b) determining if 
binding has occurred. Moreover, the invention includes a method of identifying 
agonists/antagonists comprising the steps of: (a) incubating a candidate compound with 
a polypeptide of the invention, (b) assaying a biological activity , and (b) determining if 
a biological activity of the polypeptide has been altered. 

15 

Other Activities 

A polypeptide or polynucleotide of the present invention may also increase or 
decrease the differentiation or proliferation of embryonic stem cells* besides, as 
discussed above, hematopoietic lineage. 

20 A polypeptide or polynucleotide of the present invention may also be used to 

modulate mammalian characteristics, such as body height, weight, hair color, eye color, 
skin, percentage of adipose tissue, pigmentation, size, and shape (e.g., cosmetic 
surgery). Similarly, a polypeptide or polynucleotide of the present invention may be 
used to modulate mammalian metabolism affecting catabolism, anabolism, processing, 

25 utilization, and storage of energy. 

A polypeptide or polynucleotide of the present invention may be used to change 
a mammal's mental state or physical state by influencing biorhythms, caricadic 
rhythms, depression (including depressive disorders), tendency for violence, tolerance 
for pain, reproductive capabilities (preferably by Activin or Inhibin-like activity), 

30 hormonal or endocrine levels, appetite, libido, memory, stress, or other cognitive 
qualities. 

A polypeptide or polynucleotide of the present invention may also be used as a 
food additive or preservative, such as to increase or decrease storage capabilities, fat 
content, lipid, protein, carbohydrate, vitamins, minerals, cofaciors or other nutritional 
35 components. 
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Other Preferred Embodiments 

Other preferred embodiments of the claimed invention include an isolated 
nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
to a sequence of at least about 50 contiguous nucleotides in the nucleotide sequence of 
5 SEQ ED NO:X wherein X is any integer as defined in Table 1. 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the S* Nucleotide of the 
Clone Sequence and ending with the nucleotide at about the position of the 3' 
10 Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in Table 1 . 

Also preferred is a nucleic acid molecule wherein said sequence of contiguous 
nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the range of 
positions beginning with the nucleotide at about the position of the 5' Nucleotide of the 
Start Codon and ending with the nucleotide at about the position of the 3' Nucleotide of 
1 5 the Clone Sequence as defined for SEQ ID NO:X in Table 1 . 

Similarly preferred is a nucleic acid molecule wherein said sequence of 
contiguous nucleotides is included in the nucleotide sequence of SEQ ID NO:X in the 
range of positions beginning with the nucleotide at about the position of the S' 
Nucleotide of the First Amino Acid of the Signal Peptide and ending with the nucleotide 
20 at about the position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID 
NO:X in Table 1. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a sequence of at least about 150 contiguous 

nucleotides in the nucleotide sequence of SEQ ID NO:X. 
25 Further preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a sequence of at least about 500 contiguous 

nucleotides in the nucleotide sequence of SEQ ID NO:X. 

A further preferred embodiment is a nucleic acid molecule comprising a 

nucleotide sequence which is at least 95% identical to the nucleotide sequence of SEQ 
30 ID NO:X beginning with the nucleotide at about the position of the 5' Nucleotide of the 

First Amino Acid of the Signal Peptide and ending with the nucleotide at about the 

position of the 3' Nucleotide of the Clone Sequence as defined for SEQ ID NO:X in 

Table 1. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
35 a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence of SEQ ID NO:X. 
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Also preferred is an isolated nucleic acid molecule which hybridizes under 
stringent hybridization conditions to a nucleic acid molecule, wherein said nucleic acid 
molecule which hybridizes does not hybridize under stringent hybridization conditions 
to a nucleic acid molecule having a nucleotide sequence consisting of only A residues or 
5 of only T residues. 

Also preferred is a composition of matter comprising a DN A molecule which 
comprises a human cDNA clone identified by a cDNA Clone Identifier in Table 1, 
which DNA molecule is contained in the material deposited with the American Type 
Culture Collection and given the ATCC Deposit Number shown in Table 1 for said 
10 cDNA Clone Identifier. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
sequence which is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in the nucleotide sequence of a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 , which DNA molecule is contained in the deposit given the 
1 5 ATCC Deposit Number shown in Table 1 . 

Also preferred is an isolated nucleic acid molecule, wherein said sequence of at 
least 50 contiguous nucleotides is included in the nucleotide sequence of the complete 
open reading frame sequence encoded by said human cDNA clone. 

Also preferred is an isolated nucleic acid molecule comprising a nucleotide 
20 sequence which is at least 95% identical to sequence of at least 150 contiguous 
nucleoddes in the nucleotide sequence encoded by said human cDNA clone. 

A further preferred embodiment is an isolated nucleic acid molecule comprising 
a nucleotide sequence which is at least 95% identical to sequence of at least 500 
contiguous nucleotides in the nucleotide sequence encoded by said human cDNA clone. 
25 A further preferred embodiment is an isolated nucleic acid molecule comprising 

a nucleotide sequence which is at least 95% identical to the complete nucleotide 
sequence encoded by said human cDNA clone. 

A further preferred embodiment is a method for detecting in a biological sample 
a nucleic acid molecule comprising a nucleotide sequence which is at least 95% identical 
30 to a sequence of at least 50 contiguous nucleotides in a sequence selected from the 

group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any integer 
as defined in Table 1 ; and a nucleotide sequence encoded by a human cDN A clone 
identified by a cDN A Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table 1 ; which method 
35 comprises a step of comparing a nucleotide sequence of at least one nucleic acid 
molecule in said sample with a sequence selected from said group and determining 
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whether the sequence of said nucleic acid molecule in said sample is at least 95% 
identical to said selected sequence. 

Also preferred is the above method wherein said step of comparing sequences 
comprises determining the extent of nucleic acid hybridization between nucleic acid 

5 molecules in said sample and a nucleic acid molecule comprising said sequence selected 
from said group. Similarly, also preferred is the above method wherein said step of 
comparing sequences is performed by comparing the nucleotide sequence determined 
from a nucleic acid molecule in said sample with said sequence selected from said 
group. The nucleic acid molecules can comprise DNA molecules or RNA molecules. 

10 A further preferred embodiment is a method for identifying the species, tissue or 

cell type of a biological sample which method comprises a step of detecting nucleic acid 
molecules in said sample, if any, comprising a nucleotide sequence that is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein X is any 

15 integer as defined in Table 1 ; and a nucleotide sequence encoded by a human cDNA 

clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDNA clone in Table 1. 

The method for identifying the species, tissue or cell type of a biological sample 
can comprise a step of detecting nucleic acid molecules comprising a nucleotide 

20 sequence in a panel of at least two nucleotide sequences, wherein at least one sequence 
in said panel is at least 95% identical to a sequence of at least 50 contiguous nucleotides 
in a sequence selected from said group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal structure or expression of a gene encoding a secreted protein 

25 identified in Table 1 , which method comprises a step of detecting in a biological sample 
obtained from said subject nucleic acid molecules, if any, comprising a nucleotide 
sequence that is at least 95% identical to a sequence of at least 50 contiguous 
nucleotides in a sequence selected from the group consisting of: a nucleotide sequence 
of SEQ ID NO:X wherein X is any integer as defined in Table 1 ; and a nucleotide 

30 sequence encoded by a human cDNA clone identified by a cDNA Clone Identifier in 
Table I and contained in the deposit with the ATCC Deposit Number shown for said 
cDNA clone in Table I. 

The method for diagnosing a pathological condition can comprise a step of 
delecting nucleic acid molecules comprising a nucleotide sequence in a panel of at least 

35 two nucleotide sequences, wherein at least one sequence in said panel is at least 95% 
identical to a sequence of at least 50 contiguous nucleotides in a sequence selected from 
said group. 
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Also preferred is a composition of matter comprising isolated nucleic acid 
molecules wherein the nucleotide sequences of said nucleic acid molecules comprise a 
panel of at least two nucleotide sequences, wherein at least one sequence in said panel is 
at least 95% identical to a sequence of at least 50 contiguous nucleotides in a sequence 
selected from the group consisting of: a nucleotide sequence of SEQ ID NO:X wherein 
X is any integer as defined in Table 1; and a nucleotide sequence encoded by a human 
cDN A clone identified by a cDNA Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1 . The 
nucleic acid molecules can comprise DN A molecules or RNA molecules. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1. 

Also preferred is a polypeptide, wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of SEQ ID NO: Y in the range of positions 
beginning with the residue at about the position of the First Amino Acid of the Secreted 
Portion and ending with the residue at about the Last Amino Acid of the Open Reading 
Frame as set forth for SEQ ID NO: Y in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
amino acid sequence of SEQ ID NO: Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 95% identical to the complete amino acid sequence of SEQ ID NO:Y. 

Further preferred is an isolated polypeptide comprising an amino acid sequence 
at least 90% identical to a sequence of at least about 10 contiguous amino acids in the 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Also preferred is a polypeptide wherein said sequence of contiguous amino 
acids is included in the amino acid sequence of a secreted portion of the secreted protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 
contained in the deposit witii tiie ATCC Deposit Number shown for said cDN A clone in 
Table 1. 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence of at least about 30 contiguous amino acids in the 
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amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
5 least 95% identical to a sequence of at least about 100 contiguous amino acids in the 
amino acid sequence of the secreted portion of the protein encoded by a human cDNA 
clone identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with 
the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is an isolated polypeptide comprising an amino acid sequence at 
10 least 95% identical to the amino acid sequence of the secreted portion of the protein 
encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and 
contained in the deposit with the ATCC Deposit Number shown for said cDNA clone in 
Table 1. 

Further preferred is an isolated antibody which binds specifically to a 

1 5 polypeptide comprising an amino acid sequence that is at least 90% identical to a 

sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1 ; and a complete amino acid sequence of a protein encoded by a 
human cDNA clone identified by a cDN A Clone Identifier in Table 1 and contained in 

20 the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Further preferred is a method for detecting in a biological sample a polypeptide 
comprising an amino acid sequence which is at least 90% identical to a sequence of at 
least 10 contiguous amino acids in a sequence selected from the group consisting of: an 
amino acid sequence of SEQ ID NO: Y wherein Y is any integer as defined in Table 1 ; 

25 and a complete amino acid sequence of a protein encoded by a human cDN A clone 
identified by a cDNA Clone Identifier in Table 1 and contained in the deposit with the 
ATCC Deposit Number shown for said cDN A clone in Table 1 ; which method 
comprises a step of comparing an amino acid sequence of at least one polypeptide 
molecule in said sample with a sequence selected from said group and detemndning 

30 whether the sequence of said polypeptide molecule in said sample is at least 90% 
identical to said sequence of at least 10 contiguous amino acids. 

Also preferred is the above method wherein said step of comparing an amino 
acid sequence of at least one polypeptide molecule in said sample with a sequence 
selected from said group comprises determining the extent of specific binding of 

35 polypeptides in said sample to an antibody which binds specifically to a polypeptide 

comprising an amino acid sequence that is at least 90% identical to a sequence of at least 
10 contiguous amino acids in a sequence selected from the group consisting of: an 
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amino acid sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; 
and a complete amino acid sequence of a protein encoded by a human cDNA clone 
identified by a cDNA Clone Identifier in Table I and contained in the deposit with the 
ATCC Deposit Number shown for said cDNA clone in Table 1 . 

5 Also preferred is the above method wherein said step of comparing sequences is 

performed by comparing the amino acid sequence determined from a polypeptide 
molecule in said sample with said sequence selected from said group. 

Also preferred is a method for identifying the species, tissue or cell type of a 
biological sample which method comprises a step of detecting polypeptide molecules in 

10 said sample, if any, comprising an amino acid sequence that is at least 90% identical to 
a sequence of at least 10 contiguous amino acids in a sequence selected from the group 
consisting of: an amino acid sequence of SEQ ID NO: Y wherein Y is any integer as 
defined in Table 1; and a complete amino acid sequence of a secreted protein encoded 
by a human cDNA clone identified by a cDNA Clone Identifier in Table 1 and contained 

15 in the deposit with the ATCC Deposit Number shown for said cDNA clone in Table 1 . 

Also preferred is the above method for identifying the species, tissue or cell type 
of a biological sample, which method comprises a step of detecting polypeptide 
molecules comprising an amino acid sequence in a panel of at least two amino acid 
sequences, wherein at least one sequence in said panel is at least 90% identical to a 

20 sequence of at least 10 contiguous amino acids in a sequence selected from the above 
group. 

Also preferred is a method for diagnosing in a subject a pathological condition 
associated with abnormal stmcture or expression of a gene encoding a secreted protein 
identified in Table 1 , which method comprises a step of detecting in a biological sample 

25 obtained from said subject polypeptide molecules comprising an amino acid sequence in 
a panel of at least two amino acid sequences, wherein at least one sequence in said panel 
is at least 90% identical to a sequence of at least 10 contiguous amino acids in a 
sequence selected from the group consisting of: an amino acid sequence of SEQ ID 
NO:Y wherein Y is any integer as defined in Table 1; and a complete amino acid 

30 sequence of a secreted protein encoded by a human cDNA clone identified by a cDNA 
Clone Identifier in Table 1 and contained in the deposit with the ATCC Deposit Number 
shown for said cDNA clone in Table 1 . 

In any of these methods, the step of detecting said polypeptide molecules 
includes using an antibody. 

35 Also preferred is an isolated nucleic acid molecule comprising a nucleotide 

sequence which is at least 95% identical to a nucleotide sequence encoding a 
polypeptide wherein said polypeptide comprises an amino acid sequence that is at least 
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90% identical to a sequence of at least 10 contiguous amino acids in a sequence selected 
from the group consisting of: an amino acid sequence of SEQ ID NO:Y wherein Y is 
any integer as defined in Table 1 ; and a complete amino acid sequence of a secreted 
protein encoded by a human cDNA clone identified by a cDNA Clone Identifier in Table 
5 1 and contained in the deposit with the ATCC Deposit Number shown for said cDNA 
clone in Table 1. 

Also preferred is an isolated nucleic acid molecule, wherein said nucleotide 
sequence encoding a polypeptide has been optimized for expression of said polypeptide 
in a prokaryotic host. 

10 Also preferred is an isolated nucleic acid molecule, wherein said polypeptide 

comprises an amino acid sequence selected from the group consisting of: an amino acid 
sequence of SEQ ID NO:Y wherein Y is any integer as defined in Table 1; and a 
complete amino acid sequence of a secreted protein encoded by a human cDNA clone 
identified by a cDN A Clone Identifier in Table 1 and contained in the deposit with the 

1 5 ATCC Deposit Number shown for said cDN A clone in Table 1 . 

Further preferred is a method of making a recombinant vector comprising 
inserting any of the above isolated nucleic acid molecule into a vector. Also preferred is 
the recombinant vector produced by this method. Also preferred is a method of making 
a recombinant host cell comprising mtroducing the vector into a host cell, as well as the 

20 recombinant host cell produced by this method. 

Also preferred is a method of making an isolated polypeptide comprising 
culturing this recombinant host cell under conditions such that said polypeptide is 
expressed and recovering said polypeptide. Also preferred is this method of making an 
isolated polypeptide, wherein said recombinant host cell is a eukaryotic cell and said 

25 polypeptide is a secreted portion of a human secreted protein comprising an amino acid 
sequence selected from the group consisting of: an amino acid sequence of SEQ ID 
NO: Y beginning with the residue at the position of the First Amino Acid of the Secreted 
Portion of SEQ ID NO: Y wherein Y is an integer set forth in Table 1 and said position 
of the First Amino Acid of the Secreted Portion of SEQ ID NO: Y is defined in Table 1 ; 

30 and an amino acid sequence of a secreted portion of a protein encoded by a human 
cDN A clone identified by a cDN A Clone Identifier in Table 1 and contained in the 
deposit with the ATCC Deposit Number shown for said cDN A clone in Table 1 . The 
isolated polypeptide produced by this method is also preferred. 

Also preferred is a method of treatment of an individual in need of an increased 

35 level of a secreted protein activity, which method comprises administering to such an 
individual a pharmaceutical composition comprising an amount of an isolated 
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polypeptide, polynucleotide, or antibody of the claimed invention effective to increase 
the level of said protein activity in said uidividual. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of 
5 illustration and are not intended as limiting. 

Examples 

Fvam ple 1: Isolation of a Selec ted cDNA Clnne From the Deposited 
Sample 

10 Each cDN A clone in a cited ATCC deposit is contained in a plasmid vector. 

Table 1 identifies the vectors used to construct the cDN A library from which each clone 
was isolated. In many cases, the vector used to construct the library is a phage vector 
from which a plasmid has been excised. The table unmediately below correlates the 
related plasmid for each phage vector used in constmcting the cDNA library. For 

15 example, where a particular clone is identified in Table 1 as being isolated in the vector 
"Lambda Zap," the corresponding deposited clone is in "pBluescript." 



Vectors Lambda Zap (U.S. Patent Nos. 5,128,256 and 5,286,636), Uni-Zap 
XR (U.S. Patent Nos. 5,128, 256 and 5,286,636), Zap Express (U.S. Patent Nos. 
5,128,256 and 5,286,636), pBluescript (pBS) (Short, J. M. et al.. Nucleic Acids Res. 
16:7583-7600 (1988); Alting-Mees, M. A. and Short, J. M., Nucleic Acids Res. 

30 17:9494 ( 1989)) and pBK (Alting-Mees, M. A. et al.. Strategies 5:58-61 (1992)) are 
commercially available from Stratagene Cloning Systems, Inc., 1 101 1 N. Torrey Pines 
Road, La JoUa, CA, 92037. pBS contains an ampicillin resistance gene and pBK 
contains a neomycin resistance gene. Both can be transformed into E. coli strain XL-1 
Blue, also available from Stratagene. pBS comes in 4 forms SK+, SK-, KS+ and KS. 

35 The S and K refers to the orientation of the polylinker to the T7 and T3 primer 

sequences which flank the polylinker region ("S" is for Sad and "K" is for Kpnl which 
are the first sites on each respective end of the linker). or refer to the orientation 



20 



25 



Vector Used to Construct Library 

Lambda Zap 

Uni-Zap XR 

Zap Express 

lafmidBA 

pSportl 

pCMVSport2.0 
pCMVSporl 3.0 
pCR®2.1 



Correspondinp Deposit ed Plasmid 
pBluescript (pBS) 
pBluescript (pBS) 
pBK 

plafmid B A 
pSportl 

pCMVSport2.0 
pCMVSport 3.0 
pCR®2.1 
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of the f 1 origin of replication ("on"), such that in one orientation, single stranded rescue 
initiated from the f 1 ori generates sense strand DNA and in the other, antisense. 

Vectors pSportl, pCMVSport 2.0 and pCMVSport 3.0, were obtained from 
Life Technologies, Inc., P. O. Box 6009, Gaithersburg, MD 20897. All Sport vectors 
5 contain an ampicillin resistance gene and may be transformed into E. coli strain 

DH10B« also available from Life Technologies. (See, for instance, Gruber, C. E., et 
al.. Focus 15:59 (1993).) Vector lafmid BA (Bcnto Soares, Columbia University, NY) 
contains an ampicillin resistance gene and can be uransformed into E. coli strain XL-1 
Blue. Vector pCR®2.1, which is available from Invitrogen, 1600 Faraday Avenue, 

10 Carlsbad, CA 92008, contains an ampicillin resistance gene and may be transformed 
into E. coli strain DHIOB, available from Life Technologies. (See, for instance, Clark, 
J. M., Nuc. Acids Res. 16:9677-9686 (1988) and Mead, D. et al., Bio/Technology 9: 
(1991).) Preferably, a polynucleotide of the present invention does not comprise the 
phage vector sequences identified for the particular clone in Table 1, as well as the 

1 5 corresponding plasmid vector sequences designated above. 

The deposited material in the sample assigned the ATCC Deposit Number cited 
in Table 1 for any given cDN A clone also may contain one or more additional plasmids, 
each comprising a cDNA clone different from that given clone. Thus, deposits sharing 
the same ATCC £>eposit Number contain at least a plasmid for each cDNA clone 

20 identified in Table 1 . Typically, each ATCC deposit sample cited in Table 1 comprises 
a mixture of approximately equal amounts (by weight) of about 50 plasmid DNAs, each 
containing a different cDNA clone; but such a deposit sample may include plasmids for 
more or less than 50 cDN A clones, up to about 500 cDNA clones. 

Two approaches can be used to isolate a particular clone from the deposited 

25 sample of plasmid DNAs cited for that clone in Table 1 . First, a plasmid is directiy 

isolated by screening the clones using a polynucleotide probe corresponding to SEQ ID 
NO:X. 

Particularly, a specific polynucleotide with 30-40 nucleotides is synthesized 
using an Applied Biosystems DNA synthesizer according to the sequence reported. 

30 The oligonucleotide is labeled, for instance, with ^^P-y-ATP using T4 polynucleotide 

icinase and purified according to routine methods. (E.g.. Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring, NY (1982).) 
The plasmid mixture is transformed into a suitable host, as indicated above (such as 
XL-1 Blue (Stratagene)) using techniques known to those of skill in the art, such as 
35 those provided by the vector supplier or in related publications or patents cited above. 
The transformants are plated on 1 .5% agar plates (containing the appropriate selection 
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agent, e.g., ampiciliin) to a density of about 150 transformants (colonies) per plate. 
These plates are screened using Nylon membranes according to routine methods for 
bacterial colony screening (e.g^ Sambrook et al., Molecular Cloning: A Laboratory 
Manual, 2nd Edit., (1989), Cold Spring Harbor Laboratory Press, pages 1.93 to 

5 1 .104), or other techniques known to those of skill in the art. 

Alternatively, two primers of 17-20 nucleotides derived from both ends of the 
SEQ ID NO:X (i.e., within the region of SEQ ID NO:X bounded by the 5' NT and the 
3' NT of the clone defined in Table 1) are synthesized and used to amplify the desired 
cDN A using the deposited cDNA plasmid as a template. The polymerase chain reaction 

10 is carried out under routine conditions, for instance, in 25 |il of reaction mixture with 
0.5 ug of the above cDNA template. A convenient reaction mixture is 1.5-5 mM 
MgClj, 0.01% (w/v) gelatin, 20 flM each of dATP, dCTP, dGTP, dTTP, 25 pmol of 
each primer and 0.25 Unit of Taq polymerase. Thirty five cycles of PCR (denaturation 

at 94°C for 1 min; annealing at 55X for 1 min; elongation at 72°C for 1 min) are 

15 performed with a Perkin-Elmer Cetus automated thermal cycler. The amplified product 
is analyzed by agarose gel electrophoresis and the DN A band with expected molecular 
weight is excised and purified. The PCR product is verified to be the selected sequence 
by subcloning and sequencing the DNA product. 

Several methods are available for the identification of the 5' or 3' non-coding 

20 portions of a gene which may not be present in the deposited clone. These methods 
include but are not limited to, filter probing, clone enrichment using specific probes, 
and protocols similar or identical to 5' and 3' "RACE" protocols which are well known 
in the art. For instance, a method similar to 5' RACE is available for generating the 
missing 5' end of a desired full-length transcript. (Fromont-Racine et al.. Nucleic Acids 

25 Res. 21(7): 1683-1684 (1993).) 

Briefly, a specific RNA oligonucleotide is ligated to the 5' ends of a population 
of RNA presumably containing fiiU-length gene RNA u-anscripts. A primer set 
containing a primer specific to the ligated RNA oligonucleotide and a primer specific to 
a known sequence of the gene of interest is used to PCR amplify the 5' portion of the 

30 desired full-length gene. This amplified product may then be sequenced and used to 
generate the full length gene. 

This above method starts with total RNA isolated from the desired source, 
although poly-A+ RNA can be used. The RNA preparation can then be treated with 
phosphatase if necessary to eliminate 5' phosphate groups on degraded or damaged 

35 RNA which may interfere with the later RNA ligase step. The phosphatase should then 
be inactivated and the RNA treated with tobacco acid pyrophosphatase in order to 
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remove the cap structure present at the 5* ends of messenger RNAs. This reaction 
leaves a 5' phosphate group at the 5* end of the cap cleaved RNA which can then be 
ligated to an RNA oligonucleotide using T4 RNA ligase. 

This modified RNA preparation is used as a template for first strand cDNA 
5 synthesis using a gene specific oligonucleotide. The first strand synthesis reaction is 
used as a template for PCR amplification of the desired 5' end using a primer specific to 
the ligated RNA oligonucleotide and a primer specific to the known sequence of the 
gene of interest. The resultant product is then sequenced and analyzed to confirm that 
the 5' end sequence belongs to the desired gene. 

10 

Example 2: Isolation of Genomic Clones Correspo nding to a 
Polynucleotide 

A human genomic PI library (Genomic Systems, Inc.) is screened by PCR 
using primers selected for the cDNA sequence corresponding to SEQ ID NO:X., 
15 according to the method described in Example 1. (See also, Sambrook.) 

Example 3: Tissue Distribution of Polvpcptide 

Tissue distribution of mRNA expression of polynucleotides of the present 
invention is determined using protocols for Northern blot analysis, described by, 

20 among others, Sambrook et al. For example, a cDNA probe produced by the method 
described in Example 1 is labeled with P^^ using the rediprime^^ DNA labeling system 
(Amersham Life Science), according to manufacturer's instructions. After labeling, the 
probe is purified using CHROMA SPIN-lOO^^^ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT12(X)- 1 . The purified labeled probe is 

25 then used to examine various human tissues for mRNA expression. 

Multiple Tissue Northern (MTN) blots containing various human tissues (H) or 
human immune system tissues (IM) (Clontech) are examined with the labeled probe 
using ExpressHyb™ hybridization solution (Clontech) according to manufacturer's 
protocol number PTl 190-1 . Following hybridization and washing, the blots are 

30 mounted and exposed to film at -70**C overnight, and the films developed according to 

standard procedures. 

Example 4; Chromosomal Mapping of the Polynucleotides 

An oligonucleotide primer set is designed according to the sequence at the 5' 
35 end of SEQ ID NO:X. This primer preferably spans about 100 nucleotides. This 
primer set is then used in a polymerase chain reaction under the following set of 
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conditions : 30 seconds. 95°C; 1 minute, 56°C; 1 minute, 70«C. This cycle is repeated 
32 times followed by one 5 minute cycle at 70°C. Human, mouse, and hamster DNA 
is used as template in addition to a somatic cell hybrid panel containing individual 
chromosomes or chromosome fragments (Bios, Inc). The reactions is analyzed on 
5 either 8% polyacrylamide gels or 3.5 % agarose gels. Chromosome mapping is 

determined by the presence of an approximately 100 bp PGR fragment in the particular 
somatic cell hybrid. 

Fvam ple 5: Bacterial ExDr« »ssi9" «f a PolvPepUde 
10 A polynucleotide encoding a polypeptide of the present invention is amplified 

using PGR oligonucleotide primers corresponding to the 5' and 3' ends of the DNA 
sequence, as outiined in Example 1, to synthesize insertion fragments. The primers 
used to amplify the cDNA insert should preferably contain resuiction sites, such as 
BamHI and Xbal. at the 5' end of the primers in order to clone the amplified product 
1 5 into the expression vector. For example, BamHI and Xbal correspond to the restriction 
enzyme sites on the bacterial expression vector pQE-9. (Qiagen, Inc., Ghatsworth, 
CA). This plasmid vector encodes antibiotic resistance (AmpO. a bacterial origin of 
replication (ori), an IPTG-regulatable promoter/operator (P/O). a ribosome binding site 
(RBS), a 6-histidine tag (6-His), and restriction enzyme cloning sites. 
20 The pQE-9 vector is digested with BamHI and Xbal and the amplified fragment 

is Ugated into the pQE-9 vector maintaining the reading frame initiated at tiie bacterial 
RBS. The ligation mixture is then used to transform the E. coli strain M15/rep4 
(Qiagen, Inc.) which contains multiple copies of the plasmid pREP4, which expresses 
the laci repressor and also confers kanamycin resistance (KanO- Transformants are 
25 identified by their ability to grow on LB plates and ampicillin/kanamycin resistant 

colonies are selected. Plasmid DNA is isolated and confirmed by restriction analysis. 

Clones containing the desired constructs are grown overnight (O/N) in liquid 
culture in LB media supplemented with both Amp (100 ug/ml) and Kan (25 ug/ml). 
The O/N culture is used to inoculate a large culture at a ratio of 1:100 to 1:250. The 
30 cells are grown to an optical density 600 (O.D.*°<*) of between 0.4 and 0.6. IPTG 

(Isopropyl-B-D-thiogalacto pyranoside) is then added to a final concentration of 1 mM. 
IPTG induces by inactivating tiie lacI repressor, clearing tiie P/O leading to increased 
gene expression. 

Cells are grown for an extra 3 to 4 hours. Cells are then harvested by 
35 centrifugation (20 mins at 6000Xg). The cell pellet is solubilized in the chaotropic 
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agent 6 Molar Guanidine HCl by stirring for 3-4 hours at 4*^0. The cell debris is 

removed by centrifiigation, and the supernatant containing the polypeptide is loaded 
onto a nickel-nitrilo-tri-acetic acid ("Ni-NTA") affinity resin colunm (available from 
QIAGEN, Inc., supra). Proteins with a 6 x His tag bind to the Ni-NTA resin with high 
5 affinity and can be purified in a simple one-step procedure (for details see: The 
QIAexpressionist (1995) QIAGEN, Inc., supra). 

Briefly, the supernatant is loaded onto the column in 6 M guanidine-HCl, pH 8, 
the column is first washed with 10 volumes of 6 M guanidine-HCl, pH 8, then washed 
with 10 volumes of 6 M guanidine-HCl pH 6, and fmally the polypeptide is eluted with 

10 6 M guanidine-HCl, pH 5. 

The purified protein is then renatured by dialyzing it against phosphate-buffered 
saline (PBS) or 50 mM Na-acetate, pH 6 buffer plus 200 mM NaCL Alternatively, the 
protein can be successfully refolded while immobilized on the Ni-NTA column. The 
recommended conditions are as follows: renature using a linear 6M-1M urea gradient in 

15 500 mM NaCl, 20% glycerol, 20 mM Tris/HCl pH 7.4, containing protease inhibitors. 
The renaturation should be performed over a period of 1,5 hours or more. After 
renaturation the proteins are eluted by the addition of 250 mM immidazole. Immidazole 
is removed by a final dialyzing step against PBS or 50 mM sodium acetate pH 6 buffer 
plus 200 mM NaCl. The purified protein is stored at 4°C or frozen at -80° C. 

20 In addition to the above expression vector, the present invention further includes 

an expression vector comprising phage operator and promoter elements operatively 
linked to a polynucleotide of the present invention, called pHE4a. (ATCC Accession 
Number XXXXXX.) This vector contains: 1) aneomycinphosphotransferase gene as 
a selection marker, 2) an E. coli origin of replication, 3) a T5 phage promoter sequence, 

25 4) two lac operator sequences, 5) a Shine-Delgamo sequence, and 6) the lactose operon 
repressor gene (laclq). The origin of replication (oriC) is derived from pUC19 (LTI, 
Gaithersburg, MD). The promoter sequence and operator sequences are made 
synthetically. 

DNA can be inserted into the pHEa by restricting the vector with Ndel and 
30 Xbal, BamHI, Xhol, or Asp7 18, running the resuricted product on a gel, and isolating 
the larger fragment (the stuffer fragment should be about 310 base pairs). The DNA 
insert is generated according to the PCR protocol described in Example 1, using PCR 
primers having restriction sites for Ndel (5' primer) and Xbal, BamHI, Xhol, or 
Asp718 (3* primer). The PCR insert is gel purified and restricted with compatible 
35 enzymes. The insert and vector are ligated according to standard protocols. 
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The engineered vector could easily be substituted in the above protocol to 
express protein in a bacterial system. 

Ficam ple 6: Purification of a Polypeptide from an Inclusion Body 

5 The following alternative method can be used to purify a polypeptide expressed 

in E coli when it is present in the form of inclusion bodies. Unless otherwise specified, 

all of the following steps are conducted at 4-10°C. 

Upon completion of the production phase of the E. coli fermentation, the cell 
culture is cooled to 4-10*^0 and the cells harvested by continuous centrifugation at 
10 1 5,000 rpm (Heraeus Sepatech). On the basis of the expected yield of protein per unit 
weight of cell paste and the amount of purified protein required, an appropriate amount 
of cell paste, by weight, is suspended in a buffer solution containing 100 mM Tris, 50 
mM EDTA, pH 7.4. The cells are dispersed to a homogeneous suspension using a 
high shear mixer. 

15 The cells are then lysed by passing the solution through a microfluidizer 

(Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 psi. The homogenate is 
then mixed with NaCl solution to a final concentration of 0.5 M NaCl, followed by 
centrifugation at 7000 xg for 15 min. The resultant peUet is washed again using 0.5M 
NaCl, 100 mM Tris, 50 mM EDTA, pH 7.4. 

20 The resulting washed inclusion bodies are solubilized with 1 .5 M guanidine 

hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation for 15 min., the 
pellet is discarded and the polypeptide containing supernatant is incubated at 4°C 

overnight to allow further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble particles, 
25 tiie GuHCl solubilized protein is refolded by quickly mixing the GuHCl extract with 20 
volumes of buffer containing 50 mM sodium, pH 4.5, 150 mM NaCl, 2 mM EDTA by 
vigorous stirring. The refolded diluted protein solution is kept at 4°C without mixing 

for 12 hours prior to further purification steps. 

To clarify the refolded polypeptide solution, a previously prepared tangential 

30 filtration unit equipped with 0.16 \im membrane filter witii appropriate surface area 
(e.g., Filtron), equilibrated with 40 mM sodium acetate, pH 6.0 is employed. The 
filtered sample is loaded onto a cation exchange resin (e.g., Poros HS-50, Perseptive 
Biosystems). The column is washed with 40 mM sodium acetate, pH 6.0 and eluted 
with 250 mM, 500 mM, 1000 mM, and 1500 mM NaCl in the same buffer, in a 
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Stepwise manner. The absorbance at 280 nm of the effluent is continuously monitored. 
Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the polypeptide are then pooled and mixed with 4 volumes 
of water The diluted sample is then loaded onto a previously prepared set of tandem 
5 columns of strong anion (Poros HQ-50» Perseptive Biosystems) and weak anion 

(Poros CM-20, Perseptive Biosystems) exchange resins. The columns are equilibrated 
with 40 mM sodium acetate, pH 6.0. Both columns are washed with 40 mM sodium 
acetate, pH 6.0, 200 mM NaCl. The CM-20 column is then eiuted using a 10 column 
volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium acetate, pH 6.0 to 1.0 
10 M NaCl. 50 mM sodium acetate, pH 6.5. Fractions are collected under constant A^go 
monitoring of the effluent. Fractions containing the polypeptide (determined, for 
instance, by 16% SDS-PAGE) are then pooled. 

The resultant polypeptide should exhibit greater than 95% purity after the above 
refolding and purification steps. No major contaminant bands should be observed from 

15 Commassie blue stained 16% SDS-PAGE gel when 5 \ig of purified protein is loaded. 

The purified protein can also be tested for endotoxin/LPS contamination, and typically 
the LPS content is less than 0. 1 ng/ml according to LAL assays. 

Example 7: Cloning and Expression of a Polypeptide in a Baculovirus 

20 Expression System 

In this example, the plasmid shuttle vector pA2 is used to insert a polynucleotide 
into a baculovirus to express a polypeptide. This expression vector contains the strong 
polyhedrin promoter of the Autographa califomica nuclear polyhedrosis virus 
(AcMNPV) followed by convenient restriction sites such as BamHI, Xba I and 

25 Asp7 1 8. The polyadenylation site of the simian virus 40 ("SV40") is used for efficient 
polyadenylation. For easy selection of recombinant virus, the plasmid contains the 
beta-galactosidase gene from E, coli under control of a weak Drosophila promoter in the 
same orientation, followed by the polyadenylation signal of the polyhedrin gene. The 
inserted genes are flanked on both sides by viral sequences for cell-mediated 

30 homologous recombination with wild-type viral DNA to generate a viable virus that 
express the cloned polynucleotide. 

Many other baculovirus vectors can be used in place of the vector above, such 
as pAc373, pVL941, and pAcIM 1 , as one skilled in the art would readily appreciate, as 
long as the construct provides appropriately located signals for transcription, 

35 translation, secretion and the like, including a signal peptide and an in-frame AUG as 
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required. Such vectors are described, for instance, in Luckow et al.. Virology 170:31- 
39 (1989). 

Specifically, the cDNA sequence contained in the deposited clone, including the 
AUG initiation codon and the naturally associated leader sequence identified in Table 1, 
5 is amplified using the PCR protocol described in Example 1 . If the naturally occurring 
signal sequence is used to produce the secreted protein, the pA2 vector does not need a 
second signal peptide. Alternatively, the vector can be modified (pA2 GP) to include a 
baculovirus leader sequence, using the standard methods described in Summers et al., 
"A Manual of Methods for Baculovirus Vectors and Insect Cell Culture Procedures," 
10 Texas Agricultural Experimental Station Bulletin No. 1555 (1987). 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc.. La JoUa, Ca.). The fragment then is digested 
with appropriate restriction enzymes and again purified on a 1% agarose gel. 
The plasmid is digested with the corresponding restriction enzymes and 
1 5 optionally, can be depbosphorylated using calf intestinal phosphatase, using routine 
procedures known in the art. The DNA is then isolated from a 1% agarose gel using a 
commercially available kit ("Geneclean" BIO 101 Inc., La Jolla, Ca.). 

The fragment and the depbosphorylated plasmid are hgated together with T4 
DNA ligase. E. coli HB 101 or other suitable E. coli hosts such as XL-1 Blue 
20 (Stratagene Cloning Systems, La Jolla, CA) cells are transformed with the ligation 
mixture and spread on culture plates. Bacteria containing the plasmid are identified by 
digesting DNA from individual colonies and analyzing the digestion product by gel 
electrophoresis. The sequence of the cloned fragment is confirmed by DNA 
sequencing. 

25 Five ]ig of a plasmid containing the polynucleotide is co-transfected with 1 .0 \ig 

of a commercially available linearized baculovirus DNA ("BaculoGoldTM baculovirus 
DNA", Pharmingen, San Diego, CA), using the lipofection method described by 
Feigner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417 (1987). One Jig of 
BaculoGoldTM virus DNA and 5 ^lg of the plasmid are mixed in a sterile weU of a 
30 microtiter plate containing 50 nl of serum-free Grace's medium (Life Technologies 

Inc.. Gaithereburg, MD). Afterwards, 10 \l\ Lipofectin plus 90 Grace's medium are 
added, mixed and incubated for 15 minutes at room temperamre. Then the transfection 
mixture is added drop-wise to Sf9 insect cells (ATCC CRL 17 1 1) seeded in a 35 mm 
tissue culture plate with 1 ml Grace's medium without serum. The plate is then 
35 incubated for 5 hours at 27° C. The transfection solution is then removed from the plate 
and 1 ml of Grace's insect medium supplemented with 10% fetal calf seram is added. 
Cultivation is then continued at 27° C for four days. 
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After four days the supernatant is collected and a plaque assay is performed, as 
described by Sunmiers and Smith, supra. An agarose gel with "Blue Gal" (Life 
Technologies Inc., Gaithersburg) is used to allow easy identification and isolation of 
gal-expressing clones, which produce blue-stained plaques. (A detailed description of a 
5 "plaque assay" of diis type can also be found in the user's guide for insect cell culture 
and baculovirology distributed by Life Technologies Inc., Gaithersburg, page 9-10.) 
After appropriate incubation, blue stained plaques are picked with the tip of a 
micropipettor (e.g., Eppendorf). The agar containing the recombinant viruses is then 
resuspended in a microcentrifuge tube containing 2(X) ^1 of Grace's medium and the 
10 suspension containing the recombinant baculovirus is used to infect Sf9 cells seeded in 
35 mm dishes. Four days later the supematants of these culture dishes are harvested 
and then they are stored at 4^ C. 

To verify the expression of the polypeptide, Sf9 cells are grown in Grace's 
medium supplemented with 10% heat-inactivated FBS. The cells are infected with the 
15 recombinant baculovirus containing the polynucleotide at a multiplicity of infection 
("MOI") of about 2. If radiolabeled proteins are desired, 6 hours later the medium is 
removed and is replaced with SF9(X) 11 medium minus methionine and cysteine 
(available from Life Technologies Inc., Rockville, MD). After 42 hours, 5 jiCi of ^^S- 
methionine and 5 jxCi ^^S-cysteine (available from Amersham) are added. The cells are 
20 further incubated for 16 hours and then are harvested by centrifiigation. The proteins in 
the supernatant as well as the intracellular proteins are analyzed by SDS-PAGE 
followed by autoradiography (if radiolabeled). 

Microsequencing of the amino acid sequence of the £imino terminus of purified 
protein may be used to determine the amino terminal sequence of the produced protein. 

25 Example 8; Expression of a Polypeptide in Mammalian Cells 

The polypeptide of the present invention can be expressed in a manmudian cell. 
A typical mammalian expression vector contains a promoter element, which mediates 
the initiation of transcription of mRN A, a protein coding sequence, and signals required 
for the termination of transcription and polyadenylation of the transcript. Additional 

30 elements include enhancers, Kozak sequences and intervening sequences flanked by 
donor and acceptor sites for RNA splicing. Highly efficient transcription is achieved 
with the early and late promoters from SV40, the long terminal repeats (LTRs) from 
Retroviruses, e.g., RSV, HTLVI, HIVI and the early promoter of the cytomegalovirus 
(CMV). However, cellular elements can also be used (e.g., the human actin promoter). 

35 Suitable expression vectors for use in practicing the present invention include, 

for example, vectors such as pSVL and pMSG (Pharmacia, Uppsala, Sweden), 
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pRSVcat (ATCC 37152), pSV2dhfr (ATCC 37146), pBC12MI (ATCC 67109), 
pCMVSport 2.0, and pCMVSport 3.0. Mammalian host cells that could be used 
include, human Hela, 293, H9 and Jurkat cells, mouse NIH3T3 and C127 cells, Cos 1, 
Cos 7 and CVl, quail QCl-3 cells, mouse L cells and Chinese hamster ovary (CHO) 
5 cells. 

Altematively, the polypeptide can be expressed in stable cell lines containing the 
polynucleotide integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr, gpt, neomycin, hygromycin allows the identification and isolation 
of the transfected cells. 

10 The transfected gene can also be amplified to express large amounts of the 

encoded protein. The DHFR (dihydrofolate reductase) marker is useful in developing 
cell lines that carry several hundred or even several thousand copies of the gene of 
interest. (See, e.g., Alt, F. W., et al., J. Biol. Chem. 253:1357-1370 (1978); Hamlin, 
J. L. and Ma, C, Biochem. et Biophys. Acta, 1097:107-143 (1990); Page, M. J. and 

15 Sydenham, M. A., Biotechnology 9:64-68 (1991).) Another useful selection marker is 
the enzyme glutamine synthase (GS) (Murphy et al., Biochem J. 227:277-279 (1991); 
Bebbington et al., Bio/Technology 10:169-175 (1992). Using these markers, the 
mammalian cells are grown in selective medium and the cells with the highest resistance 
are selected. These cell lines contain the amplified gene(s) integrated into a 

20 chromosome. Chinese hamster ovary (CHO) and NSO cells are often used for the 
production of proteins. 

Derivatives of the plasmid pSV2-dhfr (ATCC Accession No. 37146), the 
expression vectors pC4 (ATCC Accession No. 209646) and pC6 (ATCC Accession 
No. 209647) contain the strong promoter (LTR) of the Rous Sarcoma Virus (CuIIen et 

25 al.. Molecular and Cellular Biology, 438-447 (March, 1985)) plus a fragment of the 
CMV-enhancer (Boshart et al.. Cell 41:521-530 (1985).) Multiple cloning sites, e.g., 
with the restriction enzyme cleavage sites BamHI, Xbal and Asp718, facilitate the 
cloning of the gene of interest. The vectors also contain the 3' intron, the 
polyadenylation and termination signal of the rat preproinsulin gene, and the mouse 

30 DHFR gene under control of the SV40 early promoter. 

Specifically, the plasmid pC6, for example, is digested with appropriate 
restriction enzymes and then dephosphorylated using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose gel. 

A polynucleotide of the present invention is ampUfied according to the protocol 

35 outlined in Example 1 . If the naturally occurring signal sequence is used to produce the 
secreted protein, the vector does not need a second signal peptide. Altematively, if the 
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naturally occurring signal sequence is not used, the vector can be modified to include a 
heterologous signal sequence. (See, e.g., WO 96/34891.) 

The amplified fragment is isolated from a 1% agarose gel using a commercially 
available kit ("Geneclean," BIO 101 Inc„ La Jolla, Ca.). The fragment then is digested 
5 with appropriate restriction enzymes and again purified on a 1% agarose gel. 

The amplified fragment is then digested with the same restriction enzyme and 
purified on a 1% agarose gel. The isolated fragment and the dephosphorylated vector 
are then ligated v^dth T4 DNA ligase. £. coli HB 101 or XL- 1 Blue cells are then 
transformed and bacteria are identified that contain the fragment inserted into plasmid 

10 pC6 using, for instance, restriction enzyme analysis. 

Chinese hamster ovary cells lacking an active DHFR gene is used for 
transfection. Five fig of the expression plasmid pC6 is cotransfected with 0.5 p.g of the 
plasmid pSVneo using lipofectin (Feigner et aL, supra). The plasmid pSV2-neo 
contains a dominant selectable marker, the neo gene from Tn5 encoding an enzyme that 

15 confers resistance to a group of antibiotics including G418. The cells are seeded in 
alpha minus MEM supplemented with 1 mg/ml G418. After 2 days, the cells are 
trypsinized and seeded in hybridoma cloning plates (Greiner, Germany) in alpha minus 
MEM supplemented with 10, 25, or 50 ng/ml of metothrexate plus 1 mg/ml G418. 
After about 10-14 days single clones are trypsinized and then seeded in 6-well petri 

20 dishes or 10 ml flasks using different concentrations of methotrexate (50 nM, 100 nM, 
200 nM, 400 nM, 800 nM). Clones growing at the highest concentrations of 
methotrexate are then transferred to new 6-well plates containing even higher 
concentrations of methotrexate (1 |iM, 2 |iM, 5 fxM, 10 mM, 20 mM). The same 
procedure is repeated until clones are obtained which grow at a concentration of 100 - 

25 200 }iM. Expression of the desired gene product is analyzed, for instance, by SDS- 
PAGE and Western blot or by reversed phase HPLC analysis. 

Example 9; Protein Fujsions 

The polypeptides of the present invention are preferably fused to other proteins. 

30 These fusion proteins can be used for a variety of applications. For example, fusion of 
the present polypeptides to His-lag, HA-tag, protein A, IgG domains, and maltose 
binding protein facilitates purification. (See Example 5; see also EP A 394,827; 
Traunecker, et al.. Nature 331:84-86 (1988).) Similarly, fusion to IgG-1, IgG-3, and 
albumin increases the halflife time in vivo. Nuclear localization signals fused to the 

35 polypeptides of the present invention can target the protein to a specific subcellular 
localization, while covalent heterodimer or homodimers can increase or decrease the 
activity of a fusion protein. Fusion proteins can also create chimeric molecules having 
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more than one ftinclion. Finally, fusion proteins can increase solubility and/or stability 
of the fused protein compared to the non-fused protein. All of the types of fusion 
proteins described above can be made by modifying the following protocol, which 
outlines the fusion of a polypeptide to an IgG molecule, or the protocol described in 
5 Example 5. 

Briefly, the human Fc portion of the IgG molecule can be PGR amplified, using 
primers that span the 5' and 3' ends of the sequence described below. These primers 
also should have convenient restriction enzyme sites that will facilitate cloning into an 
expression vector, preferably a mammalian expression vector. 

10 For example, if pC4 (Accession No.209646) is used, the human Fc portion can 

be ligated into the BamHI cloning site. Note that the 3' BamHI site should be 
destroyed. Next, the vector containing the human Fc portion is re-restricted with 
BamHI, linearizing the vector, and a polynucleotide of the present invention, isolated 
by the PGR protocol described in Example 1 , is ligated into this BamHI site. Note that 

15 the polynucleotide is cloned without a stop codon, otherwise a fusion protein will not 
be produced. 

If the naturally occurring signal sequence is used to produce the secreted 
protein, pC4 does not need a second signal peptide. Alternatively, if the naturally 
occurring signal sequence is not used, the vector can be modified to include a 
20 heterologous signal sequence. (See, e.g.. WO 96/3489 1 .) 

Human IgG Fc region: 

GGGATCCGGAGCCCAAATCTTCTGACAAAACTCACACATGCCCACCGTGCC 
CAGCACCTGAATTCGAGGGTGCACCGTCAGTCTTCCTCTTCCCCCCAAAACC 

25 CAAGGACAGCCTCATGATCTCCCGGACTCCTGAGGTCACATGCGTGGTGGT 
GGACGTAAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGAGG 
GCGTGGAGGTGCATAATGCCAAGAGAAAGCCGCGGGAGGAGCAGTACAAC 
AGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 
AATGGCAAGGAGTAGAAGTGCAACKJTCTCCAACAAAGCCCTCCCAACCCCC 

30 ATCGAGAAAAGCATCrCCAAAGCCAAAGGGCAGCGCGGAGAACCACAGGT 
GTACACCCTGGCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGGCT 
GACCTGCCTGGTCAAAGGCrTCTATCCAAGCGACATCGCCGTGGAGTGGGA 
GAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTGCCGTGCTGG 
ACTCCGACGGCrCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCA 

35 GGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGC 
ACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGAGTGC 
GACGGCCGCGACTCTAGAGGAT (SEQ ID NO: 1) 
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Example 10: Production of an Antibody from a Polypeptide 

The antibodies of the present invention can be prepared by a variety of methods. 
(See, Current Protocols, Chapter 2.) For example, cells expressing a polypeptide of 

5 the present invendon is administered to an animal to induce the production of sera 
containing polyclonal antibodies. In a preferred method, a preparation of the secreted 
protein is prepared and purified to render it substantially free of natural contaminants. 
Such a preparation is then introduced into an animal in order to produce polyclonal 
antisera of greater specific acdvity. 

10 In the most preferred method, the antibodies of the present invention are 

monoclonal antibodies (or protein binding fragments thereof). Such monoclonal 
antibodies can be prepared using hybridoma technology. (Kohler et al.. Nature 
256:495 (1975); Kohler et al., Eur. J. Immunol. 6:51 1 (1976); Kohler et al., Eur. J. 
Immunol. 6:292 ( 1976); Hammerling et al., in: Monoclonal Anubodies and T-Cell 

15 Hybridomas, Elsevier, N.Y., pp. 563-681 (1981).) In general, such procedures 
involve immunizing an animal (preferably a mouse) with polypeptide or, more 
preferably, with a secreted polypeptide-expressing cell. Such cells may be cultured in 
any suitable tissue culture medium; however, it is preferable to culture cells in Earle*s 
modified Eagle's medium supplemented with 10% fetal bovine serum (inactivated at 

20 about S&'C), and supplemented with about 10 g/1 of nonessential amino acids, about 

1 ,000 U/ml of penicillin, and about 100 jig/ml of streptomycin. 

The splenocytes of such mice are extracted and fused with a suitable myeloma 
cell line. Any suitable myeloma cell line may be employed in accordance with the 
present invention; however, it is preferable to employ the parent myeloma cell line 

25 (SP20), available from the ATCC. After fusion, the resulting hybridoma cells are 
selectively maintained in HAT medium, and then cloned by limiting dilution as 
described by Wands et al. (Gastroenterology 80:225-232 (1981).) The hybridoma cells 
obtained through such a selection are then assayed to identify clones which secrete 
antibodies capable of binding the polypeptide. 

30 Alternatively, additional andbodies capable of binding to the polypeptide can be 

produced in a two-step procedure using anU-idiotypic antibodies. Such a method 
makes use of the fact that antibodies are themselves antigens, and therefore, it is 
possible to obtain an antibody which binds to a second antibody. In accordance with 
this method, protein specific antibodies are used to immunize an animal, preferably a 

35 mouse. The splenocytes of such an animal are then used to produce hybridoma cells, 
and the hybridoma cells are screened to identify clones which produce an antibody 



wo 98^39448 PCTAJS98/04493 

214 

whose ability to bind to the protein-specific antibody can be blocked by the polypeptide. 
Such antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and 
can be used to immunize an animal to induce formation of further protein-specific 
antibodies. 

5 It will be appreciated that Fab and F(ab')2 and other fragments of the antibodies 

of the present invention may be used according to the methods disclosed herein. Such 
fragments are typically produced by proteolytic cleavage, using enzymes such as papain 
(to produce Fab fragments) or pepsin (to produce F(ab')2 fragments). Alternatively, 
secreted protein-binding fragments can be produced through the application of 

10 recombinant DNA technology or through synthetic chemistry. 

For in vivo use of antibodies in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be produced using 
genetic constructs derived from hybridoma cells producing the monoclonal antibodies 
described above. Methods for producing chimeric antibodies are known in the art. 

15 (See, for review, Morrison. Science 229: 1202 (1985); Oi et al., BioTechniques 4:214 
(1986); Cabilly et al., U.S. Patent No. 4,816,567; Taniguchi et al., EP 171496; 
Morrison et al., EP 173494; Neuberger et al., WO 8601533; Robinson et al., WO 
8702671; Boulianne et al.. Nature 312:643 (1984); Neuberger et al.. Nature 314:268 
(1985).) 

20 

Example 11: Product ion Of Secreted Protein For Hich-ThrPPghput 
Screening Assays 

The following protocol produces a supernatant containing a polypeptide to be 
tested. This supernatant can then be used in the Screening Assays described in 
25 Examples 13-20. 

First, dilute Poly-D-Lysine (644 587 Boehringer-Mannheim) stock solution 
(Img/mlinPBS) 1:20 in PBS (w/o calcium or magnesium 17-516FBiowhittaker)for a 
working solution of 50ug/ml. Add 200 ul of this solution to each well (24 well plates) 
and incubate at RT for 20 minutes. Be sure to distribute the solution over each well 
30 (note: a 12-channel pipetter may be used with tips on every other channel). Aspirate off 
the Poly-D-Lysine solution and rinse with 1ml PBS (Phosphate Buffered Saline). The 
PBS should remain in the well until just prior to plating the cells and plates may be 
poly-lysine coated in advance for up to two weeks. 

Plate 293T cells (do not carry cells past P+20) at 2 x 10^ cells/well in .5ml 
35 DMEM(Dulbecco's Modified Eagle Medium)(with 4.5 G/L glucose and L-glutamine 
(12-604F Biowhittaker))/10% heat inactivated FBS(14-503F Biowhitlaker)/lx 
Penstrep( 17-602E Biowhittaker). Let the cells grow overnight. 
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The next day, mix together in a sterile solution basin: 300 ul Lipofectamine 
(18324-012 Gibco/BRL) and Sml Optimem I (31985070 Gibco/BRL)/96-well plate. 
With a small volume multi-channel pipetter, aliquot approximately 2ug of an expression 
vector containing a polynucleotide insert, produced by the methods described in 
5 Examples 8 or 9, into an appropriately labeled 96-well round bottom plate. With a 
multi-channel pipetter, add 50ul of the Lipofectamine/Optimem I mixture to each well. 
Pipette up and down gently to mix. Incubate at RT 15-45 minutes. After about 20 
minutes, use a multi-channel pipetter to add 150ul Optimem I to each well. As a 
control, one plate of vector DNA lacking an insert should be transfected with each set of 
10 transfections. 

Preferably, the transfection should be performed by tag-teaming the following 
tasks. By tag-teaming, hands on time is cut in half, and the cells do not spend too 
much time on PBS. First, person A aspirates off the media from four 24-well plates of 
cells, and then person B rinses each well with .5- 1ml PBS. Person A then aspirates off 
15 PBS rinse, and person B, using al2-channel pipetter with tips on every other channel, 
adds the 2(X)ul of DNA/Lipofectamine/Optimem I complex to the odd wells first, then to 

the even wells, to each row on the 24-well plates. Incubate at 3TC for 6 hours. 

While cells are incubating, prepare appropriate media, either 1%BSA in DMEM 
with Ix penstrep, or CHO-5 media (see below) with 2mm glutamine and Ix penstrep. 
20 (BSA (8 1-068-3 Bayer) lOOgm dissolved in IL DMEM for a 10% BSA stock 

solution). Filter the media and collect 50 ul for endotoxin assay in 15mJ polystyrene 
conical. 

The u-ansfection reaction is terminated, preferably by tag-teaming, at the end of 
the incubation period. Person A aspirates off the transfection media, while person B 

25 adds 1 .5ml appropriate media to each well. Incubate at 3TC for 45 or 72 hours 

depending on the media used: 1 %BSA for 45 hours or CHO-5 for 72 hours. 

On day four, using a 3(X)ul multichannel pipetter, aliquot 6(X)ul in one 1ml deep 
well plate and the remaining supernatant into a 2ml deep well. The supematants from 
each well can then be used in the assays described in Examples 13-20. 

30 It is specifically understood that when activity is obtained in any of the assays 

described below using a supernatant, the activity originates from either the polypeptide 
directly (e.g., as a secreted protein) or by the polypeptide inducing expression of other 
proteins, which are then secreted into the supernatant. Thus, the invention further 
provides a method of identifying the protein in the supernatant characterized by an 

35 activity in a particular assay. 
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HGS'CHO^S medium formulation: 
Inorganic Salts 



CaC12 (anhyd) 


116.6mg/L 


CuS0,-5H,0 


0.00130 


Fe(N0,),-9H,0 


0.050 


FeS04-7H,0 


0.417 


KCl 


311.80 


MgCl, 


28.64 


MgSO, 


48.84 


NaCl 


6995.50 


NaHCO, 


2400.0 


NaH,PO,-H,0 


62.50 


Na,HP04 


71.02 


ZnSO,-7H,0 


.4320 



S Lipids 



Arachidonic Acid 


.(X)2mg/L 


Cholesterol 


1.022 


DL-alpha- 
Tocopherol-Acetate 


.070 


LinoleicAcid 


0.0520 


Linolenic Acid 


0.010 


Myristic Acid 


0.010 


Oleic Acid 


0.010 


Palmitric Acid 


0.010 


Palmitic Acid 


0.010 


Pluronic F-68 


100 


Stearic Acid 


0.010 


Tween 80 


2.20 


Carbon Source 


1 D-Glucose 1 4551 mg/L | 


Amino Acids 


L- Alanine 


130.85 mg/ml 


L-Arginine-HCL 


147.50 


L-Asparagine-H,0 


7.50 


L-Aspartic Acid 


6.^5 


L.Cystine-2HCL- 
H,0 


29.56 


L-Cystine-2HCL 


31.29 


L-Glutamic Acid 


7.33 


L-Glulamine 


365.0 


Glycine 


18.75 


L-Histidine-HCL- 


52.48 
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H,0 




L-Isoleucine 


106.97 


L-Leucine 


111.45 


L-Lysine HCL 


163.75 


L-Methionine 


32.34 


L-Phenylalainine 


68.48 


L-Proline 


40.0 


L-Serine 


26.25 


L-Threonine 


101.05 


L-Tryptophan 


19.22 


L-Tryrosine-2Na- 
2H,0 


91.79 


L-Valine 


99.65 



Vitamins 



Biotin 


0.0035 mg/L 


D-Ca Pantothenate 


3.24 


Choline Chloride 


1L78 


Folic Acid 


4.65 


i-Inositol 


15.60 


Niacinamide 


3.02 


Pyridoxal HCL 


3.00 


Pyridoxine HCL 


0.031 


Riboflavin 


0.319 


Thiamine HCL 


3.17 


Thymidine 


0.365 


Vitamin B,, 


0.680 



Other Components 



HEPES Buffer 


25 mM 


Na Hypoxanthine 


2.39 mg/L 


Lipoic Acid 


0.105 


Sodium Putrescine-2HCL 


0.081 


Sodium Pyruvate 


55.0 


Sodium Selenite 


0.0067 


Ethanolamine 


20uM 


Ferric Citrate 


0.122 


Meihyl-B-Cyclodextrin complexed with 
Linoleic Acid 


41.70 


Methyl-B-Cyclodextrin complexed with 
Oleic Acid 


33.33 


Methyl-B-Cyclodextrin complexed with 
Retinal Acetate 


10 



5 



Adjust osmolarity to 327 mOsm 
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Example 12: Construction of GAS Reporter Construct 

One signal transduction pathway involved in the differentiation and proliferation 
of cells is called the Jaks-STATs pathway. Activated proteins in the Jaks-STATs . 
pathway bind to gamma activation site ''GAS'* elements or interferon-sensitive 

5 responsive element ("ISRE"), located in the promoter of many genes. The binding of a 
protein to these elements alter the expression of the associated gene. 

GAS and ISRE elements are recognized by a class of transcription factors called 
Signal Transducers and Activators of Transcription, or "STATs." There are six 
members of the STATs family. Statl and Stat3 are present in many cell types, as is 

10 Stat2 (as response to IFN-alpha is widespread). Stat4 is more restricted and is not in 
many cell types though it has been found in T helper class I, cells after treatment with 
IL- 12. StatS was originally called manunary growth factor, but has been found at 
higher concentrations in other cells including myeloid cells. It can be activated in tissue 
culture cells by many cytokines. 

15 The STATs are activated to translocate from the cytoplasm to the nucleus upon 

tyrosine phosphorylation by a set of kinases known as the Janus Kinase ("Jaks") 
family. Jaks represent a distinct family of soluble tyrosine kinases and include Tyk2, 
Jakl, Jak2, and Jak3. These kinases display significant sequence similarity and are 
generally catalytically inactive in resting cells. 

20 The Jaks are activated by a wide range of receptors summarized in the Table 

below. (Adapted from review by Schidler and Darnell, Ann. Rev. Biochem. 64:621-51 
(1995).) A cytokine receptor family, capable of activating Jaks, is divided into two 
groups: (a) Class 1 includes receptors for IL-2, IL-3, IL-4, IL-6, IL-7, IL-9, IL-1 1, IL- 
12, IL-15, Epo, PRL, GH, G-CSF, GM-CSF, LIF, CNTF, and thrombopoietin; and 

25 (b) Class 2 includes IFN-a, IFN-g, and IL-10. The Class 1 receptors share a 

conserved cysteine motif (a set of four conserved cysteines and one tryptophan) and a 
WSXWS motif (a membrane proxial region encoding Trp-Ser-Xxx-Trp-Ser (SEQ ID 
N0:2)). 

Thus, on binding of a ligand to a receptor, Jaks are activated, which in turn 
30 activate STATs, which then translocate and bind to GAS elements. This entire process 
is encompassed in the Jaks-STATs signal transduction pathway. 

Therefore, activation of the Jaks-STATs pathway, reflected by the binding of 
the GAS or the ISRE element, can be used to indicate proteins involved in the 
proliferation and differentiation of cells. For example, growth factors and cytokines are 
35 known to activate the Jaks-STATs pathway. (See Table below.) Thus, by using GAS 
elements linked to reporter molecules, activators of the Jaks-STATs pathway can be 
identified. 
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JAKs 






STATS 


GASCelements) or 




ISRE 
















Ligand 


tyk2 


Jakl 


Jak2 


Jak3 






5 


IFN familv 
















IFN-a/B 


+ 


+ 




- 


1,2,3 


ISRE 




IFN-g 






+ 


- 


1 


GAS 




(IRFl>Lys6>IFP) 
















11-10 


+ 


7 


7 


- 


1,3 




10 


















gpl30 familv 
















IL-6 (Pleiotrohic) 




+ 


+ 


7 


1,3 


GAS 




(IRFl>Lys6>IFP) 
















11- 11 (Pleiotrohic) 


9 


+ 


7 


7 


1,3 




15 


OnM(Pleiotrohic) 


7 


+ 


+ 


7 


1,3 






LIF(Pleiotrohic) 


7 


+ 




7 


1,3 






CNTF(Pleiotrohic) 


-/+ 


+ 


+ 


7 


1,3 






G-CSF(Pleiotrohic) 


7 


+ 


7 


7 


1,3 






IL- 1 2CPleiotrohic'i 








4. 


1 3 




20 


















g-C familv 
















lL-2 (lymphocytes) 


- 


+ 


- 


+ 


1,3,5 


GAS 




IL-4 (lymph/myeloid) 


- 


+ 


- 


+ 


6 


GAS (IRFl = IFP 




»Ly6){IgH) 














25 


IL-7 (lymphocytes) 


- 


+ 


- 


+ 


5 


GAS 




IL-9 (lymphocytes) 




+ 


- 




5 


GAS 




IL-13 (lymphocyte) 




+ 


7 


7 
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To construct a synthetic GAS containing promoter element, which is used in the 
Biological Assays described in Examples 13-14, a PCR based strategy is employed to 
generate a GAS-SV40 promoter sequence. The 5' primer contains four tandem copies 
of the GAS binding site found in the IRFl promoter and previously demonstrated to 
5 bind STATs upon induction with a range of cytokines (Rothman et al.. Immunity 

1 :457-468 (1994).). although other GAS or ISRE elements can be used instead. The 5* 
primer also contains 18bp of sequence complementary to the SV40 early promoter 
sequence and is flanked with an Xhol site. The sequence of the 5' primer is: 
S^GCGCCTCGAGATTTCCCCGAAATCTAGATITCCCCGAAATGATTTCCCCG 
10 AAATGATTTCCCCGAAATATCTGCCATCTCAArrAG:3' (SEQIDNO:3) 

The downstream primer is complementary to the SV40 promoter and is flanked 
with a Hind in site: 5^GCGGCAAGCnTnTGCAAAGCCTAGGC:3* (SEQ ID 
NO:4) 

PCR amplification is performed using the SV40 promoter template present in 
15 the B-gal:promoter plasmid obtained from Clontech. The resulting PCR fragment is 
digested with Xhol/Hind III and subcloned into BLSK2-. (Stratagene.) Sequencing 
with forward and reverse primers confirms that the insert contains the following 
sequence: 

5':CreGAeATrrCCCCGAAATCrAGATrTCCCCGAAATGATITCCCCGAAATG 
20 ATrTCCCCGAAATATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCC 
CTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATrCTCCGC 
CCCATGCCTGACrAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGC 
CrCTGAGCTATrCCAGAAGTAGTGAGGAGGCTTTTTTGGAGGCCrAGGCrr^ 
TGCAAAAA(i£rT:3* {SEQIDNO:5) 

25 

With this GAS promoter element linked to the SV40 promoter, a GAS:SEAP2 
reporter construct is next engineered. Here, the reporter molecule is a secreted alkaline 
phosphatase, or **SEAP." Clearly, however, any reporter molecule can be instead of 
SEAP, in this or in any of the other Examples. Well known reporter molecules that can 

30 be used instead of SEAP include chloramphenicol acetyltransferase (CAT), luciferase, 
alkaline phosphatase, B-galactosidase, green fluorescent protein (GFP), or any protein 
detectable by an antibody. 

The above sequence confirmed synthetic GAS-SV40 promoter element is 
subcloned into the pSEAP-Promoter vector obtained from Clontech using Hindlll and 

35 Xhol, effectively replacing the S V40 promoter with the amplified GAS:S V40 promoter 
element, to create the GAS-SEAP vector. However, this vector does not contain a 
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neomycin resistance gene, and therefore, is not preferred for nuunmalian expression 
systems. 

Thus, in order to generate mammalian stable cell lines expressing the GAS- 
SEAP reporter, the GAS-SEAP cassette is removed from the GAS-SEAP vector using 
5 Sail and NotI, and inserted into a backbone vector containing the neomycin resistance 
gene, such as pGFP-1 (Clontech), using these restriction sites in the multiple cloning 
site, to create the GAS-SEAP/Neo vector. Once this vector is transfected into 
mammalian cells, this vector can then be used as a reporter molecule for GAS binding 
as described in Examples 13*14. 

10 Other constructs can be made using the above description and replacing GAS 

with a different promoter sequence. For example, construction of reporter molecules 
containing NFK-B and EGR promoter sequences are described in Examples 15 and 16. 
However, many other promoters can be substituted using the protocols described in 
these Examples. For instance, SRE, IL-2, NFAT, or Osteocalcin promoters can be 

15 substituted, alone or in combination (e.g., GAS/NF-KB/EGR, GAS/NF-KB, D- 
2/NFAT, or NF-KB/GAS). Similarly, other cell lines can be used to test reporter 
construct activity, such as HELA (epithelial), HUVEC (endothelial), Reh (B-cell), 
Saos-2 (osteoblast), HUVAC (aortic), or Cardiomyocyte. 

20 Example 13; Hieh-Thrp^gliput S^r^^^ffigg Assay for T-cell Activity. 

The following protocol is used to assess T-cell activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate T-cells. T- 
cell activity is assessed using the GAS/SEAP/Neo construct produced in Example 12. 
Thus, factors that increase SEAP activity indicate the ability to activate the Jaks-STATS 

25 signal transduction pathway. The T-cell used in this assay is Jurkat T-cells ( ATCC 

Accession No. TIB-152), although Molt-3 cells (ATCC Accession No. CRL-1552) and 
Moll-4 cells (ATCC Accession No. CRL-1582) cells can also be used. 

Jurkat T-cells are lymphoblastic CD4+ Thl helper cells. In order to generate 
stable cell lines, approximately 2 million Jurkat cells are transfected with the GAS- 

30 SEAP/neo vector using DMRIE-C (Life Technologies)(transfection procedure 
described below). The transfected cells are seeded to a density of approximately 
20,000 cells per well and iransfectants resistant to 1 mg/ml genticin selected. Resistant 
colonies are expanded and then tested for their response to increasing concentrations of 
interferon gamma. The dose response of a selected clone is demonstrated. 

35 Specifically, the following protocol will yield sufficient cells for 75 wells 

containing 200 ul of cells. Thus, it is either scaled up, or performed in multiple to 
generate sufficient cells for multiple 96 well plates. Jurkat cells are maintained in RPMI 
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+ 10% serum with l%Pen-Strep. Combine 2.5 mis of OPTI-MEM (Life Technologies) 
with 10 ug of plasmid DNA in a T25 flask. Add 2.5 ml OPTI-MEM containing 50 ul 
of DMRIE-C and incubate at room temperature for 15-45 mins. 

During the incubation period, count cell concentration, spin down the required 
5 number of cells (10^ per transfection), and resuspend in OPTI-MEM to a final 

concentration of 10^ cells/ml. Then add 1ml of 1 x 10^ cells in OPTI-MEM to T25 flask 
and incubate at 3TC for 6 hrs. After the incubation, add 10 ml of RPMI + 15% serum. 

The Jurkat:GAS-SEAP stable reporter lines are maintained in RPMI + 10% 
serum, 1 mg/ml Genticm, and 1% Pen-Suiep. These cells are treated with supematants 
1 0 containing a polypeptide as produced by the protocol described in Example 1 1 . 

On the day of U:eatment with the supernatant, the cells should be washed and 
resuspended in fresh RPMI + 10% serum to a density of 500,000 cells per ml. The 
exact number of cells required will depend on the number of supematants being 
screened. For one 96 well plate, approximately 10 million cells (for 10 plates, 100 
15 million cells) are required. 

Transfer the cells to a triangular reservoir boat, in order to dispense the cells into 
a 96 well dish, using a 12 channel pipette. Using a 12 channel pipette, U-ansfer 200 ul 
of cells into each well (therefore adding 100, 000 cells per well). 

After all the plates have been seeded, 50 ul of the supematants are transferred 
20 duectly from the 96 well plate containing the supematants into each well using a 12 
channel pipette. In addition, a dose of exogenous interferon ganuna (0.1, 1.0, 10 ng) 
is added to wells H9, H 10, and HI 1 to serve as additional positive controls for the 
assay. 

The 96 well dishes containing Jurkat cells treated with supematants are placed in 
25 an incubator for 48 hrs (note: this time is variable between 48-72 hrs). 35 ul samples 
from each well are then transferred to ah opaque 96 well plate using a 12 channel 
pipette. The opaque plates should be covered (using sellophene covers) and stored at - 
20^C until SEAP assays are performed according to Example 17. The plates 
containing the remaining treated cells are placed at 4^0 and serve as a source of material 
30 for repeating the assay on a specific well if desired. 

As a positive control, 100 Unit/ml interferon gamma can be used which is 
known to activate Jurkat T cells. Over 30 fold induction is typically observed in the 
positive control wells. 
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Example 14: High-Throughput Screening Assay Identifying Myeloid 
Activity 

The following protocol is used to assess myeloid activity by identifying factors, 
such as growth factors and cytokines, that may proliferate or differentiate myeloid cells. 
5 Myeloid cell activity is assessed using the GAS/SEAP/Neo construct produced in 

Example 12. Thus, factors that increase SEAP activity indicate the ability to activate the 
Jaks-STATS signal transduction pathway. The myeloid cell used in this assay is U937, 
a pre-monocyte cell line, although TF-1, HL60, or KGl can be used. 

To transiently iransfect U937 cells with the GAS/SEAP/Neo constnict produced 
10 in Example 12, a DE/VE-Dextran method (Kharbanda et. al., 1994, Cell Growth & 
Differentiation, 5:259-265) is used. First, harvest 2xl0e7 U937 cells and wash with 
PBS. The U937 cells are usually grown in RPMI 1640 medium containing 10% heat- 
inactivated fetal bovine serum (FBS) supplemented with 100 units/ml penicillin and 100 
mg/ml streptomycin. 

15 Next, suspend the cells in 1 ml of 20 mM Tris-HCl (pH 7.4) buffer containing 

0.5 mg/ml DEAE-Dextran. 8 ug GAS-SEAP2 plasmid DNA, 140 mM NaCl, 5 mM 
KCl, 375 uM Na2HP04.7H20, 1 mM MgCl2, and 675 uM CaCl2. Incubate at 37^0 
for 45 min. 

Wash the cells with RPMI 1640 medium containing 10% FBS and then 
20 resuspend in 10 ml complete medium and incubate at 37^C for 36 hr. 

The GAS-SEAP/U937 stable cells are obtained by growing the cells in 400 
ug/ml G418. The G418-free medium is used for routine growth but every one to two 
months, the cells should be re-grown in 400 ug/ml G418 for couple of passages. 

These cells are tested by harvesting 1x10* cells (this is enough for ten 96- well 
25 plates assay) and wash with PBS. Suspend the cells in 200 ml above described growth 
medium, with a final density of 5x10* cells/ml. Plate 200 ul cells per well in the 96- 
well plate (or 1x10* cells/well). 

Add 50 ul of the supernatant prepared by the protocol described in Example 1 1 . 
Incubate at 31^C for 48 to 72 hn As a positive control, 100 Unit/ml interferon gamma 
30 can be used which is known to activate U937 cells. Over 30 fold induction is typically 
observed in the positive control wells. SEAP assay the supernatant according to the 
protocol described in Example 17. 
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Example IS; High-Throughput .Sc reening Assay Identifying Neuronal 
Activity. 

When cells undergo differentiation and proliferation, a group of genes are 
activated through many different signal transduction patfiways. One of these genes, 
5 EGRl (early growth response gene 1), is induced in various tissues and cell types upon 
activation. The promoter of EGRl is responsible for such induction. Using the EGRl 
promoter linked to reporter molecules, activation of cells can be assessed. 

Particularly, the following protocol is used to assess neuronal activity in PC 12 
cell lines. PC12 cells (rat phenochromocytoma cells) are known to proliferate and/or 

10 differentiate by activation with a number of mitogens, such as TPA (tetradecanoyl 

phorbol acetate). NGF (nerve growth factor), and EGF (epidermal growth factor). The 
EGRl gene expression is activated during this treatment. Thus, by stably transfecting 
PC 12 cells with a construct containing an EGR promoter linked to SEAP reporter, 
activation of PC 1 2 cells can be assessed. 

15 The EGRySEAP reporter construct can be assembled by die following protocol. 

The EGR-1 promoter sequence (-633 to +l)(Sakamoto K et al.. Oncogene 6:867-871 
(1991)) can be PCR amplified from human genomic DNA using the following primers: 

5' GCGCTCGAGGGATGACAGCGATAGAACCCCGG -3' (SEQ ID N0:6) 
20 5' GCGAAGCTTCGCGACTCCCCGGATCCGCCTC-3* (SEQ ID NO:7) 

Using die GAS:SEAP/Neo vector produced in Example 12, EGRl amplified 
product can tiien be inserted into tiiis vector. Linearize the GAS:SEAP/Neo vector 
using resuiction enzymes Xhol/Hindlll, removing the GAS/SV40 stuffer. Restrict the 
25 EGRl amplified product with tiiese same enzymes. Ligate die vector and the EGRl 
promoter. 

To prepare 96 well-plates for cell culture, two mis of a coating solution (1 :30 
dilution of collagen type I (Upstate Biotech Inc. Cat#08-115) in 30% etiianol (filter 
sterilized)) is added per one 10 cm plate or 50 ml per well of die 96- well plate, and 
30 allowed to air dry for 2 hr. 

PC 12 cells are routinely grown in RPMH640 medium (Bio Whittaker) 
containing 10% horse serum (JRH BIOSCIENCES, Cat. # 12449-78P), 5% heat- 
inactivated fetal bovine serum (PBS) supplemented with 100 units/ml penicillin and 100 
ug/ml streptomycin on a precoated 10 cm tissue culture dish. One to four split is done 
every three to four days. Cells are removed from the plates by scraping and 
resuspended with pipeuing up and down for more than 15 times. 
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Transfect the EGR/SEAP/Neo construct into PC12 using the Lipofectamine 
protocol described in Example 11. EGR-SEAP/PC12 stable cells are obtained by 
growing the cells in 300 ug/ml G418. The G418-free medium is used for routine 
growth but every one to two months, the cells should be re-grown in 300 ug/ml G418 
5 for couple of passages. 

To assay for neuronal activity, a 10 cm plate with cells around 70 to 80% 
confluent is screened by removing the old medium. Wash the cells once with PBS 
(Phosphate buffered saline). Then starve the cells in low serum medium (RPMI-1640 
. containing 1% horse serum and 0.5% FBS with antibiotics) overnight. 
10 The next morning, remove the medium and wash the cells with PBS. Scrape 

off the cells from the plate, suspend the cells well in 2 ml low serum medium. Count 
the cell number and add more low semm medium to reach final cell density as 5xl05 
cells/ml. 

Add 200 ul of the cell suspension to each well of 96-well plate (equivalent to 
15 1 X 1 05 cells/well). Add 50 ul supernatant produced by Example 1 1 , 37^0 for 48 to 72 
hr. As a positive control, a growth factor known to activate PC 12 cells through EGR 
can be used, such as 50 ng/ul of Neuronal Growth Factor (NGF). Over fifty-fold 
induction of SEAP is typically seen in the positive control wells. SEAP assay the 
supernatant according to Example 17. 

20 

Example 16: High-Throughput Screening Assay for T-cell Activity 

NF-kB (Nuclear Factor kB) is a transcription factor activated by a wide variety 

of agents including the inflanunatory cytokines DL-l and TNF, CD30 and CD40, 
lymphotoxin-alpha and lymphotoxin-beta, by exposure to LPS or thrombin, and by 

25 expression of certain viral gene products. As a transcription factor, NF-kB regulates 

the expression of genes involved in inunune cell activation, control of apoptosis (NF- 

kB appears to shield cells from apoptosis), B and T-cell development, anti-viral and 

antimicrobial responses, and multiple stress responses. 

In non-stimulated conditions, NF- kB is retained in the cytoplasm with I-kB 

30 (Inhibitor kB). However, upon stimulation, I- kB is phosphorylated and degraded, 

causing NF- kB to shuttle to the nucleus, thereby activating u-anscription of target 

genes. Target genes activated by NF- kB include IL-2, IL-6, GM-CSF, ICAM-1 and 
class 1 MHC. 
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Due to its central role and ability to respond to a range of stimuli, reporter 
constructs utilizing the NF-kB promoter element are used to screen the superaatants 
produced in Example 1 1 . Activators or inhibitors of NF-kB would be useful in treating 
diseases. For example, inhibitors of NF-kB could be used to treat those diseases 
related to the acute or chronic activation of NF-kB, such as rheumatoid arthritis. 

To constmct a vector containing the NF-kB promoter element, a PCR based 

strategy is employed. The upstream primer contains four tandem copies of the NF-kB 

binding site (GGGGACTTTCCC) (SEQ ID NO:8), 18 bp of sequence complementary 

to the 5' end of the SV40 early promoter sequence, and is flanked with an Xhol site: 
5':GCGGCCTCGAGGGGACTTTCCCGGGGACriTCCGGGGACrTTCCGGGAC 
TTTCCATCCTGCCATCTCAATTAG:3' (SEQIDNO:9) 

The downstream primer is complementary to the 3* end of the SV40 promoter 
and is flanked with a Hind III site: 

5':GCGGCAAGCTmTGCAAAGCCTAGGC:3^ (SEQ ID NO:4) 

PCR amplification is performed using the S V40 promoter template present in 
the pB-gal:promoter plasmid obtamed from Clontech. The resulting PCR fragment is 
digested with Xhol and Hind III and subcloned into BLSK2-. (Stratagene) 
Sequencing with the T7 and T3 primers confirms the insert contains the following 
sequence: 

5*:CTCGAGGGGACTTTCCCGGGGACTTTCCGGGGACTTTCCGGGACTTTCC 

ATCTGCCATCTCAATTAGTCAGCAACCATAGTCCCGCCCCTAACrCCGCCCA 

TCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATGGCTGACT 

AATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCrCTGAGCrATTC 

CAGAAGTAGTGAGGAGGCITTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTT^ 

y (SEQ ID NO: 10) 

Next, replace the S V40 minimal promoter element present in the pSEAP2- 
promoter plasmid (Clontech) with this NF-KB/SV40 fragment using Xhol and Hindlll. 

However, this vector does not contain a neomycin resistance gene, and therefore, is not 
preferred for mammalian expression systems. 

In order to generate stable mammalian cell lines, the NF-KB/SV40/SEAP 

cassette is removed from the above NF-kB/SEAP vector using restriction enzymes Sail 
and NotI, and inserted into a vector containing neomycin resistance. Particularly, the 
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NF-KB/SV40/SEAP cassette was inserted into pGFP-1 (Clontech), replacing the GFP 

gene, after restricting pGFP-1 with Sail and NotL 

Once NF-KB/SV40/SEAP/Neo vector is created, stable Jurkat T-cells are 

created and maintained according to the protocol described in Example 13. Similarly, 
5 the method for assaying supematants with these stable Jurkat T-cells is also described 
in Example 13. As a positive control, exogenous TNF alpha (0.1,1, 10 ng) is added to 
wells H9, HIO, and HI 1, with a 5-10 fold activation typically observed. 

Example 17: Assay for SEAP Activity 
10 As a reporter molecule for the assays described in Examples 13-16, SEAP 

activity is assayed using the Tropix Phospho-light Kit (Cat. BP-400) according to the 

following general procedure. The Tropix Phbspho-light Kit supplies the Dilution, 

Assay, and Reaction Buffers used below. 

Prime a dispenser with the 2.5x Dilution Buffer and dispense 15 ^1 of 2.5x 
15 dilution buffer into Optiplates containing 35 |il of a supernatant. Seal the plates with a 

plastic sealer and incubate at 65^ for 30 min. Separate the Optiplates to avoid uneven 
heating. 

Cool the samples to room temperature for 15 minutes. Empty the dispenser and 

prime with the Assay Buffer. Add 50 \il Assay Buffer and incubate at room 

20 temperature 5 min. Empty the dispenser and prime with the Reaction Buffer (see the 

table below). Add 50 ^il Reaction Buffer and incubate at room temperature for 20 

minutes. Since the intensity of the chemiluminescent signal is time dependent, and it 
takes about 10 minutes to read 5 plates on luminometer, one should treat 5 plates at each 
time and start the second set 10 minutes later. 
25 Read the relative light unit in the luminometer. Set H 12 as blank, and print the 

results. An increase in chemiluminescence indicates reporter activity. 



Reaction Buffer Formulation: 

# of plates Rxn buffer diluent (ml) CSPD (ml) 

11 65 3.25 

12 70 3,5 

13 75 3.75 

14 80 4 
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Example 18: High-Throughput Screening Assay Identifying Changes in 
Small Molecule Concentration and Membrane Permeability 

Binding of a ligand to a receptor is known to alter intracellular levels of small 
5 molecules, such as calcium, potassium, sodium, and pH, as well as alter membrane 
potential. These alterations can be measured in an assay to identify supematants which 
bind to receptors of a particular cell. Although the following protocol describes an 
assay for calcium, this protocol can easily be modified to detect changes in potassium, 
sodium, pH, membrane potential, or any other small molecule which is detectable by a 

1 0 fluorescent probe . 

The following assay uses Ruorometric Imaging Plate Reader ("FLIPR'*) to 
measure changes in fluorescent molecules (Molecular Probes) that bind small 
molecules. Clearly, any fluorescent molecule detecting a small molecule can be used 
instead of the calcium fluorescent molecule, fluo-3, used here. 

15 For adherent cells, seed the cells at 10,000 -20,000 cells/well in a Co-star black 

96-well plate with clear bottom. The plate is incubated in a COj incubator for 20 hours. 
The adherent cells are washed two times in Biotek washer with 200 ul of HBSS 
(Hank's Balanced Salt Solution) leaving 100 ul of buffer after the final wash. 

A stock solution of 1 mg/ml fluo-3 is made in 10% pluronic acid DMSO. To 

20 load the cells with fluo-3, 50 ul of 12 ug/ml fluo-3 is added to each well. The plate is 
incubated at 37*'C in a CO^ incubator for 60 min. The plate is washed four times in the 
Biotek washer with HBSS leaving 100 ul of buffer. 

For non-adherent cells, the cells are spun down from culture media. Cells are 
re-suspended to 2-5x10* cells/ml with HBSS in a 50-ml conical tube. 4 ul of 1 mg/ml 

25 fluo-3 solution in 10% pluronic acid DMSO is added to each ml of cell suspension. 

The tube is then placed in a 37*'C water bath for 30-60 min. The cells are washed twice 
with HBSS, resuspended to 1x10* cells/ml, and dispensed into a microplate, 100 
ul/well. The plate is centrifuged at 1000 rpm for 5 min. The plate is then washed once 
in Denley CellWash with 200 ul, followed by an aspiration step to 100 ul final volume. 

30 For a non-cell based assay, each well contains a fluorescent molecule, such as 

fluo-3. The supernatant is added to the well, and a change in fluorescence is delected. 

To measure the fluorescence of intracellular calcium, the FLIPR is set for the 
following parameters: (1) System gain is 300-800 mW; (2) Exposure time is 0.4 
second; (3) Camera F/stop is F/2; (4) Excitation is 488 nm; (5) Emission is 530 nm; and 

35 (6) Sample addition is 50 ul. Increased emission at 530 nm indicates an extracellular 
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signaling even which has resulted in an increase in the intracellular Ca~*^ 
concentration. 

Example 19: High-Throughput Screening Assay Identifying Tyrosine 
5 Kinase Activity 

The Protein Tyrosine Kinases (PTK) represent a diverse group of 
transmembrane and cytoplasmic kinases. Within the Receptor Protein Tyrosine Kinase 
RPTK) group are receptors for a range of mitogenic and metabolic growth factors 
including the PDGF, FGF, EGF, NGF, HGF and Insulin receptor subfamilies. In 

10 addition there are a large family of RPTKs for which the corresponding Ugand is 
unknown. Ligands for RPTKs include mainly secreted small proteins, but also 
membrane-bound and extracellular matrix proteins. 

Activation of RPTK by ligands involves ligand-mediated receptor dimerization, 
resulting in transphosphorylation of the receptor subunits and activation of the 

15 cytoplasmic tyrosine kinases. The cytoplasmic tyrosine kinases include receptor 
associated tyrosine kinases of the src-family (e.g., src, yes, Ick, lyn, fyn) and non- 
receptor linked and cytosolic protein tyrosine kinases, such as the Jak family, members 
of which mediate signal transduction triggered by the cytokine superfamily of receptors 
(e.g., the Interleukins, Interferons, GM-CSF, and Leptin). 

20 Because of the wide range of known factors capable of stimulating tyrosine 

kinase activity, the identification of novel human secreted proteins capable of activating 
tyrosine kinase signal transduction pathways are of interest. Therefore, the following 
protocol is designed to identify those novel human secreted proteins capable of 
activating the tyrosine kinase signal transduction pathways. 

25 Seed target cells (e.g., primary keratinocytes) at a density of approximately 

25,000 cells per well in a 96 well Loprodyne Silent Screen Plates purchased from 
Nalge Nunc (Naperville, IL). The plates are sterilized with two 30 minute rinses with 
100% ethanol, rinsed with water and dried overnight. Some plates are coated for 2 hr 
with 100 ml of cell culture grade type I collagen (50 mg/ml), gelatin (2%) or poly lysine 

30 (50 mg/ml), all of which can be purchased from Sigma Chemicals (St. Louis, MO) or 
10% Matrigel purchased from Becton Dickinson (Bedford,MA), or calf seram, rinsed 

with PBS and stored at 4^C. Cell growth on these plates is assayed by seeding 5,000 
cells/well in growth medium and indirect quantitation of cell number through use of 
alamarBlue as described by the manufacturer Alamar Biosciences, Inc. (Sacramento, 
35 CA) after 48 hr. Falcon plate covers #307 1 from Becton Dickinson (Bedford,MA) are 
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used to cover the Loprodyne Silent Screen Plates. Falcon Microtest III cell culture 
plates can also be used in some proliferation experiments. 

To prepare extracts, A431 cells are seeded onto the nylon membranes of 
Loprodyne plates (20,00Q/200ml/well) and cultured overnight in complete medium. 
5 Cells are quiesced by incubation in serum-free basal medium for 24 hr. After 5-20 
minutes treatment with EGF (60ng/ml) or SO ul of the supernatant produced in Example 
11, the medium was removed and 100 ml of extraction buffer ((20 mM HEPES pH 
7.5. 0.15 M NaCl, 1% Triton X-100, 0.1% SDS, 2 mM Na3V04, 2 mM Na4P207 
and a cocktail of protease inhibitors (# 1836170) obtained from Boeheringer Mannheim 
10 (Indianapolis, IN) is added to each well and the plate is shaken on a rotating shaker for 

5 minutes at 4^C. The plate is then placed in a vacuum transfer manifold and the extract 
filtered through the 0.45 nun membrane bottoms of each well using house vacuum. 
Extracts are collected in a 96-well catch/assay plate in the bottom of the vacuum 
manifold and immediately placed on ice. To obtain extracts clarified by centrifugation, 

1 5 the content of each well, after detergent solubilization for 5 minutes, is removed and 
centrifiiged for 15 minutes at A^C at 16,000 x g. 

Test the filtered extracts for levels of tyrosine kinase activity. Although many 
methods of detecting tyrosine kinase activity are known, one method is described here. 
Generally, the tyrosine kinase activity of a supernatant is evaluated by 

20 detemndning its ability to phosphorylate a tyrosine residue on a specific substrate (a 
biotinylated peptide). Biotinylated peptides that can be used for this purpose include 
PSKl (corresponding to amino acids 6-20 of the cell division kinase cdc2-p34) and 
PSK2 (corresponding to amino acids 1-17 of gastrin). Both peptides are substrates for 
a range of tyrosine kinases and are available from Boehringer Mannheim. 

25 The tyrosine kinase reaction is set up by adding the following components in 

order. First, add lOul of 5uM Biotinylated Peptide, then lOul ATP/Mg2+ (5mM 
ATP/50mM MgCl2), then lOul of 5x Assay Buffer (40mM imidazole hydrochloride, 
pH7.3, 40 mM beta-glycerophosphate, ImM EGTA, lOOmM MgCl2, 5 mM MnCl2, 
0.5 mg/ml BSA), then 5ul of Sodium Vanadate(lmM), and then 5ul of water. Mix the 

30 components gently and preincubate the reaction mix at 30^C for 2 min. Initial the 
reaction by adding lOul of the control enzyme or the filtered supernatant. 

The tyrosine kinase assay reaction is then terminated by adding 10 ul of 120mm 
EDTA and place the reactions on ice. 

Tyrosine kinase activity is determined by transferring 50 ul aliquot of reaction 
35 mixture to a microtiter plate (MTP) module and incubating at 37^C for 20 min. This 
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allows the streptavadin coated 96 well plate to associate with the biotinylated peptide. 
Wash the MTP module with 3(X)ul/well of PBS four times. Next add 75 ul of anti- 
phospotyrosine antibody conjugated to horse radish peroxidase(anti-P-Tyr- 
POD(0.5u/mJ)) to each well and incubate at 37^ for one hour. Wash the well as 
5 above. 

Next add lOOul of peroxidase substrate solution (Boehringer Mannheim) and 
incubate at room temperature for at least 5 mins (up to 30 min). Measure the 
absorbance of the sample at 405 nm by using ELIS A reader. The level of bound 
peroxidase activity is quantitated using an ELISA reader and reflects the level of 
1 0 tyrosine kinase activity. 

Example 20: High-Through put Screening Assay Identifying 
Phosphorylation Activity 

As a potential alternative and/or compliment to the assay of protein tyrosine 

15 kinase activity described in Example 19, an assay which detects activation 

(phosphorylation) of major intracellular signal transduction intermediates can also be 
used. For example, as described below one particular assay can detect tyrosine 
phosphorylation of the Erk-1 and Erk-2 kinases. However, phosphorylation of other 
molecules, such as Raf. JNK, p38 MAP, Map kinase kinase (MEK), MEK kinase, 

20 Src, Muscle specific kinase (MuSK), IRAK, Tec, and Janus, as well as any other 
phosphoserine, phosphotyrosine, or phosphothreonine molecule, can be detected by 
substituting these molecules for Erk-l or Erk-2 in the following assay. 

Specifically, assay plates are made by coating the wells of a 96-well ELISA 
plate with 0.1ml of protein G (lug/ml) for 2 hr at room temp, (RT). The plates are then 

25 rinsed with PBS and blocked with 3% BSA/PBS for 1 hr at RT. The protein G plates 
are then treated with 2 commercial monoclonal antibodies (lOOng/well) against Erk-1 
and Erk-2 ( 1 hr at RT) (Santa Cruz Biotechnology). (To detect other molecules, this 
step can easily be modified by substituting a monoclonal antibody detecting any of the 
above described molecules.) After 3-5 rinses with PBS, the plates are stored at 4^C 

30 until use. 

A43 1 cells are seeded at 20,000/well in a 96-well Loprodyne filterplate and 
cultured overnight in growth medium. The cells are then starved for 48 hr in basal 
medium (DMEM) and then treated with EGF (6ng/well) or 50 ul of the supematants 
obtained in Example 1 1 for 5-20 minutes. The cells are then solubilized and extracts 
35 filtered directly into the assay plate. 
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After incubation with the extract for 1 hr at RT, the wells are again rinsed. As a 
positive control, a commercial preparation of MAP kinase ( lOng/well) is used in place 
of A431 extract. Plates are then treated with a commercial polyclonal (rabbit) antibody 
(lug/ml) which specifically recognizes the phosphorylated epitope of the Erk-1 and 
5 Erk-2 kinases ( 1 hr at RT). This antibody is biotinylated by standard procedures. The 
bound polyclonal antibody is then quantitated by successive incubations with 
Europium-streptavidin and Europium fluorescence enhancing reagent in the Wallac 
DELFIA instrument (time-resolved fluorescence). An increased fluorescent signal over 
background indicates a phosphoiylation. 

10 

Example 21: Method of Determining Alterations in a Gene 
Corresponding to a Polynucleotide 

RNA isolated from entire families or individual patients presenting with a 
phenotype of interest (such as a disease) is be isolated. cDN A is then generated from 
15 these RNA samples using protocols known in the art. (See, Sambrook.) The cDNA is 
then used as a template for PCR, employing primers surrounding regions of interest in 

SEQ ID NO:X. Suggested PGR conditions consist of 35 cycles at 95''C for 30 

seconds; 60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer 

solutions described in Sidransky, D„ et al., Science 252:706 (1991). 

20 PGR products is then sequenced using primers labeled at their 5' end with T4 

polynucleotide kinase, employing SequiTherm Polymerase. (Epicentre Technologies). 
The intron-exon borders of selected exons is also determined and genomic PGR 
products analyzed to conflrm the results. PGR products harboring suspected mutations 
is then cloned and sequenced to validate the results of the direct sequencing. 

25 PGR products is cloned into T-tailed vectors as described in Holton, T.A. and 

Graham, M.W., Nucleic Acids Research, 19:1 156 (1991) and sequenced with T7 
polymerase (United States Biochemical). Affected individuals is identified by 
mutations not present in unaffected individuals. 

Genomic rearrangements are also observed as a method of determining 

30 alterations in a gene corresponding to a polynucleotide. Genomic clones isolated 
according to Example 2 are nick-translated with digoxigenindeoxy-uridine 5'- 
triphosphate (Boehringer Manheim), and FISH performed as described in Johnson, 
Gg. et al.. Methods Cell Biol. 35:73-99 (1991). Hybridization with the labeled probe is 
carried out using a vast excess of human cot-1 DNA for specific hybridization to the 

35 corresponding genomic locus. 
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Chromosomes are counterstained with 4,6-diamino-2-phenylidole and 
propidium iodide, producing a combination of C- and R-bands. Aligned images for 
precise mapping are obtained using a triple-band filter set (Chroma Technology, 
Brattleboro, VT) in combination with a cooled charge-coupled device camera 

5 (Photometries, Tucson, AZ) and variable excitation wavelength filters. (Johnson. Cv. 
et al.. Genet. Anal. Tech. AppL, 8:75 (1991).) Image collection, analysis and 
chromosomal fractional length measurements are performed using the ISee Graphical 
Program System. (Inovision Corporation, Durham. NC.) Chromosome alterations of 
the genomic region hybridized by the probe are identified as insertions, deletions, and 

10 translocations. These alterations are used as a diagnostic marker for an associated 
disease. 

Example 22: Method of Detecting Abnormal Levels of a Polypeptide in a 
Biological Sample 

15 A polypeptide of the present invention can be detected in a biological sample, 

and if an increased or decreased level of the polypeptide is detected, this polypeptide is 
a marker for a particular phenotype. Metiiods of detection are numerous, and thus, it is 
understood that one skilled in die art can modify the following assay to fit their 
particular needs. 

20 For example, antibody-sandwich ELIS As are used to detect soluble 

polypeptides in a sample, preferably a biological sample. Wells of a microtiter plate are 
coated with specific antibodies, at a final concentration of 0.2 to 10 ug/ml. The 
antibodies are either monoclonal or polyclonal and are produced by the method 
described in Example 10. The wells arc blocked so that non-specific binding of die 

25 polypeptide to the well is reduced. 

The coated wells are then incubated for > 2 hours at RT with a sample 
containing the polypeptide. Preferably, serial dilutions of the sample should be used to 
validate results. The plates are then washed three times with deionized or distilled water 
to remove unbounded polypeptide. 

30 Next, 50 ul of specific antibody-alkaline phosphatase conjugate, at a 

concentration of 25-400 ng, is added and incubated for 2 hours at room temperature. 
The plates are again washed three times with deionized or distilled water to remove 
unbounded conjugate. 

Add 75 ul of 4'methylumbelliferyl phosphate (MUP) or p-nitrophenyl 

35 phosphate (NPP) substrate solution to each well and incubate 1 hour at room 

temperature. Measure the reaction by a microtiter plate reader. Prepare a standard 
curve, using serial dilutions of a control sample, and plot polypeptide concentration on 
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the X-axis (log scale) and fluorescence or absorbance of the Y-axis (linear scale). 
Interpolate the concentration of the polypeptide in the sample using the standard curve. 

Example 23: F ormulating a Polypeptide 

5 The secreted polypeptide composition will be formulated and dosed in a fashion 

consistent with good medical practice, taking into account the clinical condition of the 
individual patient (especially the side effects of treatment with the secreted polypeptide 
alone), the site of delivery, the method of administration, the scheduling of 
administration, and other factors known to practitioners. The "effective amount" for 

10 purposes herein is thus determined by such considerations. 

As a general proposition, the total pharmaceutically effective amount of secreted 
polypeptide administered parenterally per dose will be in the range of about 1 |ig/kg/day 
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject 
to therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and 

1 S most preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If 
given continuously, the secreted polypeptide is typically administered at a dose rate of 
about 1 p,g/kg/hour to about SO )ig/kg/hour, either by 1-4 injections per day or by 
continuous subcutaneous infusions, for example, using a mini-pump. An intravenous 
bag solution may also be employed. The length of treatment needed to observe changes 

20 and the interval following treatment for responses to occur appears to vary depending 
on the desired effect. 

Pharmaceutical compositions containing the secreted protein of the invention are 
administered orally, rectally, parenterally, intracistemally, intravaginally, 
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal 

25 patch), bucally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers 
to a non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or 
formulation auxiliary of any type. The term "parenteral" as used herein refers to modes 
of administration which include intravenous, intramuscular, intraperitoneal, intrastemal, 
subcutaneous and intraarticular injection and infusion. 

30 The secreted polypeptide is also suitably administered by sustained-release 

systems. Suitable examples of sustained-release compositions include semi-permeable 
polymer matrices in the form of shaped articles, e.g., films, or mirocapsules. 
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481), 
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al., 

35 Biopolymers 22:547-556 ( 1 983)). poly (2- hydroxyethyl methacrylate) (R. Langer et 
al., J. Biomed. Mater. Res. 15:167-277 (1981), and R. Langer, Chem. Tech. 12:98- 
105 (1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D- (-)-3-hydroxybutyric 
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acid (EP 133,988). Sustained-release compositions also include liposomally entrapped 
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods 
known per se: DE 3,218,121; Epstein et al., Proc. NatL Acad. Sci. USA 82:3688-3692 
(1985); Hwang et al., Proc, NaU. Acad. Sci. USA 77:4030-4034 (1980); EP 52,322; 

5 EP 36,676; EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008; 
U.S. Pat. Nos. 4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes 
are of the small (about 200-800 Angstroms) unilamellar type in which the lipid content 
is greater than about 30 mol. percent cholesterol, the selected proportion being adjusted 
for the optimal secreted polypeptide therapy. 

10 For parenteral administration, in one embodiment, the secreted polypeptide is 

formulated generally by mixing it at the desired degree of purity, in a unit dosage 
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable 
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations 
employed and is compatible with other ingredients of the formulation. For example, the 

15 formulation preferably does not include oxidizing agents and other compounds that are 
known to be deleterious to polypeptides. 

Generally, the formulations are prepared by contacting the polypeptide 
uniformly and intimately with liquid carriers or finely divided solid carriers or both. 
Then^ if necessary, the product is shaped into tiie desired formulation. Preferably Uie 

20 carrier is a parenteral carrier, more preferably a solution that is isotonic with the blood 
of the recipient. Examples of such carrier vehicles include water, saline. Ringer's 
solution, and dextrose solution. Non-aqueous vehicles such as fixed oils and ethyl 
oleate are also useful herein, as well as liposomes. 

The carrier suitably contains minor amounts of additives such as substances that 

25 enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at 
the dosages and concentrations employed, and include buffers such as phosphate, 
citrate, succinate, acetic acid, and other organic acids or their salts; antioxidants such as 
ascorbic acid; low molecular weight (less tiian about ten residues) polypeptides, e.g., 
polyarginine or tripeptides; proteins, such as serum albumin, gelatin, or 

30 inmiunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids, 
such as glycine, glutamic acid, aspartic acid, or arginine; monosaccharides, 
disaccharides, and other carbohydrates including cellulose or its derivatives, glucose, 
manose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or 
sorbitol; counterions such as sodium; and/or nonionic surfactants such as polysorbates, 

35 poloxamers, or PEG. 

The secreted polypeptide is typically formulated in such vehicles at a 
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of 



wo 98/39448 



237 



PCT/US98/04493 



about 3 to 8. It will be understood that the use of certain of the foregoing excipients, 
carriers, or stabilizers will result in the formation of polypeptide salts. 

Any polypeptide to be used for therapeutic administration can be sterUe. 
Sterility is readily accomplished by filtration through sterile filtration membranes (e.g., 
S 0.2 micron membranes). Therapeutic polypeptide compositions generally are placed 
into a container having a sterile access port, for example, an intravenous solution bag or 
vial having a stopper pierceable by a hypodermic injection needle. 

Polypeptides ordinarily will be stored in unit or multi-dose containers, for 
example, sealed ampoules or vials, as an aqueous solution or as a lyophilized 

10 formulation for reconstitution. As an example of a lyophilized formulation, 10-ml vials 
are filled with 5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the 
resulting mixture is lyophihzed. The infusion solution is prepared by reconstituting the 
lyophilized polypeptide using bacteriostatic Water-for-Injection. 

The invention also provides a pharmaceutical pack or kit comprising one or 

IS more containers filled with one or more of the ingredients of the pharmaceudcal 

compositions of the invention. Associated with such container(s) can be a notice in the 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals or biological products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. In addition, the polypeptides of the 

20 present invention may be employed in conjunction with other therapeutic compounds. 

Example 24: Method of Treating Decreased Levels of the Polypeptide 
It will be appreciated that conditions caused by a decrease in the standard or 
normal expression level of a secreted protein in an individual can be treated by 

25 administering the polypeptide of the present invention, preferably in the secreted form. 
Thus, the invention also provides a method of treatment of an individual in need of an 
increased level of the polypeptide comprising administering to such an individual a 
pharmaceutical composition comprising an amount of the polypeptide to increase the 
activity level of the polypeptide in such an individual. 

30 For example, a patient with decreased levels of a polypeptide receives a daily 

dose 0.1-100 ug/kg of the polypeptide for six consecutive days. Preferably, the 
polypeptide is in the secreted form. The exact details of the dosing scheme, based on 
administration and formulation, are provided in Example 23. 
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Example 25: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the 
present invention. This technology is one example of a method of decreasing levels of 
a polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer. 
5 For example, a patient diagnosed with abnormally increased levels of a 

polypeptide is administered intravenously antisense polynucleotides at O.S, 1.0, 1.5, 
2,0 and 3.0 mg/kg day for 21 days. This treatment is repeated after a 7-day rest period 
if the treatment was well tolerated. The formulation of the antisense polynucleotide is 
provided in Example 23. 

10 

Example 26; Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of 
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a 
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and 
15 separated into small pieces. Small chunks of the tissue are placed on a wet surface of a 
tissue culture flask, approximately ten pieces are placed in each flask. The flask is 
turned upside down, closed tight and left at room temperature over night. After 24 
hours at room temperature, the flask is inverted and the chunks of tissue remain fixed to 
the bottom of the flask and fresh media (e.g.. Ham's F12 media, with 10% FBS. 

20 penicillin and streptomycin, is added. The flasks are then incubated at 3TC for 

approximately one week. 

At this time, fresh media is added and subsequently changed every several days. 
After an additional two weeks in culture, a monolayer of fibroblasts emerge. The 
monolayer is trypsinized and scaled into larger flasks. 

25 pMV-7 (Kirschmeier, P.T. et al., DNA. 7:219-25 (1988)), flanked by the long 

terminal repeats of the Moloney murine sarcoma virus, is digested with EcoRI and 
Hindni and subsequently treated with calf intestinal phosphatase. The linear vector is 
fractionated on agarose gel and purified, using glass beads. 

The cDNA encoding a polypeptide of the present invention can be amplified 

30 using PCR primers which correspond to the 5* and 3' end sequences respectively as set 
forth in Example 1. Preferably, the 5* primer contains an EcoRI site and the 3' primer 
includes a Hindlll site. Equal quantities of the Moloney murine sarcoma virus linear 
backbone and the amplified EcoRI and Hindlll fragment are added together, in the 
presence of T4 DNA ligase. The resulting mixture is maintained under conditions 

35 appropriate for ligation of the two fragments. The ligation mixture is then used to 
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transform bacteria HB 101, which are then plated onto agar containing kanamycin for 
the purpose of confirming that the vector has the gene of interest properly inserted. 

The amphotropic pA317 or GP+aml2 packaging cells are grown in tissue 
culture to confluent density in Dulbecco's Modified Eagles Medium (DMEM) with 10% 
5 calf serum (CS), penicillin and streptomycin. The MS V vector containing the gene is 
then added to the media and the packaging cells transduced with the vector. The 
packaging cells now produce infectious viral particles containing the gene (the 
packaging cells are now referred to as producer cells). 

Fresh media is added to the transduced producer cells, and subsequently, the 

10 media is harvested from a 10 cm plate of confluent producer cells. The spent media, 
containing the infectious viral particles, is filtered through a millipore filter to remove 
detached producer cells and this media is then used to infect fibroblast cells. Media is 
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media 
from the producer cells. This media is removed and replaced with fresh media. If the 

IS titer of virus is high, then virtually all fibroblasts will be infected and no selection is 
required. If the titer is very low, then it is necessary to use a retroviral vector that has a 
selectable marker, such as neo or his. Once the fibroblasts have been efficiently 
infected, the fibroblasts are analyzed to determine whether protein is being produced. 
The engineered fibroblasts are then transplanted onto the host, either alone or 

20 after having been grown to confluence on cytodex 3 microcarrier beads. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. Numerous modifications and 
variations of the present invention are possible in light of the above teachings and, 
therefore, are within the scope of the appended claims. 

25 The entire disclosure of each document cited (including patents, patent 

applications, journal articles, abstracts, laboratory manuals, books, or other 
disclosures) in the Background of the Invention, Detailed Description, and Examples is 
hereby incorporated herein by reference. 
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(1) GENERAL INFORMATION: 

(i) APPLICANT: Human Genome Sciences, Inc. et al. 

(ii) TITLE OF INVENTION: 186 Human Secreted Proteins 

(iii) NUMBER OF SEQUENCES: 644 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Human Genome Sciences, Inc. 
15 (B) STREET: 9410 Key West Avenue 

(C) CITY: Rockville 

(D) STATE: Maryland 

20 

(E) COUNTRY: USA 

(F) ZIP: 20850 

25 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4Mb storage 

30 

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 
35 (D) SOFTWARE: ASCII Text 



40 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: March 6, 1998 
45 (C) CLASSIFICATION: 



50 



55 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
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15 



20 



30 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: A. Anders Brookes, Esq. 

(B) REGISTRATION NUMBER: 36,373 

<C) REFERENCE/DOCKET NUMBER: PS002.PCT 

(vi) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8439 

(2) JNFOBMATim FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEt«3TH: 733 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GGGATCCGGA GCCCAAATCT TCTGACAAAA CTCACACATG CCCACCGTGC CCAQCACCTG 60 

AATTCGAGGG TGCACCGTCA GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA 120 

35 TCTCCCGGAC TCCTGAGGTC ACATGCGTGG TGGTGGACGT AAGCCACGAA GACCCTGAGG 180 

TCAAGTTCAA CTGGTACGTG GACGGCGTGG AGGTGCATAA TQCCAAGACA AAGCCGCGGG 240 

AGGAGCAGTA CAACAGCACG TACCGTGTGG TCAGOGTCCT CACCGTCCTG CACCAGGACT 300 

40 

GGCTGAATGG CAAGGAGTAC AAGTGCAAGG TCTCCAACAA AGCCCTCCCA ACCCCCATOG 360 

AGAAAACCAT CTCCAAAGCC AAAGGGCAGC CCCGAGAACC ACAGGTGTAC ACCCTGCCCC 420 

45 CATCCCGGGA TGAGCTGACC AAGAACCAGG TCAGCCTGAC CTGCCTGGTC AAAGGCTTCT 480 

ATCCAAGCGA CATCGCCGTG GAGTGGGAGA GCAATGGGCA GCCGGAGAAC AACTACAAGA 540 

CCACGCCTCC CGTGCTGGAC TCCGACQGCT CCTTCTTCCT CTACAGCAAG CTCACCX3TGG 600 

50 

ACAAGAGCAG GTGGCAGCAG GGGAACGTCT TCTCATGCTC CGTGATGCAT GAGGCTCTGC 660 

ACAACCACTA CACQCAGAAG AGCCTCTCCC TGTCTCCGGG TAAATGAGTG CGAOGGCCGC 720 

55 GACTOTAJSAG GAT 733 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SE(^JENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Trp Ser Xaa Trp Ser 
1 5 



15 

(2) INFORMATI<»« FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 (xi> SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GCGCCTCGAG ATTTCCCCGA AATCTAGATT TCCCCGAAAT GATTTCCCOG AAATGATTTC 
CCCGAAATAT CTGCCATCTC AATTAG 

30 



35 



45 



(2> INFORMATION FOR SEQ ID NO: 4: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GCGGCAAGCT TTITGCAAAG CCTAGGC 27 



50 



60 



(2) INFORMATION FOR SEQ ID NO: 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CTCGAGATTT CCCCGAAATC TAGATTTCCC CGAAATGATT TCCCCGAAAT GATTTCCCCG 60 
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AAATATCTGC CATCTCAATT AGTCAGCAAC CATAGTCCCG CCCCTAACTC CX3CCCATCCC X20 
GCCCCTAACT CCGCCCMTT CCGCXXATTC TCCGCCCCAT GGCTGACTAA TTTTTTTTAT 180 
5 TTATGCAGAG GCCGAGQCCG CTTCGGCCTC TGAGCTATTC CAGAAGTAGT GAGGAGGCTT 240 
TTTTGGAGGC CTAGQCrTTT QCAAAAAGCT T 271 



10 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 6; 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 (xi) SEC3UENCE DESCRIPTION: SEQ ID NO: 6: 

GOGCTCGAGG GATGACAGCG ATAGAACCCC GG 32 

25 

(2) INFORMATION FOR SEQ ZD NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
30 <A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GCGAAGCTTC GCGACTCCCC GGATCCGCCT C 31 

40 



(2). INFORMATION FOR SEQ ID NO: 8: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGGACTTTC CC 12 



(2) INFORMATION FOR SEQ ID NO: 9: 
60 ti) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI3NESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE I^CRIPTION: SEQ ID NO: 9: 
GCX3GCCTCGA CSGGGACTTTC CCGQGGACTT TCCGGGGACT TTCCQGGACT TTCCATCCTG 
10 CCATCTCAAT TAG 



15 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

25 CTCGftQGGGA CTTTCCCGGG GACTTTCCGG GGACTTTCCG GGACTTTCCA TCTQCCATCT 60 

CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC ATCCCGCCCC TAACTCCGCC 120 

CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT TTTATTTATG CAGAGGCCQA 180 

GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA GGCTTTTTTG GAGGCCTAGG 240 
CTTTTGCAAA AAGCTT 
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(2) INFORMATION FOR SEQ ID NO: 11: 



256 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GGCACGAGGT AATTTCTACC AGAAATTTCC AGAGCATTAT GTAQGTAGAA AAAAATGCAA 60 
50 GCAAGCTOTT AAAGATCTTG GATCCCATTA TATAGTATGT ATAGCTGAAA TCTCTAATTC 120 
AATCACTTTT TCTCTTTTAT CCTTCTAACCA AAAAATTGTT TAATTTTGCA TCCCAAATGT 180 
TTTTAATCTT TGTATATTTT TTAAAAATCC TTTTCTCCTC ATCATTGCCT TTTTTGTGGT 240 
TGTAAATAGA CTTACTTGCA CTTTGAAGAT GAGTTACTCC TTGTCATCTT ACAAATATGT 300 
GATATGOTAA TTTTCATAAC AGATGTCAGT TTTGAACCAA GAATTQGTGA TTTGTTTATA 360 
60 AGAAAAAAAC TGGCTTCATT TCTGTGAAAT TGCTCTTTGA AAATTTCTTT TTACACGTGT 420 



245 



AAGCXMCTG AGATACCGTG ATGC3TGTTGA TTTCTTTCAA TGATGCTTAC CATCTATTTT 
AGCCflCTGAG CCTTTTATTA TTTGTCTATT TGTAAAGTTT ATTTGTCTTA ACTCATTTAA 
TAAATATACT GTTTATCTGT TTCTGAAAAA AAAAAAAAAA AA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GTTTGGGGGT GAGGCCGAGC TGCTQCGGQG CTTCGTCGCC GGCCAGGACA CAGCTACTCG 
CACGGCGGCG GCGCCTGGCT ATGATGTTCC TCACCCAGGG CGGGCCTCTG CCCTCTACTC 
GTQCCAGGCC CACTTGCCAG GCAGGAGCCC TCCCCAAGCC TTCAGGGCTG CTCGC3AGTCA 
CCTGTTGGAA TGGACTAAAA GGACCCTTGT GTGOGAACAG GTGCTCCCCA AACACCCTOC 
TGCTGGCTGC CAGGCAGGCC CTCTGGAAGG GAAQGQGCAG GACTCATCAG GACCTCCCTG 
GACCCCTGCA GGQCAGGCAG CTTGGGCCCG AQCCCAAGCA TTTGGCTCTG CTGCCCCCAA 
GGGGACAQGA AQCCTCTTGG GCCTCTTCCC TTCCTGGACA AGGCCCCCTG CCTTTGCCTC 
ACATAAACTG TACAGTATTT TCATTAAAAG OCTCTTTCAT AAAAA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGCAA1TCC TGCTCACAGC CTTTCTGTTG GTGCCACTTC TQGCTCTTTG TGATGTCCCC 
ATATCCCTAG GCTTCTCCCC CTCCTAGAAG GGCTTCTTGA TAGATTAGAA AATAAGAATG 
AGTGACATTT CCTATGTGCA TATAAGAAGG AGCCACAAGA CATGTCTTTT AAATAAAAQG 
ACAGTGTCCA TCCTTTTAGC TGCCGAATAG AACCTTQGTC TCATCCTCCT GGAGCTAGGC 
CTTTAAAACA GCTTCTGTGT TTCTCATTTG TCTCAGTGTT TTGCCAGGGT TTTATCGGAA 
AGATAATGTT COGTTTAAAA TATTTCCTAA TGAGGCCGGG CGTGGTGGCT CACGCCTGTA 
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ACCCTAOCAM TTCQQGGCK5 AGCGC5GTGC3A TCACGAGGTC ASGAGATCGA GACCATCCTG 420 
GSTAACATGG TGAAACCCCG TCTCTACTAA AAATACAAAA AAAAAAAAAA AAAA 474 



10 



(2) INFORMATION FOR SEQ ID NO: 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTATCTTGCG GAGCAAGACC TGATAGCCAG CCTTTACATG GGAGTATAAT TCTGTCCTCC 60 

20 

ATCTCATAAG CCCCAOTACC TGAGCCAGAA TGATTATAAC CAACCACACT GTCTCTTTAT 120 
CATOGATGGC TTTAGCAGTA GGTTATTTTC ATCATTGCCA TTTGTAQCIC TACAGTGGTT 180 
25 TATACTAATT TCTCATCTTT TAAGTCnCTC CCTCAGTGCC TGTrGTTATC AAACTCATTG 240 
CTCTCTCANG CAOTTCAGCT CTGCATTCTC CCYTATGGGG GAGaGCTCSPG TTGGAGAGAG 
AGAATATNAC TTCC 
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<2) INFORMATION FOR SEQ ID NO: 15: 



300 
314 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CTCATATTGC CGrCTGGCTA AAAGTGAACA TGCCATTGAT CAATCTGCTT TTATTATATT 60 

45 

ATCTTCCTAA TGGTOSCAAG CAAGACAAGA AGTAGAAAGA AAGATGGTGT AAGCTCAAGA 120 

ACCCACTAAA TCTATCCTAT GGCCTCQGTT CACCCAGCCT GCTTTGTGGA TITrGTCTCA 180 

50 CTATAACAGA GCTCCCAAGG AGACTGCAGA GTCAGCTCCC TTAAGCACTG TAACTAAAGC 240 

CTAACTCTTC CGTTCCACCC AACAATGTYC CCAGCTCATC CTCTTTCCCR AAGTCCCCTT 300 

TCTOCCCCAG ATOCGAATTC CATTTAACTA ATCCTCAAGT GAAATCTCCA CACAGRATTC 360 

CATTTTAATT AGCATACCAT AGTTTTTGTG CAAATTTGCT TTCAGARGAC TCCCATTGCA 420 

GCTGCTCAGA GACGCTAAWG GCAGGGCCTC TTGAWGCTTT CCCGATAGCT TTCAGCTGCA 480 

60 ATAGCTCTTA GGCAGAATCC CATGAGCGTC CTGCCCAACT GTATTACrGG GGAACACCTG 540 
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ATTGGCTAGA AGTTGATCCT CCTGTAACTT TTCTGAGTTC TTTACATTTA CTCGTGAAAC 600 
CCAAATATGC CAC 613 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 base pairs 
{B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CCCCCCCCAT TGAACCCTGG GCTGTC3AAAG TTTTTGCCTG TGTGGGTCGT TCTGTGTGGC 60 

GCCTGGTGTG TGGKTCCCAA CTCCTGTTGC AAAGTGGCAG CAGCCAATCA TGAAGCGCCC 120 

TTATTTTTAG TTGCAGATGA CCAGGTCTCC CCCCCACAGC CTCTGTCTGG TCCCTCATTG 180 

25 GTGAGTGGTC TGCCTGCCCA AGGAGCCTGA TTGGTQGGAA ATGGCATCAT CTAATATC3AT 240 

GQGAAGQCAT TTGGTCCTGG TTATGTTTAT TACAACATCA TTGCACTCTG GGACTCCAGT 300 

CCCTGAAAAC C3TAATTTGTG GTGTTACCAA AGGACCACAG GOGAAAAAAA AAAAAA 356 

30 



(2) INFORMATION FOR SEQ ID NO: 17: 



(i) SEQtJENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAAACTANAT CCCGGGGCTT TTAACNGGTA CTTQGGAAAT AAGTATTGGG TAATCACTAA 60 

GNGGACATTG ACTQCACCAA ACCAAAGCTA TAGAAAGAAA TGATTGACTT TTTAAAATAT 120 

ATTCACATTA ACTGTCCTAG GATACTTCTC TTGAGGCTTT QGAAAACTTC TTCCTTGAAA 180 

50 TTTCCATATC CACTCCAGTT CTGTCACCAA AGATTTTAAT CTTCAGATCG CAATTTCCTC 240 

TCTCCCAGAA AAAAGTACTA CAACAGGCTC AAGGGATATG CTTTGGTGGT CAAGGGATTA 300 

CACTATGGTT TTCCTTCTGT TCACAATGGT ATTTACAGGA GACCTTGTCA TCAGAGGACG 360 

TACTGAACTA TCTTTATGAC TTTGGATTTG ATCAGAGGTT TAAAAAAAAA AAAA 414 



60 
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(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid 

(C) ST^IANDEDTSESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AATCACCATT GCAATACAAA TGATCTGCCT GGTGAATGYT GAGCTGTACC CCACATTCGT 60 

CAGGAACYTC CSGAGTCATGG TGTGTTCCTC CCTGTGTGAC ATAGGTGGGA TAATCACCCC 120 

15 CTTCATAGTC TTCAGGCTGA GGGAGGTCTG GCAAGCCTTG CCCCTCATTT TGTTTGCGGT 180 

GTKXSCSCCTC CITCCCGCOG GftGTGACGCT ACTTCTTCCA GAGACCAAOG QQGTCGCTTT 240 

GCCAGAGACC ATCAAGGACG CCGAGAACCT TQQGAGAAAA GCAAAGCCCA AAGAAAACAC 300 

GATTTACCTT AAGGTCCAAA CCTCAGAACC CTCGGGCACC TGAGAGAGAT GTTTTGCGQC 360 

GATOTCGTGT TGGAGGGATG AAGATGGAGT TATCCTCTGC AGAAATTCCT AGACGCCTTC 420 

25 ACTTCTCTGT ATTCTTCCTC ATACTTGCCT ACCCCCAAAT TAATATCAG 469 



30 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

40 CCCCCCCCCC CCCCCACACT TTCAGGAGTC ACCCCCCAGC ATTTGOQGTT GGGTTQGCCC 60 

TACTCCAGCC TO3AGCTCCC TGAGGGAGCC TGCACTCCCT GCTCCCAATC CCCGCTACTG 120 

GTCCAGGGAT GCAGCCTCGA GCTGGCGTCC TTGTTCTGGG CCTGCTGCTG CCGCCACCCC 180 

AGAGCCCCAG CCTOTCCTGA ATTGACATCA GTGCTTCCCT GAACTGCCTC CCCCACCCCT 240 

GGGCATTATC CCAGGAAACT TTATGTTTTC TAGAAGCTAA GCAGCTGCTG GGACTCAGGG 300 

50 ACTGGTOCAG GTAGGCTC^G TGGCAGCTCA GTCCTAGAAG GTCTCTGAAG ATCTGGACTG 360 

AQGACCTTGC TACTCCCCAA GCCAGAGCCC ATCAGCCAGG CCTGCTGTGA GCCACCTGCC 420 

TCTGGAGTGC TGAGCTCAAC CAAAGGCTGG CAAGCTCTGG GCCTCATTTA AQGGATTCTG 480 

A1GAGCCGAT GGGCCCTGGA GGCAGCCCAT TAAAGCATCT OGCTCGTTTT TGGAAAAAAA 540 
AAAAAAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 741 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQOQJCE DESCRIPTION: SEQ ID NO: 20: 
TCTTGAAGAG TGTACAGTAC AGGATTATTA TAATCSAAAGT TTATATCAAC AGGGTTTCGT 
TGGCTCTGCA TATATTATAA GCAAAAGAGA TTQGTAAAGT GCCACAGTAT TCCAGATAAC 
TTTTCAGTTG CGGCCTTTCT TCTCGTTCTT TAATTTGAAA CCTAGATACA TC3CAGTAAAA 
ACTAGGAGAA TGACTTTTAC CCTTGGGGAC AGCCAAGTTT TGTTGATAAA CCTATTTCCT 
AGCATGCCTT CAGGAAGTTG TGCCAGACCC TAGATTGTGA AGGACCCACT GTTCTTCTGT 
TGTAOGAGCT CCCTGAACCA TTGTTCAGAG GACCAATGTC ACATCGCTTC ATQGGCATO6 
NCCATC3GGAG CATCTGGGTG ATAYCTGTCT ACAGTATTQG CTCTTCTCSCG AGGCTGATAC 
ACAAGGCCTC TCTTCCACAT GATCATTTGC AAAOCTCCCC CAGCCCCTAC CATCCAATGT 
GGAAQGAAAA CAAGAACTGC CTGAAGAAGA GTCCAAGCTA CAGATACACA GCGTGTGCAT 
TGCGQCTGTC ACCTTCCTCC TCCCACTTCT GTATCCTCAG AGATGCTGCG TGGATGTTTC 
CTTAACCTCA GCTGACTTCC CTGTGAATGT CTAATGCTAG TTCAGGGCCT CCAGGCATTG 
ATTTGTACAG TQQTAACTCC CAATGAGGCT TCTGTTATCA TTTGGTGTGC TTTYTCTGTC 
ATTAAAAGAA AT6ATTTTCC C 

(2) INFORMATION FCR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STR/^EDNESS: do\ible 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GGCACGAGTC TCCCCTGGGG AAGTTTTTCT TTTTCAGGAG GGAGGAGGGC TTTCCCAGGT 
AATGTGTCTA GAGTGTTGOG CAGAAAATCT GGGACCACAC CACACCAGTT CTCTCCTTAA 
TCCACGTCAT TTGCCTTCTA TCCCAGCTAT GTTTCCAGTG TCCTCTGGGT GTTTCCAAGA 
GCAACAAGAA ATGAATAAAT CTCTGGTGAG TTGTTTATTT GTTCTTCACT TTGTTTTACA 
CTGTATTTTC TGAGTTTATG GGTGTCTGTG AATTAAAAAG GAAAAGTAGA AATAAGTAAA 
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ACTCAGGTTG AAGGAAATAT ACATAAATAA GATAAAGCTG ACCTGTAGAT ATAGCAQGTT 360 

ATAAfiGCTTA GAtSTTGTCTA AGTTC3AGTGC AAATTTTCCT CTGATCTTTC TGATGCCX3AA 420 

5 

CAAAAAftGCA GrrCATGTTTG TTATGTGATT QGAATGXaAC CX:GAGAAGAG AGCATGCTGT 480 

GTrCTTOTGG GACAGGAAAG CTTGCGTGCA CXrAAGTCTGA ACCACCACCT TCATQGTGAC 540 

10 ATAGATTATG TQCTGGAACA TATTTCACAC CGGCCTGGCA GTAAACACTT GTAaTOTTGr 600 

GCAGOXXAAA CXSGICATCTT CXX3CTAAAGC AC0GCX3TGTT GT0CAGCXX3A AATOGTCATC 660 

TQCTGCTAAA ACACAGCTTC CATCGTAATG TATGCTCCTT ACTCAAAGAG TGTGGTCCCA 720 

15 

AACAGCCTTT GGGAGGTCCT CCTTGATTCA TGGATGAAAC CTQGAACATC TTGAGGACTG 780 

AGTTAACCAT AQGTCCTTAA ATAACTCTCC ACAOGTTTTT CTTAGTTTAT CTCTACATGC 840 

20 AGGGTGTGCA GCAGCCTGTT CAAAGTCATA TTTTCTGGGA AATATTTCCA GTGTTTATTT 900 

GCACTTTAGC CCACTCTGTG TAGCCTTATT. TCTTCTAAAC TCACCATTAA TCTGAATAAT 960 

AGTCAAATTT AGGGGGACTG TATTTGCCTT A 991 

25 



30 



(2) INFORMATION FOR SEQ ID NO: 22: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 
35 (D) TOPOLOGY: lixiear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CCACGCGTCC GGAATTCCCC TGAGGATCTT GGGCTATCTT TGACAQQGGA TTCTTQCAAG 60 

40 

TTGATGCTTT CTACAAGTGA ATATAGTCAG TCCCCAAAGA TGGAGAGCTT GAGTTCTCAC 120 

AGAAITGATG AAGATQGAGA AAACftCACAG ATTGAQGATA CGGAACCCAT GTCTCCAGTT 180 

45 CTCAATTCTA AATTTGTTCC TGCTGAAAAT GATAGTATCC TGATGAATCC AGCACAGGAT 240 

GGTGAAGTAC AACTGAGTCA GAATGATGAC AAAACAAAGG GAGATGATAC AGACACCAGG 300 

GATGACATTA GTATTTTAGC CACTGGTTGC AAGGGCAGAG AAGAAACGGT AGCAGAAGAA 360 

50 

GTTTGTATTG ATCTCACTTG TGATTCGGGG AGTCAGGCAG TTCCGTCACC AGCTACTCGA 420 

TCTGAGGCAC TTTCTAGTGT GTTAGATCAG GAGGAAGCTA TGGAAATTAA AGAACACCAT 480 

55 CCAGAGGAGG GGTCTTCAGG GTCTGAGGTG GAAGAAATCC CTGAGACACC TTGTGAAAGT 540 

CAAGGAGAGG AACTCAAAGA AGAAAATATG GAGAGTGTTC CGTTGCACCT TTCTCTGACT 600 

GAAACTCAGT CCCAAGQGTT GTGTCTTCGG AQGCATCCAA AAAAAAAAAA AAA 653 

60 
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^ (2) INPORMATIC»J FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEE»JESS : double 
10 (D) TOPOIOGY: linear 

(xi) SEQUEI«:E DESCRIPTION: SEQ ID NO: 23: 

GGCAGGCPGA CGACCTQCAA GCCACAGTGG CTGCCCTGTG CGTGCTGCGA GGTQGGGGAC 60 

15 

CCTGQGCAGG AAGCTGGCTG AGCCCCAAGA CCOCOQGGGC CATOGGCQGG GATCTGGTGC 120 

TTQGCCTGGG GGCCTTGAGA CGCCGAAAGC GCTTGCTGGA GCAOGAGAAG TCTCTRGCCG 180 

20 QCTGGGCACT GGTGCTQGCA SGARCTOGCA TTGC3ACTCAT GGTGCTGCAT GCAGAGATGC 240 

TGTGGTTCGG GQGGTGCTCG GCTGTCAATG CCACTGGGCA CCTTTCAGAC ACACTTTC3GC 300 

TGATCCCCAT CACATTCCTG ACCATCGGCT ATGC3TGACGT GGTGCCGGGC ACCATGTQQG 360 

25 

GCAAGATCGT YTQCCTGTOC ACTGGAGTCA TGGGTGTCTG CTGCACAGCC CTGCTGGTGG 420 

CCGTGGTGGC CCGGAAGCTG GAGTTTAACA AGGCAGAGAA GCACGTGCAC AACTTCATGA 480 

30 TGGATATCCA GTATACCAAA GAGATGAAGG AGTCCGCTGC CCGAGTGCTA CAAGAAGCCT 540 

GGATGTTCTA CAAACATACT CGCAGGAAGG AGTCTCATGC TGCCCGCANG CATCAGCGCA 600 

ANCTGCTQGC CGCCATCAAC GCGTTCCGCC AGGTGCGGCT GAAACACCGG AAGCTCCGGG 660 

35 

AACAAGTGAA CTCCATGGTG GACATCTCCA AGATGCACAT GATCCTGTAT GACCTGCAGC 720 

AGAATCTGAG CAGCTCACAC CGGQCCCTGG AGAAACAGAT TGACACGCTG GCGGQGAAQC 780 

40 TQGATGCCCT GACTQAGCTG CTTAGCACTG CCCPGGGGCC GAGQCAGCTT CCAGAACCCA 840 

GCCAGCAGTC CAAGTAGCTG GACCCACGAG GAGGAACCAG GCTACTTTCC CCAGTACTGA 900 

GGTGGTGGAC ATCGTCTCTG CCACTCCTGA CCCAGCCCTG AACAAAGCAC CTCAAGTGCA 960 

45 

AGGACCAAAG GGGGCCCTGG CTrOGAGTGG GTTGGCTTGC TGATGGCTGC TGGAGGGGAC 1020 

GCTGGCTAAA GTGGGKAGGC CTTGGCCCAC CTGAGGCCCC AGGTQGGAAC ATGGTCACCC 1080 

50 CCACTCTGCA TACCCTCATC AAAAACACTC TCACTATGCT GCTATGGACG ACCTCCAGCT 1140 

CTCAGTTACA AGTGCAGGCG ACTGGAGGCA GGACTCCTGG GTCOCTGGGA AAGAGGGTAC 1200 

TAGGGGCCCG GATCCAGGAT TCTGGGAGGC TTCAGTTACC GCTGGCCGAG CTGAAGAACT 1260 

55 

GGGTATGAGG CTGQGGCQGG GCTGGAGGTG GCGCCCCCTG GTGGGACAAC AAAGAGGACA 1320 

CCAITTTTCC AGAGCTGCAG AGAGCACCTG GTGGGGAGGA AGAAGTGTAA CTCACCAGCC 1380 

60 TCTGCTCTTA TCTTTGTAAT AAATGTTAAA QCCAGAAAAA AATAAAAAAA AAAAAAAAAA 1440 
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AACTCGAGGG GQGCCCRKAC CCAATCWCCC TATAGTAKAC GTANNN 

(2) INFORMATION FOR SBQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 2323 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEENESS : doiible 

(D) TOPOIOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CITCGCCGTT TCTCCTGCCA GQGGAGGTCC CGGCTTCOCG TGGAGGCTCC GGACCAAGCC 
CCTTCAGCTT CTCCCTCCGG ATCGATGTGC TGCCGCOGCC GCCGCCGCCG TCCCGCGTCC 
TTCGGTCTCT GCTCCOQGGA CCCGGCTCCG CGCAGCCAGC CAGCATGTCG QGGATCAAGA 
AGCAAAAGAC GGAGAACCAG CAGAAATCCA CCAATOTAGT CTATCAGGCC CACCATGTGA 
GCAGGAATAA GAGAGGQCAA GTGGTTQGAA CAAGOGGTGG GTTCCGAGGA TGTACCGTGT 
GGCTAACAGG TCTCTCTGGT GCTGQGAAAA ACAACGATAA GTTTTGCCCT GGAGGAGTAC 
TTGTCTCCCA TGCCATCCCT GTTAATTCCT GGATGGGGAC AATGTCCGTC ATGGCCTTAA 
CAGAATCCCC CAGATGGCTT CATGGCCCCC AAAGCATGGA AGGTCCTGAC AGATTATTAC 
AGGTCCCTGC AGAAGAACTA AGCCTTTGGT CCAGAGTTTC TTTCTGAAGT GCTCTTTGAT 
TACCTTTTCT ATTTTTATGA TTAGATGCTT TGTATTAAAT TGCTTCTCAA TGATGCATTT 
TAATCTTTTA TAATGAAGTA AAAGTTGTGT CTATAATTAA AAAAATATAT ATATATATAC 
ACACACACAT ATACATACAA AGTCAAACTG AAGACCAAAT CTTAGCAGGT AAAAGCAATA 
TTCTTATACA TTTCATAATA AAATTAGCTC TATGTATTTT CTACTGCAOC TGAGCAGGCA 
GGTCCCAGAT TTCTTAAGGC TTTGTTTGAC CATGTGTCTA GTTACTTGCT GAAAAGTGAA 
TATATTTTCC AGCATGTCTT GACAACCTGT ACTCTTCCAA TGTCATTTAT CAGTTGTAAA 
ATATATCAGA TGTGTCCTCT TCTGTACAAT TGACAAAAAA AAAAATTTTT TTTTCTCACT 
CTAAAAGAGG TGTGGCTCAC ATCAAGATTC TTCCT6ATAT TTTACCTCAT GCTGTACAAA 
GCCTTAATGT TGTAATCATA TCTTACGTGT TGAAGACCTG ACTGGAGAAA CAAAATGTGC 
AATAACGTGA ATTTTATCTT AGAGATCTGT GCAGCCTATT TCTGTCACAA AAGTTATATT 
GTCTAATAAG AGAAGTCTTA ATGGCCTCTG TGAATAATGT AACTCCAGTT ACACGGTGAC 
TTTTAATAGC ATACAGTGAT TTGATGAAAG GACGTCAAAC AATGTGGCGA TGTCGTQGAA 
AGTTATCTTT CCCGCTCTTT GCTGTGGTCA TTGTGTCTTG CAGAAAGGAT GGCCCTCATG 
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CAGCAGCAGC GCXIAGCTGTA ATAAAAAATA ATTCACACTA TCAGACTAGC AAGGCACTAG 1380 

AACTOGAAAA GACCACAGAA AACAAAGAAT CCAACCCTTT CATCTTACAG OTGAACAAAC 1440 

TGTGATGATG CACATOTAT6 TGTTTTGTAA GCTGTGAGCA CCX3TAACAAA ATC5TAAATTT 1500 

©XATTATTA GGAAGTCCTC GTGGCAGTCA AGAAGCACCC AGGCCACTTG ACTCCCaGTC 1560 

TQGTGCCCTG TC?rACACX:AG ACAACACAGG AGCTGGGTCA GATTCCCCTC AGCTGCTTAA 1620 

CAAAGTTCCT CX5AACAGAAA GTGCTTACAA AGCTGCCTTC TCGC3ATACTG AAAGGTCXiAG 1680 

TTTTCTGAAC TCCACTGATT 'PTATTGCAGT TGAAAAAAAA AAAAAGCTAT TCCAAAGATT 1740 

15 TCAAGCTGTT CTGAGACATC TTCTGATGGC TTTACTTCCT GAGAGGCAAT GTTTITACTT 1800 

TATGCATAAT TCATTGJTTGC CAAGGAATAA AGTGAAGAAA CAGCACCTTT TAATATATAG 1860 

GTCTCTCTGG AAGAGACCTA AATTAGAAAG AGAAAACTGT GACAATTTTC ATATTCTCAT 1920 

20 

TCTTAAAAAA CACTAATCTT AACTAACAAA AGTTCTTTTG AGAATAAGTT ACACACAATG .1980 

GCCACAGCAG TTTGTCTTTA ATAGTATAGT GCCTATACTC ATGTAATCGG TTACTCACTA 2040 

25 CTGCCTTTAA AAAAAAAAAC CAGCATATTT ATTGAAAACA TGAGACAGGA TPATAGTGCC 2100 

TTAACCX3ATA TATTTTGTGA CTTAAAAAAT ACATTTAAAA CTOCTCTTCT GCTCTACTAC 2160 

CATGCTTAGT GCAAATGATT ATTTCTATGT ACAACTGATG CITOTTCTTA TTTTAATAAA 2220 

TTTATCAGAG TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 2323 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 25: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 683 bcise pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

45 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGCACGAGCC TGTGTGGTCA TGTTCCTCGT GGTQCAGTAC CTGACATGAG CCAGCCACGC 60 
50 TCAGTQGCTG AACAGCATTC CCACAGCCTG CAAGTGTGTG TC3TGTGTGAA AGAGAGAGGG 120 
GGGCCCAGAG CCGCCTTTTG AAATGTTTGC CTGTCTGAAC TGTGAAGACA CTTGGGAGTC 180 
ATTGTQGTCT AATTTCCAAC CTGCTCTGTT TTCTGTGACA TCTTGGAGGG GAGCTAGTGC 240 

55 

CACACCATGC GCGGTGCTTA GAAATGAAAA AGTCCCGGGT CTGTCTCTCT CACTCTCGCT 300 
CTCATGGGGG AGGGAAAGAA TGGCTTTGGT GGCTTTGTTC ACACAGCTGA TGCGTGCTGG 360 
60 GAAGGTGTCC ACAGTGAGCC TGTGTGCAGG ACTGTCCACA CGGTTCACAC TTGTCACCAT 420 
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CAGGCCTTTC TGGTCCTGAT AGGGTGGAGC AAAAGTGGAA AGGAAAGGAA AGAGGCTTTT 480 

CTCACAGOCA TTATATTAAA TAGTAQGTCG ATTCACATCT CX3TQCTCCTG GCCACXTTTCC 540 

5 

CCTGTGCXrrC AGTGACATGT AGATGACTGA CTGCCAATAC TTGTCAOCAT TCCCTGGAAG 600 

CAGCTACCTA GGQGAAACAA GATGTAGTGC TATTGCOGAT AACAAGTAAG ATTTTCCACA 660 

10 CTAAAAAAAA AAAAAAAAAA AAA 683 



15 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2036 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEE«QESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTIC»l: SEQ ID NO: 26: 

25 CTGAGAAAGG AAAQCATTCG GATCTGCTC3C AAAAACACAT ATATCCATAA AGACTCATGT 60 

TATTCAGAAA ACAGATTGTG AACACAATCA CATTCGCATG AATCdTTAA AAGGAAQAAG 120 

ACCTTAAAGT ATCTGCAAAT CTGAATTTCT ATTTATTCCT TCACTGAATA TAGAAACAAT 180 

30 

GGTTATCTGA TTATTAGAGA TATTATTTTG GATATGTTAC TTATTAACTT GCTATGGCTG 240 

GTAACCATGA TAAAGTCTGT TATTAATAAC AACATAATTC TTTTTTTAAA GAAGAAAAGC 300 

35 TTATTTTTCA TTGACAGTGT ATAGATTTAT CTACTTAGTT GTGTTTTGCT ATTAGTGTTT 360 

TAATTTTTTT TTTAAC3TTGA GTGTTTGATA AATTTTAAGA CCCTGTCCCC ACCTTGTTTT 420 

G7VGTCCTGTG TTGACTACAG GTATATAGCY CWWTTTAAAA ATCCTAAAGC AAAAGAATTT 480 

40 

TATTTATAAA AGAATCMAMC MGTTGCATGC ATGAGGCTGT GAAGTCAGAT ATTTAGTAAT 540 

AAAAGCAGCA GTGCCTTTTT TTGTATTTAC CCATTGACCC CCACCAAATG CAACTGTTTT 600 

45 ATATTAAGAA AATAGTAACA ATTTTAAAAT CTCAGAGTAA AATCTATTTC ACTACATGCT 660 

TTTCCCCCCT TGTTCTGATT TAAGCAGTGT GTACTTGGCA TCTCTACATT GTCCTAQGGA 720 

CAGTGGTGTT CTACAATATT ATCATGTATG ATGTTTTATT GGTGCTTTTT ATTCATAGTG 780 

50 

GCTTCTTACC AGAAACAGTA GGAAGAAACA CATGAACTGT GTACAAGACA TGAAACATTG 840 
CTGCTGATAT G T r &TTTTTT CACATGCTTT TGAGTTTTCA CTTTTTAAAC GAGAGCCAGC 900 
55 AAGCAAAATA GATGTGGCTG QGTCTGCCTG TCCGGGCGGC TYTTTGCACC GAGCTCTCAA 960 

ATCCTGTGTA TTGAGGGTTC CTTTTTGGTA CTCAGGATTG GAGCTACAGC TGGGCCCCCC 1020 

TCTCTCCCAT TCGTTTGAAG AGACACTGAG GGAAACAAGG GTTTCTTTTG AGGTGTCCTT 1080 

60 
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GGCTGCCTTT TACGGGATGG GAGCCTTCTC COSATCTTTT GTTCTTCTGC ACCTCTTGTA 
GCTACTQCCG GTGCAAGGTT GTAGATGTTA TTCCCCADGA GCXTPGOGCTK GGGGGCTGAG 
CTGGGCTGAA TGCAAAAGCA TGCAACCAGA AOGOOOQCAA GGGGAGGAAA AGCAOGCCT6 
GOCTCATTQG TCCCCTOGAG ATC3TCTOTAG CAGTCAGCTC CAGCTTGGGC CTGC3GGAAGC 
AGCCTGACCA AOGCGCTCAG GTGTGCCTGT TACAAGAAGA ACCTOCAGAA OGATAATTTG 
CACATGC3AGC TGTGATAACA CTAATGTTGA ' IViTrnTiT TTTTACAAGT CATCAGRGAT 
GTTTGCAAAG TGAGTPITAT TTTTTTGTAA TTCCTTTATC TTTACTTAAA GGTGAATGTG 
TATTCCTCTG GGAGGAATAG GAAC5AAAACA GGAATGTTAA TAATGTCGAA CAGAAAACTT 
CCTCCCTTAT TAATATATAA TCYTCATGTA TTTATGCCWr AATGTAAGCT GACTTTTAAA 
AAGCTTTCTT TTGTTGCATG CCXTTOTGCAG GCATCTGTAT TGTACATGCA TGCCTTTCGT 
CCTGTTTTCC TGTATAAAGT TAGTGAACAA AGAAATATTT TTGCCCTAGT TCATGTTGCC 
AAGCAATGCA TATTTTTTAA ATTTGTCATA TATQGAAAGA GCATGTTTGT TACATGTAAA 
AGCTTTACTG ATATACAGAT ATACTAATGT TTGAAGATGC TOTTCTTTGC AAGTOTACAG 
TTTTCAAATG TTGTTACCAG TGAAACACCC TTGTGGTTTA AACTTGCTAC AATGTATTTA 
TTATTCATTT CCTCCCATGT AACTAAGAAT CATGGCTATA TTTCATATCA ACGTTATATT 
GAAAGTGAAG GGAAATGATT AATACAA6GT TTTGTAACAA AAAAAAANAA ANNAAA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) ST^ANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGCACGAGAT AACATAGGCA CAATAATACT GTATGTCTAC TTCTAGGATT ATAAGGAATT 
AACATTGAGA TGACATTTCC ATTTGAGAAG AAAATAGTTG CrTTCAGTGC CTTTTATTTG 
ATTCCTGGAG AGAGCAGACT CGCACCAACA TTCAACCCCA GCGCTGATAT GACAGTAATC 
CTCAGAGGCA GAGCCCAGCA CAAAACAGCA ATGCTAGAAA GTTACAATTG GAAAGTTTCC 
TGCCAGCTTC GGGAATGACA CTGCAAAGCT GATGCCAGAA ACTGCCAGAG TAATTCTCCT 
CATTACTGCT CTACCCACCC ACTTTCAGCT CCCCAAATTA ACTAGTGCAG TTGACTAATC 
CTCTTTACCT TTATCATTTA GGTGAGGCAT TGCACAAAAA CTCTCGACTT TGCCATATAA 
GGGCTGTGGT TCTCTGTQGT CCTGGATAAG AGGCATCACC ATTATCTGGA AACATGCAGT 
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AAATGCAGAT TCTTCATCTT CTCCCCAGAC CTCCTGAGTT AGAAATTCAC AAGTTCrCCA 



540 



GGTGATCTCA TACATGCTAA AGTTTGAGAA CCATTGAGTA AAGTTAATGC ATTAAGAAGA 



600 



GATTAGATAG GGATQGTCGC GTATCTTCCT ACAGTTTCCC TGTTAACAAG AAAGTCAGAG 



.660 



GTCAGTTGAT CAGACATTAG ATTATTTATT GCTAAAACTA AAAAAAATTA AAAAAAA 



717 



10 



15 



20 



25 



30 



35 



40 



45 



50 



(2) INFORMATIC»a FXDR SEQ ID NO: 28: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANE3EE3NESS: double 

(D) T0P0UX3Y: linear 

(xi) SEQUENCE I^CRIPTIC»I: SEQ ID NO: 28: 

GAATTCGGCA CGAGCAGCAT CCTAATTTTA GTTTGGAGAT QCATTCTAAA GGATCTTCTC 60 

TATTGCTTTT TCTCCCACAA TTAATCTTGA TTCTGCCTGT CTGTGCACAT TTGCATGAGG 120 

AACTGAACTG TTC3TTTTCAT AGGTAAATGA GAGACTGAGT TTTTTCATTT CTCAAGAGAA 180 

AGGGCATTTG CTCCTACAAG CTC5AAAGGCA CCCCTGGGTG GCTQGGGCCC TCGTQQGAGT 240 

TTCTGGGGGA TTGACCCTTA CAACATGCAG TGGCCCTACA GAAAAACCTG CAACTAAAAA 300 

TTATTTTTTA AAAAGGCTCC TCCAGGAAAT GCATATAAGG GCTAATCACC CAGTATTTTG 360 

ARGCTTCGAA GARGTAATAR AMCCCTGGAG AGAGAAACTG AGACATGTAA GAGGGTGGGA 420 

ATGACTCAGT GGTGGCACAC TATGGAGTCC TGCCCACAAG TAGCACACAT CAACCCACTA 480 

CACAGAAATC CTAQG 495 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

AGCTTAACGT CATGATTCAT TAGGGGAATG CAAGGCAAAA CCATGATGAG AATGCCCCTA 60 

GACACCTCTT AGAAGAQCTG CTAGAAAGGC AGACAGCACC AAGCGCTTAA ATGAGATQQG 120 

GGCACTGGTG CTTCTTCTGT GCCTACTGGT AGGGGTGCAG CAGAGTGGTT CAGTCTGGGA 180 



60 



CAGTTAGCTG GACATCACGT GGACCCAACA CACGCATTTC CTGQGTTACT TACCAAGGAG 



240 
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AATAGAAAGC AGQCAGATCT TTACAGCAGC TCTTACCTGW TTGCAAAACA ATC3GAAATGC 
CXyvCATGTCC ACAAACAAGT KTGTGGTCTG CCTGTGCXIAT GAAOCACAGT GTGGCTGAGC 
GTCAAGAGTC CCCACACTCA AAGGAGGCAG CAGATACAGG GCTQCACACT GTGTGATTCC 
ACACATGTGA CATTCTGGAC ACGGACATGC TGGATGGCAA AACGAGCATC GGGCTGAGAG 
GACTGCTGAG AAGGGGAACG GGGCTGCTGG GATGTOGGTT GATTGTAGCA GTAGCTCATG 
GAGATGTGAC CTCAAA 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CTAAATGGTG ACTGTQGCTT TGTCGAGACA QGCCCCAAAT GGTAGGTGTG AACACAACAT 
QCACAGAATG AGGAGACATG CAGAGTGCTG AAATACTGTC CTGGACAGAT GTGTTACATG 
ACTTTCTTTT CAGCTTATTT CTGTGGCCTG CCTTTGAAGA TAGAGCTTTG TTQATATTTA 
CATTAAACCA AATTGTATAA YTATGTTCCA TTCTGACATG TTATTTAGCA AARGAAAAAR 
GAGTAATTCT ACATCAGCAT CTTTAGTGCA TGCTAAAAGA TrAAAAATGT CTTTTGGGGA 
ACATGTTTTG TATACATAAA TGTTTAGATA GAAATATTTA TAGAATNCTC TATGTGAGTA 
TTNATCTCCC TATGTATATT TATATCTAGA TGTGTCAATC TTTGTATTGA TATGAAATGC 
TATGAATAGT GAGA 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CCACGCGTCC GATCTCACAG CTCCGACACT ATTGCGAGCC ATACACAACC TGGTGTCAOG 
AAACGTACTC CCAAACTAAG CCCAAGATGC AAAGTTTGGT TCAATGGGGG TTAGACAGCT 
ATGACTATCT CCAAAATGCA CCTCCTGGAT TTTTTCCGAG ACTTQGTGTT ATTGGTTTTG 
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CTGGCCTTAT TGGACTCCTT TTGGCTAGAG GTTCAAAAAT AAAGAAGCTA GTGTATCCGC 
CTGGTTTCAT GQGATTAGCT GCCTCCCTCT ATTATCCACA ACAAaXATC GTGTTTGCCC 
AGGTCAGTOG QGAGAGATTA TATGACTGGG GTTTACGAGG ATATATAGTC ATAGAAGATT 
TGTOGAAGGA GAACTTTCAA AAGCCAQGAA ATGTCAAGAA TTCACCTGGA ACTAAGTAGA 
AAACTCCATG CTCTGCCATC TTAATCAGTT ATAGGTAAAC ATTGGAACTC CATAGAATAA 
ATCAGTATTT CTACAGAAAA ATGGCATAGA AGTCAGTATT GAATGTATTA AATTQGCTTT 
CTTCTTCAGG AAAAACTAGA CCAGACCTCT GTTATCTTCT GTGAAATCAT CCTACAAGCA 
AACTAACCTG GAATCCCTTC ACCTAGAGAT AATGTACAAG CCTTAGAACT CCTCATTCTC 
ATGTTGCTAT TTATGTACCT AATTAAAACC CAAGTTAAAA AAAAAAAAAA AAAAA 

(2) INFOPMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SBQUEtKTE DESCRIPTION: SBQ ID NO: 32: 
GAGCCAGTGC CGGCGAAAGG GGACCTTCCT CTACTTCCTG CCACAGACCC TGTCCCCACA 
CACTTCCTGC CCCTGCTCTG CTGGGAQGCC ACTTCCTCCC CCAGTGCTQG ATTCCACCCC 
CAGCTCACCC TCAAACATGG CCCCCTCTCT CCTCCTGCTT GCCCCTCTCT GCTCCCTGGA 
GGCTGTTCTG TCCTCCCCTC TTGAAAAGCA ATGCCAGCTT CCTGGGATCT TCTGCCAACT 
CCAGCTACCA TOCCCTTTGC TCCTGTCAGC TCAGCTCCTC AAQG6AATTG TCTAMCCTCG 
GTGTCCTGCT TCCCTCCCTC AACCTCCTCA CCCTGCTCCA AGCTC3GCATC TGCCCCTCCA 
CTGCACAGAA CGGNTCCCCC ACCACCTGCC TTTACAGGGA GGAAGCAGCA ACATGGAAGA 
ANCGAACTAT AGQGGCTACA ANGATGCTCA GCTCTGATCC CGAAGGCAAA AAGNATCTTT 
GGGCAC 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 725 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DKSCR1PTIC»J: SEQ ID NO: 33: 
GITCCTCTGG TAATAATTAG GTTATTCCCA GAAGCACAGT GTCATTCTTT AAATAAAAGC 
•mCCTGTTT AAAGCTTTTC AAAGGAGCAG ACCACCTTGA AGATTCCCCX: TAGGGTTGAT 
ATOTGTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA TTTTAAAGCT TTGGCTATAT 
AGTCAGAAAT GTCCTAAATA ACAAACTATT TTCTATTTAA TTTAGGGAAG ACTAAAC3GGA 
AGAAAAATGA AAACTCAOTC TTTATGTAAG CTCCAAQGAT ATTAGOGCTT AAAGGGCTTT 
TCTAGTTTTA TQAGAATTTG TACTACTGAT TTTTATATAT TCCTGTTTTT GATGAACAGA 
TCTCTQGGGA AATTGTTGAG TTAGAAOXSGC ATTTCACTGT GATCCCTCTC AAGCTCAGAT 
CAGTTCTATA ACCCAATGAC AACCTGTCTC TrPGGTTTAC TGTCCTGTGA AATGTCAGCT 
CAAGTTTCCC AGAAGTCGTG TGTTTATGAT GAQTCAGAGT GCTTTTCCTC GGTOGGACAS 
TTGCTGQCCC TCTTAATTTT GGTGTATGTG CTTCCAAGTA TCTAAACCTC CAGTCTGATC 
TGTATATGCT ATCCTAACTG TTAATTOTAT TATTGAITAT GTTGATTATC TTGCTTGAAG 
GTTCATACTT TTCAATTTGA TASAAATAAA GTTTrTTTCT GCTTATAAAA AAAAAAAAAA 
AAAAA 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CACACAQCAT GCTGCCCTCA GACGTGTCCA TCCTGTACCA CATGAAAACG CTGCTQCTCC 
TGCAAGATAC TGAGAGATTG AAGCATGCTC TGGAAATGTT CCCAGAACAT TGCACGATGC 
CTCCTGCTTT TATTGGCTCT TGTCGAAATC AAATTGGAAG ATdTCAGTC CCAGCTGCAC 
CCAAOGTGGA AAAGTATTCC AGGTCCATCC CCAAGGAACC AACACCGATG ACATGGACTC 
AGGAATCTTA TAACCTACGT GGACTCTTTC CATCCGTACA TTGTCGTGCA CATGCCACTC 
ATCACCTGGC GTGCCCAGAT CCTCGCARGG CAACACCCTG TGATAATTCC AGGTGATTCT 
CTACATCTGC AGCTTGAGGT TAGCCTCATA TCACATTACA TTCTCACTAN AAACNAAAAA 
AAAAAAAAAA AACTCNA 
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(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIcrrH: 943 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDECNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

C3GCACGAGCT GGAACAGAGA CTAAATCCCA CGAAACTGAC ATTGTTAAAC ACACTAAAAC 

AGAAGTACTT ACCTCTTGAA GATTTAATAT ATAATGGTT6 ACATGATACA T6TACATGAT 

GAATGACCAG ATGCTTATGG TCTACATTTT CCTPTATCCT GTTAGTATTA CCTTCCTTAA 

TCTTTGTTCA TTAACATGCT AATTCCTCTT CAGTGTTTAT TTTCTAGTGA CAGAATGCTA 

ACATTTCTTA CACCCTGGCA GAAGGGAGAG AAATGTGTTT TGGGGTGGGT AACTAAATTT 

1TGAGTGAAA TATCATAAGA TGANAATGGA AANAAGGAGA CACAAANAGT TATNACAAAA 

AAACAATGGT TTnTTAGCC ATTTGACTGG CTCITTAAAT AGTCTACAAfi ACATTCACGT 

TTAACATCAC TTTTAGTGAA ATAAAATGTG CCATACTAGT ATGTGCTTCA AAAGGGCAAA 

TGTQCTTTAG TQCCCTAAGG CTAAATTTTG GTCATTTGAC ATCAGAGATG TTGTAAGTAT 

TGCACTTAAT ACGCACCTAT T'DTPCAATAG TGTTATTTTT TGGNTAGCAT TTTTTTTACC 

ACTATNTTGT TGATAGCTTT TTGTTCTNTN AGGTTGNAAN ATGACAGTGC TNATNTCAAA 

CAGATTACCC ATNTQCAGAA CTAAGGGAAG CNATTTATGT ATGAAAGNAA TTNTTGAATT 

NGTCATTNTTC AACCNTTGNA TTAAAGCTTA GACTAAATAG TAATATATTIG TGOGNAGGAT 

TTTGGTTTTG TGATATTTMT GTGNATTAAG GNATAGATGT TAACCOTTAT TTTGTAlC2JAA 

AGTGANTTGT ATGTGGTTAA TTATAAATAA AACTGGTACC AGGNAAAAAA AAAAAAAAAN 

NAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GGCACGAGAA ATCTTCATGC TGTAGTCACT CCAGACCATG GAGTGGCTTT CCAGCTGAAT 
GAATCCTATG TCTCGCGTGC AGGTGGTTGG TTTTCAATGT TCTTGCTAAT TTTTTTTCTA 
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TTGGATCTTG GGAGTTTTCT TTGTTTGCTC CTGTGTTTGC CCAGCTTTAA TAAAACCAGG 
CGCAAACAAA AACCATAGCA TTCTGAACAA TAGGGGGCCC ACATTGGACC CAGTATGTCA 
CTTTAATGGA CTTCAAGAAA AAATCTGAAT GGGAAAAATG ACACTAOGA/l TGTATACTCC 
ACACATTTTA TGCCATATAA TQGTGTGTTT TCTTAATTTT GTTTCTTGTG GCGAAATGTG 
GCTTTCAAAT TAAAATGACC ' rm X lTXCIT TGAAACTTTT TGamTGACP TGTATAATTA 
AGQGTTTGGA AAGATTCATA ATTCTGAGAG AGGTTTQCAA CCAGGAGATA CAAAGAAGTC 
TCAGTAGTAA TCTTGTTCAT GTGCTTTTAC AGCCAGCTAC ATTTAASGAT GTATTAGTTA 
CAGAAATTAT ATGTCTGTGT ATGTGTCTCT ACTCAATAAA GTACATOCCT CCACAAAAAA 
AAAA 

(2) INFORMATION FOR SBQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTXCS: 

<A) LENGTH: 349 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GTGAGTGCCC GGGAGCCCCG AGGCCCTGCC CCTAAGAAGG ATATCTYTRA CCGCTCCCTT 
GTCCACACCC TAACCCCCCA GCTGCTCAGG CAGTGGGCAC ATGGCAGGGG CCTCACTGGG 
GGCACATAGA GCATTTQGGG GACTGCGAGT GCTCACCTTT GACTTCCTGC AGGTCGGGGG 
AAAACCAGAT CATGAT6ACC AAAGTYTACA TATTCTTGAT CTTCATGGTG CTGATOCTGC 
CCTCCCTGGG TCTCACCAGG TATATGCCAC CACYTTCTGy TCTAAATTCA GAATAAGAGT 
CACATCAGGA GAGCACTGTC CCCAOGANAA TGCAAACGGG TTGGCAGCA 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
GTAGTCGTTG CGGTTGCCGG GATGGCGAAG ATCTCGCCGT TTGAAGTCGT AAAACGCACC 
TCGGTACCGG TGCTTGTTGG TTTGGTGATT GTWATCGTTG CTACAGAGCT GATGGTGCCA 
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GGAAOGGCAG CAGCX^GTrCAC AGGCAAGTAA ATAGTAATGC CGGAOCAAGT TTCCTCCGOC 
TTTATCATGT CACCCACTGT GGTATATGCG TTGTGGTCTG CXIAACTTTGC CX3TGAACAAT 
TTCAGCAATA ATCAGATGGC GQCTQGCGCA ATATTCAAGA TAACGCCTGG CAGTGGTGCX3 
GCTGATGGTT CAGTQCCTGC GSCACCGTTT YTGCCXTTATG TTGCACACCA GGNTCTTTAA 
ACAGTTTTCG SACXXXCTTT AGCQTCAAGG GTTCAATGCC GGTCQGTAGC TCGTCCTTAG 
GTTCACCX3CG AGCATAAGCA TTAAACATCT CATCAATTTG CITPCTGGCTG QCGCTATCAA 
TACTTTCCAG CATATGTTTA CGCTGGCGGA AACGGGTTAG CGTTTGCCCC ARCMCMTTCAT 
AGGCAATGGG CTTAATGAGA TAATCAAATA CACCACAACG TACGGCTTCA GACACCGTTT 
CCATATCGCT GGCTGCAGTG GTAAACACCA CX3T0GCCGGG ATAATGCXjCC TGCACCASTT 
CATGCAGTAA AT 

<2) UJFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: X908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(P) TOPOLOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGAGTTGATA TTTITAGAAA CAGTAATTTT ACTTTTAAGG AAATTGGCTA GCTCTTTGAC 

TNNAGAGCTG TAGGAAGCTC AACATTTCTT TGTAGAGAAC GTTGCTTTTT TTQGATTGTA 

CAGGTATAAA AACATTGCTT TTGTTGAATT GTATAGGTGT AAAAAGQGAA TAACTGTATG 

CAGGTTTGAA AAGGAAATGT GCTTTAGGCA TGAGTCATAA GATGCCATTG TACTTGTAGG 

CATTTTATTT TCCTTTAGAA ATGGACATCA GCTCTTCTCT TCTGACTGGT AACACATAGC 

CCCAAAGCAT GAGATTATTT TTCATTGGGT TTTTATTGTT GTTTAGTTTT GGTTTGTTAC 

GCCAGCCCAG TCTGTCTGCG GAACACTGAC TCTGCTCTCT AATGAGAACA AAGTTAGAAA 

TCTGCCGATA ACCTAAAATA ATTTAGAAAT GAATTAAAAA TGTGAAATCG GGTTAAAGTG 

ATGATGATAA AATAGCATGC AAGAAACAAG CTCCTTCCAT CAGACTTQGC TACTGTTTTC 

TTCTGGTACG ATTTGGTTTG GAAGAGCCTC TTGTTTCCTT CTCTTTGOGG TATGTCTTCG 

TTTCTTAATA TGTTTGTAAC ATTATTGAGA TATAATTCAC ATACCTTACA ATTCACTTAT 

TTTAAGGGTA CAATTTAGTG GTTTTTAGTG TATTCACAAA GTTGTGTAAC CGTGACCACA 

GTCAATTTTA GAACATTTCG TTACCCCAAA ATVGAAACCCT GTACCCTTGA GCAGTCACCT 

CTCA1TTTCT CCCAGTGCCC ACCCCATCCC CGAGCCCCKG GAACCACTAA TCTATTTCTC 
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TCTCTGTAGA TTTOCTTATT CTGGTCATTT CATATAAATG GAATTCTACA ATATTCGGTC 900 

TTTTGGGACT GGCITCCCAA ATATGATTTT CTATATQGAG TGAQAAAATT CTTCTCATCT 960 

5 

TGAGAACTCT TATTC3CTGTG AAAGGGAC3TG GTTGGTAAAA TCAATAGATT TCAGGCAAGA 1020 

GGGCCAGkTA CCTAACAGGT TTTTCTCCGT GAATCTTATG CTGAGTAGTT TTTCCTCATA 1080 

10 ACCAAGCATT TATGATATAT TACTACTTAT AATACTGTGG CTAGTCTCTA GAATGGATGT 1140 

TGAAATCTTT GCCTCCTCAG TCGGGAAGAG TCCTGCTAAA. AATCAGGCTA AAAATCAGGC 1200 

CAAAAATCAG GCCAAATGAC TTGGCAAATA ATTGACAAAG TGGTTTTCAC QTGTGTCTAT 1260 

15 

CTTTGCTAGC AGCTTGTATA CCTCAGGCCA GGTGAGCTCC CCAAATTTCT TTTTTCATTT 1320 

ACrCCAGTGA GTTTCTGCTC TCTTTTICAA GTATGTACCA TAGGACTTAA AGGTGATTTG 1380 

20 GATGOGTTGT AACACTGCTA AATATGCTAA GTACAGAATT TTATCTACAG TACTGTGAGA 1440 

CAGTCAATTA TTGCCTAGGG TAGTTCAAAA ATATGATGTG AGCTAGTTAA GCCTTTQCTT 1500 

GACTGArrrC AGrrCATATTC AGAAGTOTGT ACCAATCAAG GCTCTTTAAA ATACGGAACX5 1560 

25 

ACTCACTTAA TAACCAGGGA ACCAGCCAAA TACTGTGCAG CCGCAGAATA TGCATATCAA 1620 

TGAGTTGGAG GTGATTATTC TCTGTAACTC CCTAATGATT GTTTTCTAAG CATTGTGGCT 1680 

30 TCTCAGTGGC TTGACAGCAT CTTCCTGGTT GTATGTGGCC TGTTTACATG ATGTATTGAA 1740 

TAATGrrTGTT TGITOTGAGC ATCAATGCCT GTAACACCAA ACTAAACACG TGTTTTTGQG 1800 

ATATGTTTCC AATCTTTAAA TQACCTTGCX CTGTCCAATA AATAAATGAT TGTCTCACCC 1860 

TCTTTAAAAAA AAAAAAAATT AAAAAAACTG GC93GGGGGOC COSGTACN 1908 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 40: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOIjOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CCTCAAAAAA AAAAANGAAA GGAAAGAGGT CPCTACACAA GCCOGTGATT CTTCATGGCA 60 

AGGGATAACA TCAGAAATGT TTCATTTYCK GCTATTAGTT TCCATTCCTT TCCCCATCCA 120 

55 GGCATAAAGA GAAACAAAAG ACAATGATGG TATTCTCTGT GTCCTCAGCT TTGGCACTTT 180 

TGTTGATGTT GCTAAGGAGC AGTGACCTTG CTAAAAAGAC TGAATAATCC ACCCACTGAA 240 

TAGCTAACCT GGGGAGGAAA TGAAAATTTC CTTTGTGGAT CTCCCCAAAT CCATTGTTGT 300 

60 
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CACCAQGCCC TCCCAGAACC TCCTCAGrTC CTTCACAGTG CAACCCTGTG TACTTGOCXC 360 
GCAACCCAAT AGTATTGTOC CTCftCTTCAC CTTCCATGQG CAACTGCCCT CXCTTCTGGA. 420 
5 CATAAAACCT CATATTTTAA ATNAAGTTC3A AATTTGAA 458 
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10 (2) INFORMATION FOR SBQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1153 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANIffiDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
20 GGCACAGAGC CTCCGACCCA GGTGGTCTGG AGCCTGCCGG GAGAGTGGTG GCATCTGAGA 60 

GGCK3GTCGT GGACTGTGGT TGQGGGAGGT GGGACXTTCTT TTAACCGTGT GCCCCCTCTC 120 

CTGTGCCGGC GTGGGCATCC CCCGGGGCAG TGGAACCCGG QCGCTCCTCC AGCTTCOGAG 180 

TCCAQCCAGC CTGGGCGCGG GQCGCGCCCC GAGACACCCG AQGAGTCCGT TCCTCCCTGG 240 

TTACGTOGAC TGTGGAGCTG GTCTCTTGTG GCTCAGOGCC GTGCGGAGOT TGAAGCX3TAC 300 

30 CTCCGGAGGT CGCACCAGGG CGIGAGGAGG AGGAGGAAGG GCATGAGCCG AGCTTGflGGA 360 

ATCCGTGCTC CAAACTCTAC ACTCAAGGAT QCACTOOGCA ACTCTGGTGG OGATGGGCTG 420 

GGGCAGATOT CCTTGGAGTT CTACCAGAAG AAGAAGTCTC GCTQGCCATT CTCAGA06AG 480 

TGCATCCCAT GGGAAGTGTG GACGGTCAAG GrrOCATGTGG TAGCCCTGGC CACGGAGCAG 540 

GAGCGGCAGA TCTGCCGGGA GAAGGTGGGT GAGAAACTCT GCGAGAAGAT CATCAACATC 600 

40 GTCGAGGTGA TGAATCGGCA TGAGTACTTG CCCAAGATGC CCACACAGTC GGAGGTGGAT 660 

AACGTGTTTG ACACAGGCTT GCGGGACGTG CAGCCCTACC TGTACAAGAT CTCCTTCCAG 720 
ATCACTGATG CCCTGGGCAC CTCAGTCACC ACCACCATGC GCAGGCTCAT CAAAGACACC 780 
CTGCCCTCTG AGCGTCGCTG GATCTCTGGG AGCTCCTTGA TGGCTCCCAG ACCTTGGCTT 840 
TTQGGAATTG CACTTTTGGG CCTTTGGGCT CTGGAACCTG CTCTGGCSTCA TTQGTGAGAC 900 
50 TTOGAAGGGG CAGCCCCCGC TGGCTTCTTG GTTTTGTGGT TGCCAGCCTC AGGTCATCCT 960 

TTTAATCTTT GCTGACGGTT CAGTCCTGCC TCTACTGTCT CTCCATAGCC CTGGTGGGGT 1020 

CCCCCTTCTT TCTCCACTGT ACAGAAGAGC CACCACTGGG ATGGQGAATA AAGTTGAGAA 1080 

CATGAGTTTC GGCTGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1140 

AAAAAAAAAA AAA ^^^^ 



35 



45 



55 
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(2) INFORMATIC»I FOR SEQ ID NO: 42: 

5 (i) SEQUENCE CHARACTERISTICS: 

<A) LOOGTH: 1983 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIFTtOT] : SEQ ID NO: 42: 
GGCACGAGAG GGGCCGAGCC GACAAGATGT TCTTGCTGCC TCTTCCGGCT GCG0GGCX3AG 60 
15 TAGTCGTCCG ACGTCTGGCC GTGAGACGTT TCGGGAGCCG GAGTCTCTCC ACCGCAGACA 120 
TGACGAAGGG CCTTGTTTTA GGAATCTATT CCAAAGAAAA AGAAGATGAT GTGCCACAGT 180 
TCACAAGTGC AQGAGAGAAT TTTGATAAAT TGTTAGCTGG AAAGCTGAGA GAGACTTTGA 240 

20 

ACATATCTGG ACCACCTCTG AAGGCAGGGA AGACTCGAAC CTTTTATGGT CTGCATCAGG 300 
ACTTCCCCAG CGTQGTGCTA GTTGGCCTCG GCAAAAAGGC AGCTQGAATC GACGAACAOG 360 
25 AAAACTQQCA TGAAGGCAAA GAAAACATCA GAGCTGCTGT TGCAGCGGGG TGCAGGCAGA 420 
TTCAAGACCT GGAQCTCTCG TCTGTGGARG TQGATCCCTG TQGAGACGCT CAGGCTGCTG 480 
CGGAGQGAGC GGTGCTTQGT CTCTATGAAT ACGATGACCT AAAGCAAAAA AAGAAGATGG 540 

30 

CTGTGTOGQC AAAGCTCTAT QGAAGTGGGG ATCAG6AGGC CTGGCAGAAA GGAGTCCTGT 600 
TTGCTTCTGG GCAGAACTT6 GCACGCCAAT TGATQGAGAC GCCAGCCAAT GAGATGACQC 660 
35 CAACCAGATT TGCCGAAATT ATTGAGAAGA ATCTCAAAAG TGCTAGTAGT AAAACCGAGG 720 
TCCATATCAG ACCCAAGTCT TGGATTGAGG AACAGGCAAT GGGATCATTC CTCAGTGTGG 780 



CCAAAGGATC TGACGAGCCC CCAGTCTTCT TGGAAATTCA CTACAAAGGC AGCCCCAATG 840 

40 

CAAACGAACC ACCCCTGGTG TTTGTTGGGA AAGGAATTAC CTTTGACAGT GGTGGTATCT 900 

CCATCAAGGC TTCTGCAAAT ATQGACCTCA TGAGGGCTGA CATQGGAGGA GCTGCAACTA 960 

45 TATGCTCAGC CATCGTGTCT GCTGCAAAGC TTAATTTGCC CATTAATATT ATAGGTCTGG 1020 

CCCCTCTTT6 TGAAAATATG CCCAGCGGCA AGGCCAACAA GCCGGGGGAT GTTGTTAGAG 1080 

CCAAAAACGG GAAGACCATC CAGGTTGATA ACACTGATGC TGAGGGGAGG CTCATACTGG 1140 

50 

CTGATGCGCT CTGTTAOGCA CACACGTTTA ACCOGAAGNT CATCCTCAAT GCCGCCACCT 1200 

TAACAGGTGC CATQGATGTA GCTTTGGGAT CAGGTGCCAC TGGQGTCTTT ACCAATTCAT 1260 

55 CCTGGCTCTG GAACAAACTC TTCGAGGCCA GCATTGAAAC AGGGGACCGT GTCTGGAGGA 1320 

TGCCTCTCTT CGAACATTAT ACAAGACAGG TTGTAGATTG CCAGCTTGCT GATGTTAACA 1380 

ACATTGGAAA ATACAGATCT GCAGGAGCAT GTACAGCTGC AGCATTCCTG AAAGAATTCG 1440 
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TAACTCATCC TAACSTGGGCA CATTTAGACA TAGCAGGCGT GATGACCAAC AAAGATGAAG 1500 

TTCCCTATCT ACGGAAAGGC ATCACTGGGA GGCCCACAAG GACTCTCATT GftGTTCTTAC 1560 

TTCGTTTCAG TCAAGACAAT GCTTAGTTCA GATACTCAAA AATGTCTTCA CTCTGTCTTA 1620 

AATTGGACAG TTGAACTTAA AAGCmTTG AATAAATGGA TGAAAATCTT TTAACGGAGA 1680 

CAAAGGATGG TATTTAAAAA TGTAGAACAC AATGAAATTT GTATGCCTTG ATTTTTTrrr 1740 

CATTTCACAC AAAGATTTAT AAAGGTAAAG TTAATATCTT ACTTGATAAG GATTTTTAAG 1800 

ATACTCTATA AATGATTAAA ATTTTTAGAA CTTCCTAATC ACTTTTCAGA GTATATGTTT 1860 

15 TTCATTGAGA AGCAAAATTG TAACTCAGAT TTGTCATGCT AGGAACATGA GCAAACTGAA 1920 
AATTACTATG CACTTGTCAG AAACAATAAA TGCAACTTGT TGTOCAAAAA AAAAAAAAAA . 1980 

AAA 1983 

20 



(2) INFORMATION FOR SEQ ID NO: 43: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
30 (D) TOPOUXSY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATGATGATGA CTTTGAAGAC GATTTTATTC CTCTTCCTCC AGCTAAGCGC CTTGAGGTTA 60 

35 

ATAGTTGGAA AAGACTCTAT AGATATTGAC ATTTCTTCAA GGAGAAGAGA AGATCAGTCT 120 

TTAAGGCTTA ATGCCTAAGC NCTTGGTCTT AACTTGACCT GGGATAACTA CTTTAAAGAA 180 

40 ATAAAAAATT CCAGTCAATT ATTCCTCAAC TGAAAGTTTA GTGGCAGCAC TTCTATTGTC 240 

CCTTCACTTA TCAGCATACT ATTGTAGAAA GTC3TACAGCA TACTGACTCA ATTCTTAAGT 300 

CTGAOTTGTG CAAATTTTTA TCGTACTTTT TAAATAGCCT TCTTACGTGC AATTCTGAGT 360 

45 

TAGAGGTAAA GCCCTGTTGT AAAATAAAGG CTCAAGCAAA ATTGTACAGT GATAGCAACT 420 

TTCCACAC7U3 GACGTTGAAA ACAGTAATGT GGCTACACAG TTTTTTTAAC TGTAAGAGCA 480 

50 TCAGCTGGCT CTTTAATATA TGACTAAACA ATAATTTAAA ACAAATCATA GTAGCAGCAT 540 

ATTAAGGGTT TCTAGTATGC TAATATCACC AGCAATGATC TTTGCCTTTT TGATTTATTT 600 

GCTAGATGTT TCCCCCTTGG AGTTTTGTCA GTTTCACACT GTTTGCTGGC CCAGGTGTAC 660 

55 

TGTTTGTGGC CTTTGTTAAT ATCGCAAACC ATTGGTTGGG AGTCAGATTG GTTTCTTAAA 720 

AAAAAAAAAA AAAACGACAT ACGTGACAGC TCACTTTTCA GTTCATTATA TGTACCGAGG 780 

60 GTAGCAGTGT GTGGGATGAG GTTCGATACA GNCGTATTTA TTGCTTGTCA TGTAAATTAA 840 
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AAACXTTGTA TTTAACTCTT TTCAATCCTT TTAGATAAAA tTCTTCTTTG CAAGAATGAT 900 

TOGTOCTTAT TTmCAAAA ATTTGCTGTG AACAACOTGA TGACAACAAG CAACATTTAT 960 

5 

CTAATGAACT ACAGCTATCT TAATTTGGTT CTTCAAGTTT TCTGKTGCAC TTGTAAAATG 1020 

CTACAAGGAA TATTAAAAAA ATCTATTCAC TTTAACTTAT AATAGTTTAT GAAATAAAAA 1080 

10 CATGAGTCAC AGCTTTTGTT CTGTGGTAAC CTATAAAAAA AGTTTGTCTT TGAGATTCAA 1140 

TGTAAAGAAC TGAAAACAAT GTATATGTTG TAAATATTTG TGTGTTGTGA GAAATTTTTG 1200 

TCATAAGAAA TTAAAAGAAC TTACXIAOGAA GGTTTTTAAG TTAGAAATAT TCCATGCCAA 1260 

15 

TAAAATAGGA AATTATAAAT ATATAGTTTT AAGCCTGCAT CAGTGGGAGT CTTGGCTATG 1320 

TAGTTATGTA GTTATTATOJ AACCACCAAG ATTTTTPrGG CTATTTACCG TAACCAAAGG 1380 

20 GQCCGATTAA NTGGTTTGAA GNCTTG 1406 



25 (2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs 

(B) TYPE: nucleic acid 
30 (C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

35 GGGCCTGAAG GCGGCRCGCC AGTCCCGAGC AGTGCTCGCT CCTGCTCGGG GCGCTGCGGC 60 

CCCGGGCGTC GCCATGACCA GTGAGCTGGA CATCTTCGTG GGGAACACGA CCCTTATCGA 120 

CGAGGACGTG TATCGCCTCT GGCTCGATGG TTACTCGGTG ACCGACGCGG TQGCCCTGCG 180 

40 

GGTGCGCTCG GGAATCCTGG AGCAGACTGG CGCCACGGCA GCGOTGCTGC AGAGCGACAC 240 

CATGGACCAT TACCGCACCT TCCACATGCT CGAGCGGCTG CTGCATGCGC CGCCCAAGCT 300 

45 ACTGCACCAG CTCATCTTCC AGATTCCGCC CTCCCGGCAG GCACTACTCA TCGAGAGGTA 360 

CTATGCCTTT GATGAGGCCT TTGTTCGGGA GC5TGCTGGGC AAGAAGCTGT CCAAAGGCAC 420 

CAAGAAAGAC CTGGATGACA TCAGCACCAA AACAGGCATC ACCCTCAAGA GCTGCCGGAG 480 

50 

ACAGTTTGAC AACTTTAAAC GGGTCTTCAA GGTQGTAGAG GAAATGCGGG GCTCCCTQGT 540 

GGACAATATT CAGCAACACT TCCTCCTCTC TGACCGGTTG GCCAGGGACT ATGCAGCCAT 600 

55 CGTCTTCTTT GCTAACAACC GCTTTGAGAC AGGGAAGAAA AAACTGCAGT ATCTGAGCTT 660 

CGGTGACTTT GCCTTCTGCG CTGAGCTCAT GATCCAAAAC TGGACCCTTG GACCCGTCGA 720 

CTCACAGATG GATGACATGG ACATGGACTT AGACAGGAAT TTCTCCAGGA CTTGAAGGAG 780 

60 
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CTCAAGGTGC TAQTGGCTGA CAAOGACCTT 
GCTCTCCGGG AAAGCTGGGC GTCTTCTCTG 
GGGGGCTGGT GAACOTGCCG CCAAGCTGAC 
GGACCTCGTG GAGAAGTTTG TOGAACCXTTG 
GCX3GTTCTTC CTGAATCAGT ATTCAGCXSTC 
GGCCCTCTGG GACCGCTACA TOGGCACCCT 
CTGAGGTGCC TCCCAACGTC CGCCCACGCT 
GGTCCTCGCT CCGGGGAGCA AGTGGC5GGGC 
ACCTGGTGTC CGTGTACAAG GATGGATOTG 
CTCAQGGAAT GGTGTCTGTG CTCAGCCCAT 
AAAAAAAAAA A 



CTGGACCTGC ACAAGAQCXn* GGTOTGCACT 
AGATGGAASC CAACTTCAAG AACCTGTCCC 
CCACAATAAA GATGTCAGAG ACCTGTTTGT 
CCX3CTCCGAC CACTGGCCAC TCAGCGAOGT 
TGTCCAATCC CTCGATC3GCT TCCGACACCA 
CCGCGGCTGC CTCCTGCGCC TGTATCATGA 
GACAATAAAG TTGCTCTGAG TTTQGAGACT 
GTGCAGATGT GCCTGTGTCT GTCTCTGAGC 
TNCNGTGGCT CXOTGOGAAC TGAfiACATAT 
CCACCAGAAG AGTCTGCTCA CAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 45: 

<i) SEQUENCE OIARACTERISTICS : 

(A) LEr«GTH: 1569 base pairs 

(B) TWE: nucleic acid 

(C) STEIANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGCACGAGTG GftGATGGCTG CGGCCGTGGC GQGGATGCTG CGAGGGGGTC TCCTGCCCCA 
GGCGGGCCGG CTGCCTACCC TCCAGACTGT CCGCTATGGC TCCAAGGCTG TTACCCGCCA 
CCGTCGTGTG ATGCACTTTC AGCGGCAGAA GCTGATGGCT GTGACTGAAT ATATCCCCCC 
GAAACCAGCC ATCCACCCAT CATGCCTGCC ATCTCCTCCC AGCCCCCCAC AGGAGGAGAT 
AGGCCTCATC AGGCTTCTCC GCCGGGAGAT AGCAGCAGTT TTCCAGGACA ACCGAATGAT 
AGCCGTCTGC CAGAATGTGG CTCTGAGTGC AGAGGACAAG CTTCTTATTG CGACACCAGC 
TGCGGAAACA CAAGATCCTG ATGAAQGTCT TCCCCAACCA GGTCCTGAAA GCCCTTCCTG 
GAGGATTCCA AGTACCAAAA TCTGCTGCCC CTTTTTGTGG GGCACAACAT GCTGCTGGTC 
AGTGAAGAGC CCAAGGTCAA GGAGATGGTA CGGATCTTAA GGGACTGTGC CATTCCTGCC 
GCTGCTAGGT GGCTGCATTG ATGACACCAT CCTCAGCAGG CAGGGCTTTA TCAACTACTC 
CAAGCTCCCC AGCCTGCCCC TGGTGCAGGG GGAGCTTGTA G6AGGCCTCA CCTGCCTCAC 
AGCCCAGACC CACTCCCTGC TCCAGCACCA GCCCCTCCAG CTGACCACCC TGTTGGACCA 
GTACATCAGA GAGCAACGCG AGFAAGGATT CTGTCATGTC GGCCAATGGG AAGCCAGATC 
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20 
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30 



CTGACACTGT TCCX3GACTCG TAGCCAGCCT GTTTAGCCAG CCXTTGCGCAT AAATACACTC 840 

TGCGTTATTG GCTGTOCTCT CXTTCAATGCG ACATGTGC5AA GAACTTGGGG 'TCGOGGAGTO 900 

TGTTTOTCAC TTQQTTTICA CTAGTAATGA TATTGTCAGG TATAOGGCCA CTTOGAGATG 960 

CAGAGGATTC CATTTCAGAT GTCAGTCACC GGCTTCGTCC TTAGTTTTCC CAACTTGGGA 1020 

CGTGATAGGA GCAAAGTCTC TCCATTCTCC AGGTCCAAGG CAGAGATCCT GAAAAGATAG 1080 

GGCTATTCTC CCXTOCCTCC TTOGTCACTG CCTCTTOCTG CACGGQCTCC TGAGCCCACC 1140 

CXTTTOGGGC ACAACCTGCC ACTGCCACAG TAGCTCAACC AAGCAGTTGT GCTGAGAATG 1200 

GCACCTGGTG AGAGCCTC3CT GTGTGCCAGG CTTTGTCCTG AGTG C TGTTA CATGTATTAG 1260 

TTCCTTTACT GCTGACCACA TTGTACCCAT TTCACAGAGA AGGAGCAGAG AAATTAAGTG 1320 

GCTTGCTCAA GGTCATGCAG TTAGTAAGTG GCAGAACAGG GACTTGAACC AAGCCCTCTG 1380 

CTCTGAAGAC CGCGTCCTGA AOTTCTTCAC TAGAGCTTCC TCATCAGGTT ACCCAGAAGT 1440 

GGGTCCCATC CACCATCCAG GTCTQCTTGG ATGTTAGTTC TCCACCCTCG AGGTGTACGC 1500 

TGTGAAAAGT TTQGGAGCAC TGCTTTATAA TAAAATGAAA TATATTCTAA AAAAAAAAAA 1560 

AAAAAAAAA 1569 



35 



40 



45 



50 



55 



60 



(2) INFORMATION FCR SEQ ID NO: 46: 

(i) SEQUENCE €3iARACTERISTICS: 

(A) LENGTH: 1924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GGGCCCCCCC WCGWKTTTTT TmTlTriT TTTAATTAGG ATAATGCCTT TATTAACGAG 60 

AATGAAACGT TCATTCCTCC TTCCACTCCT TCTCGTTGGT TTTCTGGACA CAGCTCACCT 120 

GATCCrOCTA GAAACGTTGT CA(3TCTGCTT GTGGCTTCCC TCCTTGATTG ACTCACGCTG 180 

TGTGATGTCT TGAGAAGTAT CTATCCACTT CATGTGAATG AGCACTCCAA TATCAGCCAA 240 

CATCAATCAT TCTTACCTAA AGAATAATAA GAAAAAGTTA ATATAAAAGA CAAGGGTATA 300 

AAATAAAGGT TTGAAAATGC TAGTCAACTT CAAAATTTAA AGAGTTAAAAA TCCAGAGATA 360 

AAGATTGGGG GTAAGTTACA GCATAAAAAA ATAGGAAGAA ACTTCATGGT GGGGGGGAAA 420 

TCTAAAATTA TTCTTACATA AAATAAGTAG ACACCTGAAT TAGAATGAAA ACTGTATTTT 480 

CTTTAAAATG TAAAAGCCTG ACTCTCAGTT TCACCAGTCT GAGCACAAGT TTGACTGCAA 540 
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CCCAAAATAT ACTATCCCTT ATGTGAAGGT ATGTGACAAC GTTGACCTCA CXa^AATGACT 600 

TTTAACATCA GCTCTTTTTT. CATATGAAAG CACATACCCT GCTCCCCATT CAAGTATGTC 660 

TTCCATTGTC AGGCAGGCTG ACX::ACCTTCA GCAQGAGTCC TCCAAGAGTG CCCAACTCCC 720 

CTTCCCACAG TACACAACGC TGTAGTTGTT GTCCTGCAAT CCTTTGTATO TACCTCATTC 780 

TTTCCCATCT AAGTCCTCAC TGAGTTrrAA AGTTAGGGCT GGAAAAGCTA TGCCTTACTG 840 

GGACAGCAAG GAACX:AATTT TTTTCTGAGG GAGAAGACAT TCACCTTCAC TATATGCCTG 900 

GCAGGGCCAC AGTGCACAAA ACAAAGATCA GCXHTCATTC AAGTTCCAGG TTTTTCTTCC 960 

15 TCCXTTGAATG ATTACTGCAA AGGGTATATG AAGTAAGAGT TCCCTGTTGC ACATGTACCA 1020 

TCCATAAGGG ATACTATATC GTTTTGCATT CTTCCCCCXy^ TTCTCCACAT TGTCCTATCT 1080 

TAAOTCCAAG CXXTTTTTCAC TCTCAAAAAA AAAAAAAAAA TATTTTTrTC AGCACTGGTG 1140 

20 

TTCAAAAGCA ACGTTTTTAT GGTTAATGGT TTACCAGCAA CTGTTGAGAT TTCCAGTTGA 1200 

GTCTTAAAAA TTGCCAATCA TTATCTAGCA GCAATGACAG ATGATTAGGA GCAGTCAAAT 1260 

25 CCTCTGAATT CTTTCCCTAA TAGGCAGCCA TTTGAGAACT GCACTAGCTG ACATCACTAA 1320 

AACATTATCA GCTAAAGCCA AAACCAAATA AAGGCCCAGA CCAflCATCCT GQCTCTCTAA 1380 

AACCTGTCCA AAATCATTAA GTGAAAGGCA GTAAATGCAG GACTQTQGAT CATGTCACTG 1440 

30 

CAGCTGACAA TGATTAACAA TAGGAGACAT GCAACCCCCA OTAAGGITAA AAGTCCAAAA 1500 

CTAGTCACAC GCATCTCTTT ATTGGGGAAA AGTGAGACTA TTATGCATTC TTGGTAGGTT 1560 

35 TGCAACCTTG CATGAAGAGC ACCCATTGCA TTTCTTTCAT CTTTCAGAAA GCACCGGTAT 1620 

CTGTTCCAAG GGCCTAACAG TACGAAAATA CATTCTGGCA TCACACCTCT GAACCCAAGA 1680 

CTGTTCTCAT TAAAAATAAT TTTGGTTTGT AACAAAATTA TGAAATACAA TQCAAGCACC 1740 

40 

TCGGTATAGC ATTATTACTG AAACCACTTA ATTOXAGCT TTTTGAC3TTT TTTAAAAAAA 1800 

CXrCACTGCAC TAAGATTCAC AATTCATTGC TACATACAAA TTAAAGCTAG TAAGAACACA 1860 

45 CTAACX5TCAC AAGTTTCTCA TTCTAAAGTG CAAAAGCCTA ATCATCTGAA AGTGAACAGG 1920 

GTAA 1924 



50 



(2) INFORMATION FOR SEQ ID NO: 47: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
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'TGOroTGGQG CCCAGAAAMC AAGGC3ACCAG TQAAAACAMC (TCAGAGACT 1CTATCCGCC 
^ AGGAAAGCCA TTGCTAMrYC TGAGCCCTTG AAGGGCAAGG AGQGAAACAG TCTTACCAGA 
GCCX^GTAAG AACTGCTGTC ATQAAGGAGG GQCC2WXTTG TAAGAGACAT CATTACTACC 
AGAACTGTGG TGCCAAATTG CTCGTGTCTC TCTTTGGAGA AACCAACCAG ATACATCTOC 
10 TGGAGACCCA GGTGGGCACA GAGAAGOGTG GAGAGAGAAT CK3GGAAGAG AAATCGAGAA 
TAAGCAGCAC AGTGTTATTC ATTTCTGTAA ATTCCTATGT AGAAGGCTCA GTOTTAGAAA 
TAAAGTTATT CTACTAGTTG CAAGTTAAGT G Tr T CitJiTr GTTCTGCTTT CCTOTTASCA 
TAAGTAAACT CCCTTTGGAA CTACACAGGT ATGTCTCTCC TTCAACATGT GTOAA 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBiaaESS: double 
<D) TOPOLOGY: line£ir 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

AAGQQACAGA GACCTGGATT CAGATCTCAT TTTACAATGA AGACCCCAAT GCAGAAAGTC 

ATGTCTGAAA TTCTGAGCTT ACTCTTCTGC CTGCTC3GGAC CTGCTCTGGA TCAGAGAAGG 

GAGGAAAAGG ACTAATCAGA GGAGCCAATG AAGTCACTCC ATCAGTTTCC TGAACCCTGC 

CCAGCTAGAG ATTAACGTYT GACCWTCAAC GTAGGACACT GTGCAGA'TCG CTACTTGCTC 

GCGCACATGA AGACCAAAGC CAGGACCAAG CCCCMASCCT GCTWAACACG GCAGARTCTT 

GCCCAGCCMA CYTCTGTGAR AATCTGCTTC CCTCCACAGC TGACCC 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
TAGGTGTCAG CCGCCACCOC CCCCCCATAT QCAGATTTAC TSGQCATGGT AGTGGCCAGC 
TTCTAACACA GCTGGTATTT CAAGTCTCCT GGGACCTCAC TCAGGAATGA TACCCCCTCA 
GTAGAAGCAG CAGGTGATCT TAACTCCTTT CAAAGAGCAG GCCTGTCTCG GAAGCCATOT 
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CCTCAGCAGG CACAGCAACC CCTCTGGAAA TGGATCACAA ACTCACTTCT CAGCCAGGCA 240 

GQCCAAGCTT CTATTGTAAC AGTAGGCACA GTATAGTCGG ATCATCACAT CAGCTGGGTT 300 

5 

TTTGGTTTAG TCATCTAGAG TCGTCTGGAC TAAAGGTCTT TCAGGTCTCC TTGCCCTOTG 360 

AGTGCGTGAA CCPCXXrCACC CGAATTGCCT CAGTTGTCCT GAQCCTCATG TCTCTCCTGG 420 

10 TGGTQGQCCA GGCCCCTGCA TGQGAAOGGA GCCTQCTGCG GGQCAGQCCA GCTGGGGGTC 480 

CTCACCTATG CGCAATGANA GTTATTGAAG GACTGGTTC3T TGATQTTGGT GAGCGTATCC 540 

TTCATGGCCA GCGCGAAGTC GGCCAGGTCA GCCAGGTGCT GCCAGCGCTC TCTCTCQGAC 600 

15 

TTGTCTTCCT GTGCCAGGGG ACXX3TGGAGA AAGTGTCftGG QGCCX5CTCAC TGCAGCAC3CC 660 

TGCTCTGCTG CCTTCCCTGG CAGTGTTCTG GGGGTGGATT CXXTACAMCT AGATGTTCAA 720 

20 GGCCTTACTT TTCCTCCCAC AAAGGAGTCG CAGCXIACGCT AGCTCTGACT TGCCACTGTG 780 

ACAAAGTTCA CGTAGCAGGT CTAGGCAAAG ACTGGGCAAT TGAGCAGAGG AGACGGACCT 840 

GTGAGTCTGA CCRYGAGSCG GRCCCCTTCA CdTGGCTGG GCTQGTCCTG GTCCTTAGGT 900 

25 

TTTGTCAGGT TGTCCTTCTT TOGATCCCTC AACTAGGTGA TAAGCACTGG AGGGQGATGA 960 

CCXIXSCCTTGG ACX3TGTTTCT TTAACCTCAT CCATATAATA GGGCCGTGGG ATGGTTGTAG 1020 

30 AGGTAAAGCA GGATGATGGT GTTrTAAGAC CAGAGCTTGG GACCAGQGCT CCTACACCTA 1080 

ATTTTCTCTC CTGGTAGCTG AACAAAGGTC TAAATTAGCT TAACAAAAGA ACAGGCTGCC 1140 

GTCAGCCAGA GTTCTGAAGG CCATOCTTTC AGTTTCCCTT GTTGACAATT GCTCTCCAGT 1200 

35 

TCCTATGAAA GCACAGAGCC TTAQGQQGCC TGGCXIACAGA ACACAACCAT CTTAGGCCTG 1260 

AGCTGTGAAC AQCAGGGGGT TGTGTCTCTG TTCTGTTTCT CTGCTTGCCG AACTTTCTCA 1320 

40 ATAAACCXTTA TTTCTTATTT ATAAAAAAAA AAAAAAAAAA AAAAAA 1366 



45 (2) INFORMATION FOR SEQ ID NO: 50: 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOtiOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
55 GCAGTAATTC CTGTTAGCCA CTGCATCCAC CAAAACTAGT TTATTTTTCC CCTCAAATTC 60 
ATGATTTTTA CGTCTGTTAC AAAGGGAATT TTGCTGATAG CTdTTGGGT CCCACTGTTC 120 
CATTTTATGC TAATAGATTC CATTCTAGGG CCCAGCCGTC TCTTGACTGA TGGTGTTCCC 180 

60 
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TTTAACCCTT GGCATGTATA ATAGAATTTT 
AGTCCCCCCA GGCCCTCATA TCCATAAAAG 
5 GTCTTTTTGT TGTTTTAGAA AAAAAAAACA 
ATGGTTCCTA GCTGTGTACT CATAACTTTT 
GGAAAQGAAA TTAAATGTCC TTCTAGTATT 

10 

TAACCTATAT TGAAATGCCA AGAATTGTAT 
TCATGCCTTC ATATTACATA TTAAOTGAAA 
15 CACAAGGCAA AGGCACXTTAA ACCTTTTGTT 
TTTTTATTTT TTTAGGGGAA CCTATATAAT 
AGCTTATGAA TTCAGGTTAC AATTGAffTTG 

20 

GTTGGTGCCT TATGTGACAC GAGCCAGAGC 
GCXyy^CACCA ACTCGtAGCT TTAGTGGGCA 
25 TGTGQGGQCT TTGGGATTTT CCACACCATC 
ACACTCTTGG ATGGATWATT TGRAAATGGT 
WTGRGCCACC ATTCTTWATC ACACCATAAC 

30 

AACCTGTTAC CAGCCYKSMA KTWACCCWWA 
AATAGCAGGG CTAAGGAATT ACTGGCAAGT 
35 TTAAATATTT GTAATTTTAT TTAAATGTTC 
GGATQGCTCT TTGCCATGGG TCTTATTTTC 
GTCTTAATGT ATTTTTAAAG TTTTTGGTAT 

40 

TV/TRTAAAAA AAf^AAAAAAA AAAAA 



GGTGAATGAA AGAACCCAAA TAGGCCAGAT 240 

GCTTGGGAAT GCATTATGTA ATTGTCCTTA 300 

AGATGGGCTC AGATOGATGC CTAOGTAAAA 360 

CTTTGAATT6 AGTASTGAAA GGAAGGAOGA 420 

CTCTGGACTC AAGTCTGACA TATGAGATAA 480 

CTGAAACAAG AGAACAGTTT GACACATTTA 540 

cx:aattaata AACATATGAA ATATCCATTG 600 

TCTTTTTCTA CATAGCAGAA ATTGATTTTT 660 

TATGACCCAG TGATGTCTTT TGGTGACTTA 720 

ATTCTAGATG GTTACTACCT TGAAAAGGAT 780 

CTGCTGGGGA ATAAACAAAG CAGGTTTCAT 840 

GATOGGGAGT GGTTCACAGA CTTCCCAAAA 900 

CXy«X3TCTGT TGTTCATTCT TCCTCTTTTC 960 

GRAAWVMMCY YYKRAATTTG CCCAATAGCC 1020 

CAAATAGTTC CWTAATGTTG AAATATTAGA 1080 

WTTTTCCXIAT GrTTGTGGAA TTGATATTGA 1140 

TTTAGCCTGT GGGTAATACC TTAGGGITAT 1200 

ATGAATGTTT GAAAGGAACA AAATTATCAG 1260 

ACCCTCTTTT CTGTAAGAAA AAAGAACAAT 1320 

AGTTTCTAAT TCCAATTTTA ATAAAAGTTT 1380 

1405 



45 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 51: 

55 

CGGATTTTCT AGGACCCCAA AAAAAAAAAA AGGCTJAAAAA AAACCCNCAA AACCANCCAA 60 
AACCCCAAAA AAAAAAAAAA TCCACAAAAA CAAAAAAACT ATAAAAAAGA AAGAATTAAA 120 
60 AACTTTCAGA GAATTACTAT TTACTTTATT AACTTACGQA TTTATTATAT AAATATATAT 180 
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TCACCTAGCA ACATATCTCT GCCGTCTCTC CTGCTCTCAT AATCAAGACA TAGCCGATTC 
TCTQCCCGGG CCCCTTGCTG ATCCTCCTCC GC5GTCTQCX5T CGGGCGTOCX5 TCTCTCQGGA 
CXXrrCCAGAG GTGGAGGTGG GCTGATGGCC TGGCIXXTTG GTQGTTCATO GTTTTCCTCC 
CCCTACCTTT TTTTTTTGAG TTTATTCTGA TTGATTTTTT TTCTTOCSTTr CTOGATAAAC 
CACCCTCTGG GGACAGGATA ATAAAACATG TAATATTTTT AAGAAGGAAA AAAAAAAAAA 
AAAAAACTTJG GGGGQGGCCC CGAA 

(2) INFORMATION FOR SBQ ID NO: 52: 

(i) SE5QUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESGRIPTIM^: SBQ ID NO: 52: 
NAAGTATCTT GGCCAGTTTA TTACAGAGGA CGATAAATC5A TTCCATGTGG ATAGGGCATA 
ACATACAGAG AATGAGACTA TQCCAGAAAT GGGAGGAGGC ATTTGAAACA ACATXSAGTAT 
CTCAGGGACA GATGGATTGA TTCTGCTATT GGTAGGCCTG GAAGCAANGG TCAGAAGTAG 
CAAAAAATGG ATACCAAAAG CACTATTO5T CACCCAAGCT AAGTQGAATA QCTOGCCCAG 
TAGGAGAAAT GCAGGTTrTG CTCTACACTA AGTTCTCCAA CTCTTGATAA GCCTCCAAAA 
ACAAATGTTA GGQGAAAAAA ACGCAGCTGG TTATQAAAAG ATATATCTCA TTTCATTAAA 
AAATCAATGT CAATGCTGTT AATAGAATCC TTTTATCTTC AGGACAGAGG CAATCCCCTA 
AACAAACACC AGCTCAA6AG CCTCTGATGC CAACCTAGAG GGTACCCAAA CACAAACTTA 
GCATAGAGGT AAGAATCTCT ATGTCTTTTG GTGGAGGCAA AQCCATTTGG TTGGTACTTC 
ACAGGAACAT CTTTCTACCA AGTCTTCATC ATATOGTATG TGCCACGAGT CTCCAGTTOT 
TTQCACCACT GTGTCATAGC TGAGAATACG CTGAAAGGTT AGTTTTGATC CTGGAAACCT 
ATTTACAATT GCCAGCTGAT GTCCCTGCTG CCACTTAAAA AAGGCriGGG TCTGGCATAG 
GCAGAMAGGC CTGTGGTCCC CTCGTGCCGA TTCTNGGCTC GAGGCCAATT NCCTTAT 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 
{B) TYPE: nucleic acid 
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(C) STRANDEDNBSS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 53: 

5 

ATGACTACAG TGTTATACCC TCXAATCTTT GCAGGTQGGC ATGGAACACT GCTTCTTATCA 60 

CTCTGTOCAC GGTATAAATC CATATATCCA CAAAAACACA CATCCATCCA TCAACATATA 120 

10 CATGGrTTGG GATGAGCAGG TCAATAGTTT TGAGAGGGAG TTTGTTCCTT TTTTTTTTCT 180 

CATTATACTC TTAAATTGTT GTCAGTTATC AAACAAACAA ACAGAAAAAT TGTTTGGAAA 240 

AACCTTGCAT ACGCCTTTTC TATCAAGTGC TTTAAAATAT AGACTAAATA CACACATCCT 300 

15 

GCTAGTTTTT TCTTACAGTG ACAGTATCCT TAOCTGCCAT TTAATATTAG CXTCCTATTT 360 

TTCTCAC?GTA TATTTACCTG TCACTTGTAT TTGTTATTTA AACAGGAAAA AAAACATTCA 420 

20 AAAAAAGAAA AATTAACTGT AGCGCTTCAT TATACTATTA TATTATTATT ATTATTGTGA 480 

CATTTTGGAA TACTGTGGAA GTTTTATCTC TTGCATATAC TTTATAOGGA AGTATTACGC 540 

CTTAAAAATA CGAAAATAAA TTTTACAAGG TTOCGGTTTT GGTGGTGGAA AGAGTAAATT 600 

GA 602 



25 



30 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1749 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

40 

AGTCACTGAC TTGGAGCCGC TCGGGGGAAG TCCCGCCCAG ACAGGCGGTG GGTGGGAATG 60 

CCTCACTTCA GTTTGAAGAG GGTCCGGATC CAAAGGGGTT AAAACGAGCG AACCCCGATC 120 

45 CCCGACCACA CTTCCCGCCT CCCTAAAACG CACACCCCGC TAGCCATGGG CAGCCGCGAC 180 

CACCTGTTCA AAGTGCTGGT GGTGGGQGAC GCCGCAGTGG GCAAGACGTC GCTGGTGCAG 240 

GATTATTCCC AQGACAGCTT CAGCAAACAC TACAAGTCCA CGGTGQGAGT GGATTTTGCT 300 

50 

CTGAAGGTTC TCCAGTGGTC TGACTACGAG ATAGTOCGGC TTCAGCTGTG GGATATTGCA 360 

GGGCAGGAGC GCTTCACCTC TATGACACGA TTGTATTATC GGGATGCCTC TGCCTGTGTT 420 

55 ATTATGTTTG ACGTTACCAA TGCCACTACC TTCAGCAACA GCCAGAQGTG GAAACAGGAC 480 

CTAGACAGCA AGCTCACACT ACCCAATGGA GAGCCGGTGC CCTGCCTGCT CTTGGCCAAC 540 

AAGTGTGATC TGTCCCCTTG GGCAGTGAGC CGGGACCAGA TTGACCGGTT CAGTAAAGAG 600 
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AACGGTTTCA CAGGTTGGAC AGAAACATCA 
ATGAGAGTCC TCATTGAAAA GATGATGAGA 
5 ACCCAAGGGG ACTACATCAA TCTACAAACC 
TGTTTGGCTT ATTTTCCATC CCAGTTCTGG 
CXCACCTGAC CATTTTATTA AGTACATTTG 

10 

gggccx:attg tcacttagaa aagacacctg 
ttagcctttc acatgttgct grctcacatt 
15 ttctcatcag ccctcaattt gtgatccgga 

TGCGTTTTAG AGATCATAAT TCTCACCTAC 
TTGATATCAT GACTTCCAAT TGAGAGGAAA 

20 

TGTAGGCX:X?r TGTTTCAGAT TCTTTCTGTC 
AGAAGGAGGG GTCTGGGCAT CTGTGGATTT 
25 GTATTTTTGA AACTTCTAAC GTCATAATTA 
CAAGTTTTTT GGCCGQGCAT GGTGGCICAT 
GGCAGGCGGA TCACATGAGG CCAQGAATTC 

30 

CGTCTCTACT AAAAGTACAA AAATTAGCCA 
TACTCTGGAG ACTGAGGTGG GAGAATCGCT 
35 CGAGATCATG CCACCGCACT TCAGCCTGGG 
AAAAAAAAAA AAAACTCGAG GGGGGGCCCG 
ACAATCNAA 

40 



GTCAAGGAGA ACAAAAATAT TAATGAOGCT 660 

AATTCCACAG AAGATATCAT GTCTTraTCC 720 

AAGTCCTCCA GCTGGTCCTG CTGCTAGTAG 780 

GAGGTCTTTT AAGTCTCTTC GCTTTGGTTG 840 

AATTGTCTCC TGACTACTGT CCAGTAAGGA 900 

GAACCCATGT GCATTTCTGC AOXrrCCTGGA 960 

AGTGCXIAGTT AGTGCCTTCG GTGTAAGATC 1020 

ATTTTGTGAG AAGGATTAGA AATCAGCACC 1080 

TTCTGAGCTT ATTTTTCCAT TTGATATTCA 1140 

ATGAGATCAA ATGTCATTTC CCAAATTPCT 1200 

TTGGAATGTA AACATCTGAT TCTQGAATGC 1260 

TTGGCTACTA GAAGTOTCCC AGAAGTCACT 1320 

AGTTTCTCTT GTCTTGGCAT CAAGAATAGT 1380 

GCCKGTAATC CCAGCACTTG GGGAGGCCAA 1440 

6AGACCAACC TGGTCAGCAT GGCAAAACOC .1500 

GGCGTGATGG CACGTGTCTG TAATCCCAGC 1560 

TGAGACTGGG AGGCAGAGGT TGCAGTGAAC 1620 

TGACAGAGAA GGACTCCGTC TCAAAAAAAA 1680 

GTACCCAAAT CXXTSTGATA GTGATCGTAW 1740 

1749 



45 



(2) INFORMATION FOR SEQ ID NO: 55: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1896 base pairs 

(B) TYPE; nucleic acid 

(C) STOANDEDNESS: double 
50 (D) TOPOLOGY: linear 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

AAAGAGATGG GCTCTTTATT TTCTCGAAAA ACCAATTTGG AGTTACTCAT TTTTCCATAA 60 

CATTAAATTT CTTACAGTGA ACTACATATT GTCCATAAGT GCTTCATCAG GACTCATCGC 120 

CCTCCTGTCT ACTGGCTCCA AATAGACCAT GTCAGCTTCA CCCCCTGGCT TTGTGTCTAT 180 

60 GGGTGGCCTG TGGTATATGG AAAAGTAGCA GGGTGGTCAG GGTGGGAGAC ACAAGATGTT 240 
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TTTATAGTCT AGAGCCTTTA AAAAACCXZAG CAGAATGTAA TTCAGTATTT GTTTATTC3GC 300 

TGfmTTTGA CAGATTOTTG AAATTAAATG AATTGAAAOG GAAACTCAGA GTACTAGGAC 360 

5 

GriTATTAAA AOGAAAAAAA TGTCTTGCAA TGTGCTGTAA TCACAAGAGG AGAAAATAAC 420 

TTGTrTCCTT GATCTGTCAG AQGTCACAGT AACCTGGGCC GAGCTGTTAT TATTTATTAT 480 

10 ATAATAGTAG TAGGAAGTTA ATAACTQGTT CTCTGTOTTC CAAGCACAAT ATTACAACTT 540 

CrrTTTGAACX: GTAAATATCA GAATGAATCC TCTTCCCAGG GGATTGAACA GAAGCTTAAT 600 

GTTTACAAC3T GTTTGAATTT GTCATCTC5AA ATAACACAAA ATTAAAAACA TGATTTCTCT 660 

15 

AATTTTCCAA CTAGAGGAAG AGAAACTTGT GC3AAAAGTTC TTTrmTTC TTTTrTTTTT 720 

CTTAAAGAAG GGCAGCXAAG GTAGrTAACCT AAAAATAGTG CCCAGGCATA TGAGAGTTGT 780 

20 CXTTACGACSGT TAAAGAACAC ACTGTTCCAC TGTATGGCTT TCGCCCTGAG TCQCXMQGA 840 

QGTCAACTIG ACCCTGCCAT GITOGTTTGA CTTACTAAGA CACAGGAATC ATTGTTTTCX: 900 

TTGACCAGGG TCTCACACCC TQGAGGAATG TTAAGTAAGA GAAAGAACCT CTTTCCTGAA 960 

25 

TATTGACATG TAAAAGACCA AAGTAATTTT TCTGAACTTC TGCAATTCTG AGAACTCTCC 1020 

AAGGAATTTA CAGTGATTTT AGTGCTTGTC AGCATTTTTC CATGAGGACT TTCATACATT 1080 

30 TGACTCnTA GTTCACAGGT TCCCATTGAT TGTGAGCAAG ATATTTATCT CTTTAGCCCT 1140 

TGGGGATCCA GCTGAGAGCA ATCTCTTGCA TTTTTTTACC CGTGTATGTA CAGATATCAT 1200 

TTCTTGTGTA TGCCATGACT TGAAAAAGTT TGGGAAGCTC TTTAGCAATA TCAGCTAAAA 1260 

35 

GGATATGAAA TCACAGGTGA TAGCAGTTGT CATTCAGTAA TTICCTACAA GCAGCACCCC 1320 

AAAC3GAAATA TAGTCCTAAT CTTTACTATC CACTTCTAAA TTTAATGTGA ATTTCATACA 1380 

40 TGTTATTAGT TGTTTTCTTT ATAATTTTAT AAAAATTATT CATCGGGAGT TTAACTTCCA 1440 

CTTCCATGCT ATCGGATGTG TTGGGCTCCA TGCAAGAACT TQGAAGAAAA ACAGGCAQGA 1500 

ATGCATTTGC ATAATGACCC AGATCATCAT TTTCTQCAAC TGAGAATTAT ATTTCATCAT 1560 

45 

TGCTTCTAGA AGTCTGCAAT TCTITACTTT TCTTTGGTGC ATTATTATCT AGGTGCCATC 1620 

ACTGGATAAT GTGGAGTGAC TAGAGAAGTC AYATATCACT GTAAGGTACA GTTAGGGGTA 1680 

50 ACACTTTAGA GGTrTATTAT TnTAAAAAA CTTTTCTTGA ACTCCTGGGC CAACATGGGT 1740 

GAAACCCCGT CTTCTTACTT AAAAATACCC AAAATTAGGC CAGQQGCGTG GATGGGTGGG 1800 

GTGCXTGTTA ATCTTCAGCT ACTTOGQGGA GGGCTTGAAG CX:AGGGAGGA ACTGCCCTGG 1860 

AircCCCGGGG NGGGCCAGNA QGTTTGCCAG TTGAGT 1896 



55 



60 
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(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LajGTH: 1753 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEnNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

TCTTTTTAAA ATAGACATTT GTGQGGCTCA CACAATATAT GAAATAGTAC CCTCTAAAAA 60 

AGAGAAAAAA AAAATCAGGC QGTCAAACTT AGAGCAACAT TGTCTTATTA AAGCATAGTT 120 

15 TATTTCACTA GAAAAAATTT AATATCAAGG ACTATTACAT ACTTCATTAC TAGGAAGTTC IBO 

TTTTTAAAAT GACACTTAAA ACAATCACTG AAAACTTGAT CCACATCACA CCCTGTTTAT 240 

TTTCCTTAAA CATCTTQGAA GCCTAAGCTT CTGAGAATCA TGTGGCAACrT GTGATGGC3CA 300 

GTAAAATACC AGAGAAGATG TTTAOrAGCA ATTAAAQGCT GTTTGCACCT TTAAGGACCA 360 

GCTGGGCTGT AC5TGATTCCT GGGGCCAGAG TGGCATTATG TTTTTACAAA ATAATGACAT 420 

25 ATGTCACA'TC TTTGCATCTT TGTTTQCTTG TrGAATTTTT GAACAGCCAG TTGACCAATC 480 

ATAGAAAGTA TTACTTTCTT TCATATGGTT TTTGGTTCAC TGGCTTAAGA GGTTTCTCAG 540 

AATATCTATG GCCACAGCAG CATACCAGTT TCCATCCTAA TAGGAATGAA ATTAATTTTG 600 

TATCTACK3A TAACAGAATC TCGGTCACAT GAAAAAAAAT CATTTTATCC GTCTTTTAAG 660 

TATATCTTTA AAATAATAAT TTATGTGTCT GCATATTGCA GAACAGCTCT GAGAGCAACA 720 

35 GTTTCCCATT AACTCTTTCT GACCAATAGT QCTQGCACCG TTGCTTCCTC TTTGGGAAGA 780 

GGAAAQGGTG TGTCAACATG GCTAACAATC TTCAAATACC CAAATTGTGA TAGCATAAAT 840 

AAAGTATTTA TTTTATQCCT CAGTATATTA TTATTTAATT TTTTAGCTAA TQCCTATCTC 900 

TTO3TCTATT AAGGAAAGAA GCAATCAGTA GAGAATTCAG GATAGTTTTG TTTAAATrCT 960 

TGCAGATTAC ATGTTTTTAC AGTGGCCTGC TATTGAGGAA AGGTATTCTT CYATACAACT 1020 

45 TCTTTTAACC TTTGAGAACA TTGACAGAAA TTATGCAATG GnTGTTGAG ATACGGACTT 1080 

GATGGTGCTG TTTAATCAGT TTGCTTCCAA AGTGGCCTAC TCAAGAGGCC CTAAGACTGG 1140 

TAGAAATTAA AAGGATTTCA AAAACTTTCT ATTCCTTTCT TAAACCTACC AGCAAACTAG 1200 

50 

GATTCTGATA GCAATGAATG GTATGATGAA GAAAGTTTGA CCAAATTTGT TTTTTTGTTG 1260 

TTOirorrcT TTTGAATTTG AAATCATTCT TATTCCCTTT AAGAATGTTT ATGTATGAGT 1320 

55 GTCAAGATGC TAGCGAACCT ATGCTCAGAT ATTCATCGTA AGTCTCCCTT CACCTGTTAC 1380 

AGAGTTTCAG ATCGGTCACT GATAGTATGT ATTTCTTTAG TAAGAATGTG TTAAAATTAC 1440 

AATGATCTTT TAAAAAGATG ATGCAGTTCT GTATTTATTG TGCTGTGTCT QGTCCTAAGT 1500 
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GGAGCCAATT AAACAAGTTT CATATGTATT TrTCCAGTGT TGAATCTCAC ACACTGTACT 1560 

TTGAAAATTT CCTTCCATCC TGAATAACGA ATAGAAGAGG CCATATATAT TGCCTCCTTA 1620 

TCCTTGAGAT TTCACTACCT TTATGTTAAA AGTTGTGTAT AATTGTTAAA ATCTGTGAAA 1680 

GAATAAAAAG TGGATTTAAA TTAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1740 

AAAAAAAAOG GG6 X753 



15 



(2) INFORMATION FOR SEQ ID NO: 57: 



(i) SEQUENCE CHARACTERISTICS: 

(A) IiENGTH: 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
20 (D) TOPOLOGY: linear 



25 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GCQGAAGTTA CTGCAGCCGC GGTGTTC3TGC TGTGGGGAAG GGAGAAGGAT TTGTAAACCC 60 

CGGAGCGAGG TTCTGCTTAC CCGAGGCCGC TGCTGTGCGG AGACCCCCGG GTGAAGCCAC 120 

CGTCATCATG TCTGACCAGG AGGCAAAACC TTCAACTGAG GACTTGQGGG ATAAGAAGGA 180 

30 AGGTGAATAT ATTAAACTCA AAGTCATTGG ACAGGATAGC AGTGAGATTC ACTTCAAAGT 240 

GAAAATGACA ACACATCTCA AGAAACTCAA AGAATCATAC TGTCAAAGAC AGGGTGTTCC 300 

AATGAATTCA CTCAGGTTTC TCTTTGAGGG TCAGAGAATT GCTGATAATC ATACTCCAAA 360 

35 

AGAACTGGGA ATGGAGGAAG AAGATGTGAT TGAAGTTTAT CAGGAACAAA CGOOGGGTCA 420 

TTCAACAGTT TAGATATTCT TTTTATTTTT TTTCTTTTCC CTCAATCCTT TTTTATOTTT 480 

40 AAAAATAGTT CTTTTGTAAT GTGGTGTTCA AAACGGAATT GAAAACTGGC ACCCCATCTC 540 

TTTGAAACAT CTGGTAATTT GAATTCTAGT GCTCATTATT CATTATTGTT TGTTTTCATT 600 

GTGCTGATTT TTGGTGATCA AGCCTCAGTC CCCTTCATAT TACCCTCTCC TTTTTAAAAA 660 

45 

TTACGTGTGC ACAGAGAGGT CACCTTTTTC AGGACATTGC ATTTTCAGGC TTGTGGTGAT 720 

AAATAAGATC GACCAATGCA AGTGTTCATA ATGACTTTCC AATTGGCCCT GATGTTCTAG 780 

50 CATGTGATTA CTTCACTCCT GGACTGTGAC TTTCAGTGGG AGATGGAAGT TTTTCAGAGA 840 

ACTGAACTGT GGAAAAATGA CCTTTCCTTA ACTTGAAGCT ACTTTTAAAA TTTGAGGGTC 900 

TGGACCAAAA GAAGAGGAAT ATCAGGTTGA AGTCAAGATG ACAGATAAGG TGAGAGTAAT 960 

55 

GACTAACTCC AAAGATGGCT TCACTGAAGA AAAQGCATTT TAAGATTTTT TAAAAATCTT 1020 

GTCAGAAGAT CCCAGAAAAG TTCTAATTTT CATTAGCAAT TAATAAAGCT ATACATQCAG 1080 

60 AAATGAATAC AACAGAACAC TGCTCTTTTT GATTTTATTT GTACTTTTTG GCCTGGGATA 1140 
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TGGGTTTTAA ATGGACATTG TCTGTACCAG CTTCATTAAA ATAAACAATA TrTGTAAAAA 1200 
TCAWAAAAAA AAAAAAAAAA ^220 



10 



(2) INFORMATION FOR SEQ ID NO: 58: 



20 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

TCGCGCCTGC AGACACAGCA TCTACTCflOC GTGQGTCACC TCTGTGAACA TCACTGACTG 60 

CAAGCCTCCC TCAATTTCTG GTGCAGCCCA TCAGGGACCC ACAQCGCCTG GGAGGATGGT 120 

GCGGATCTTG GCCAATGGGG AAATCGTGCA GGACQACGAC CCCCGAGTGA GGACCACTAC 180 

25 CCAGCCACCA AGAGGTAGCA TTCCTCGACA GAGCTTCTTC AATAGGGGCC ATGGTGCTCC 240 

CCCAGGQGGT CCTGGCCCCC GCCAGCAGCA GGCAGGTQCC AGGCTGGGTG CTGCTCAGTC 300 

CCCCTTCAAT GACCTCAACC GGCAGCTGGT GAACATGGGC TTTCCGCAGT GGCATCTCGG 360 

CAACCATGCT GTGGAGCCGG TGACCTCCAT CCTGCTCCTC TTCCTGCTCA TGATGCTTGG 420 

TGTTCGTGGC CTCCTCCTGG TTGGCCTTGT CTACCTGGTG TCCCACCTGA GTCAGCGGTG 480 

35 ACCTCTGAGG GCTGATAGGG GTGGGTTTGT TGAGAGGGAC TTGCTGGGCC TTGGTGTGAG 540 

AGCAGGCATA TTTQGAGGGG ATCTGGTGGT QCCTTGAAGG TATGATCAGA GAGGGGACCA 600 

CAGGTGTGTC TTTCCCCTTT GTGITAAGCG TGAGGCAGAG GGAGACGTTA GTCCCAGCAT 660 

TTCCCAAAGT GTGQGTGGGT CCGTTGGTTC CCGAGATACT TTTAGGTGGT ATGGGGCCTG 720 

CATTAAGTGG CACAAAATCA GAGCAAGAAA GCGATGCCCT TCCCAATTCT CTCAATCCTT 780 

45 TTATGCCGAG AAGATCTCAG CTGGATGCCA ACATGTTCCG ATGCCTGTQG AAGACATQCC 840 

GACGTCTCCT CTGCCTAGGG AGCAGGACTT GGQCTTAGGG CAGGTGGAAA AAATTCCAGA 900 

CTTTTTTAGC ACrGTTTTTG TTTTAATOGT ATATTTTTAT TGGCTACTTT ATTG?rTTAGG 960 

ACAAGTGGTA GTGGCATTCT ATTTATTGTG ACCTTTTCAA TAAATAGATT TAAGTAAAAA 1020 

AAAAAAAAAA AAAACTCGAG GGGGGGCCC ^049 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 59: 
60 (i) SEQUENCE CHARACTERISTICS: 



THIS PAGE BLANK (MSPTO) 



i His PAGE BLANK (USPTO) 
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(A) LENGTH: 1776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAGAGGATG TGMAGCTAGA GGTCCCCGAT GGCTGGTCGG ATGGGAAGCA CAAGGCTGAG 60 

10 GGACTGGATT GTAAAGGCAC TAAGTCGTTC TGCGGTGAGA ATCAGACATG GGGGACCTCT 120 

AGCTTCACAT CCTCTTTCCT TGCAGSTCTC GACATCCTGA OCCCAAGTCC CCCACACTCA 180 

GTGCAGTGAT GAGTGCGGAA GTGAAGGTCA CAGGGCAGAA CCAGGAGCAA TTTCTC5CTCC 240 

15 

TAGCCAAGTC GGCCAAGGGG GCAGCGCTGG CCACACTCAT CCATCAGGT6 CTGGAGGCCC 300 

CTQGTGTCTA CGTGTTTQGA GAACTGCTGG ACATGCCCAA TGTTAGAGAG CTGGCTGAGA 360 

20 GTGACTTTGC CTCTACCTTC CGQCTGCTCA CAGTGTTTGC TTATGGGACA TACGCTGACT 420 

ACTTAGCTGA AGCCCGGAAT CTTCCTCCAC TAACAGAGGC TCAGAAGAAT AAGCTTCGAC 480 

ACCTCTCAGT TGTCACCCTG GCTGCTAAAG TAAAGTGTAT CCCATATGCA GTGTTGCTGG 540 

25 

AGGCTCTTGC CCTGCGTAAT GTGCGGCAGC TGGAAGACCT TGTGATTGAG GCTGTGTATG 600 

CTGACGTGCT TCGTGGCTCC CTGGACCAGC GCAACCAGCG GCTCGAGGTT GACTACAGCA 660 

30 TCGGGCGGGA CATCCAGCGC CAGGACCTCA GTGCCATTGC CCGAACCCTK AANAAAAACC 720 

ATTAAAGTTA CGACGGCAGC AGCAGCCGCA GCCACATCTC AGGACCCTGA GCAACACCTG 780 

ACTGAGCTGA GGGAACCAGC TCCTGGCACC AACCAQCGCC ASCCAGCAAG AAAGCCTCAA 840 

35 

AGGGCAAGGG GCTCCGAGGG ANCGCCAAGA TTTGGTCCAA GTCGAATTGA AAGRACTCTC 900 

GTTTCCTCCC TGGGGATGTG GGGTCCCAGC TGCCTGCCTG CCTCTTAGGA GTCCTCAGAG 960 

40 AOCCTTCTGT GCCCCTGGCC AGCTGATAAT CCTAGGTTCA TGACCCTTCA CCTOCCCTAA 1020 

CCCCAAACAT AGATCACACC TTCTCTAGGG AGGAGKCAAA TGTAGGTCAT GTTTTTGTTG 1080 

GTACTTTCTG TTTTTTGTGA CTTCATGTGT TCCATTGCTC CCCGCTGCCA TQCTCTCTCC 1140 

45 

CITGTTTCCT TAAGAGCTCA GCATCTGTCC CTGTTCATTA CATGTCATTG AGTAGGTGGG 1200 

TAGCCCTGAT GGGGGTCGCT CTGTCTGGAG CATAACCCAC AGGCGTTTTT TCTGCCACCC 1260 

50 CATCCCTGCA TGCCTGATCC CCAGTTCCTA TACCCTACCC CTGACCTATT GAGCAGCCTC 1320 

TGAAGAGCCA TAGGGCCCCC ACCTTTACTC ACACCCTGAG AATTCTGGGA GCCAGTCTGC 1380 

CATGCCAGGA GTCACTGGAC ATGTTCATCC TAGAATCCTG TCACACTACA GTCATTTCTT 1440 

55 

TTCCTCTCTC TGGCCCTTGG GTCCTGQGAA TGCTGCTGCT TCAACCCCAG AGCCTAAGAA 1500 

TGGCAGCCGT TTCTTAACAT GTTGAGAGAT GATTCTTTCT TGGCCCTGGC CATCTCGGGA 1560 

60 AGCTTGATGG CAATCCTGGA AGGGTTTAAT CTCCTTTTGT GAGTTTGGTG GGGAAGGGAA 1620 
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GGGTATATAG ATTGTATTAA AAAAAAAAAG GTATATATGC ATATATCTAT ATATAATATG 1680 

ACGCAGAAAT AAATGTAOXSA GAAATCTATC TACAAAMWAA AAAAAAAAAA AAAAAAAAAA 1740 

5 

AGGAATTCGA TriTCAAGCTT ATCGATAOCG TCNACC 1776 



10 

{2) INFORMATION FOR SEQ ID NO: 60: 

U) SEQUENCE CHARACTERISTICS: 

(A) LETKSTH: 443 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

20 

ACAGATAAAT AAATAAATAA TAAATTAAAT TAAATAAAAA ATCTGAGCTA ATCTGAATAA 60 
ATTGAGAGAT TTCACATGAA AGCCAGGATT TCTGGCTTCC CAGGAACAGT CAGAAGAGCT 120 
25 AGCTAGCAAC ACTGGTCTGC TTGGCTACCT TCTTTGGAAC AACATGAAAT CTAGCTCCCT 180 

riTm ' i ' rrr tttttggccc acttcatcca ttcacatgac ctgoctggcc tctgcaggta 240 

AGTGAGTATG CAACAAAAAT GTAGCACAGG TTTTGTCGCT GAACTACGTG GTTTCAGGTC 300 

30 

CAGCTCTGCC ACTTGCTAGC ATGACCTCGT GCCGAATTCC NGCACGAAGT TTTTTTTTTT 360 

TTTTTCAGTG CTCCAGTCCC CCTATTGGAG AATCCTGCCC CCCCCTGGGA CAGAATGTTC 420 

35 ACCCTGGCCC CGCGANTCCC TGA 443 



40 (2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2888 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOliOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

50 TTAATGTTGT CAATAACCAC CAGGCCAAAC AGAATTTATA TGACCTGGAT GAAGATGATG 60 

ATGGTATAGC TTCCGTTCCT ACTAAACAGA TGAAGTTTGC AGCCTCAGGC GNCTTTCTCC 120 

AOCACATGGC TGGGCTAAGC AGTTCCAAGC TTTCCATGTC CAAGGCCCTC CCTCTCACCA 180 

55 

AAGTGGTTCA GAATGATGCA TACACAGCTC CTGCTCTCCC TTCCTCTATT CGAACAAAAG 240 

CCTTGACCAA CATGTCCCGG ACACTGGTGA ACAAQGAAGA ACCCCCCAAA GAGCTGCCAG 300 

60 CTGCTGAGCC TGTTCTCAGC CCATTGGAAG GCACCAAGAT GACTGTGAAT AATCTGCACC 360 
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CTCGAGTCAC TGAGGAGGAC ATTGTTGAGC TTTTCTCTGT GTGTGGGC3CC CTCAAGCGAG 420 

CTCGftCTGGT CCATCCTGGG GTAGCX5GAGG TG G T G TTTGT GAAAAAGC5AC GATGCCATCA 480 

5 

CCGCATATAA GAAGTACAAC AACCQGTCTC TGGACQGGCA GCCGATGAAG TC3CAACCTTC 540 

ACATGAATQG GAATGTTATC ACCTCAGACC AGCCXATCCT GCTGCGGCTG AGTGACAGCC 600 

10 CATCAATGAA AAAGGAGAGC GAGCTGCCTC GCAGGGTGAA CTCTGCCTCC TCCTCCAACC 660 

CCCCTGCYGA AGTGGACCCT GACACCATCC TGAAGGCACT CTTCAAGTCC TCAGGGGCCT 720 

CTKTGACCAC GCAGCCCACA GAATTCAAAA TCAAGCTTTG AGCAGGGGAG TGAGGCAGCX: 780 

15 

AGAAOTGGGG GCAGAGGAGG GTC3QCTCTGT TTCCCCAAGG CAAAGCTTAT GACCAATGGG 840 

CCATCGGACT GGAGACCXCT GATTCTQGGA AGGGTTGCCA GGGATAAAGA GCTTCCTCAC 900 

20 TOGATGGGiAC CXXXCTPTCT GTGTTGTGTT CTGCCCTGTG CTCTTCTCTC TACGTTAACG 960 

TTTCCTGTAG TATGTTTCTT CATCTCATCG CCAAGGTAGG CTTGTGTTTT TCAGTGTGTG 1020 

CX:tCCXXX3AG CCTCAGCXXX: AAGCTGATTT CTTATCTGGA AATGGTACAC TGAATTCTCT 1080 

25 

GGGTOGCTTT CTTGTGGCCC CATGGGATGC AGCGTGQGQG CTGTCTGAAG GACCCTGCTT 1140 

TTTCCAGGGG CCGAGGGGCT GCCTTTCCTT TGTGTGTATT AAGCTTTTCA AACAATGGAG 1200 

30 GGGATGGAGA GCCCTGGTGT CCTGACGGGA GCCAGGTOGG CCTGAGAGCT GTGCCGCTCC 1260 

TCTGTCTTGT CAGTGGAGGT GCCTGGGTGG GGAGCAGGTC 'TCAGGCCTCT TGTCCTCTCC 1320 

CCAGTGGCTC CAGGCCTCAC TAGTGGCAAG GGCAGGATGA GGCTGCACCG CTGGGAAGAG 1380 

35 

TCTATCTAAG YTCTTGGCTT GGAGTCCXGT GTCX3TCTCCR CCCAGAGGAA GTTCTCCAGA 1440 

GTTCACCTTT CCCTTTTCCT TGAGTTGTOC TGAATGCCCX: ACCCCAGCTC TCTTTCCCTT 1500 

40 CTGGGTGTCT TTGCTGQGAG GGGGCTGTOT TC3TGAGCXXT CCCQGTTCTC ACCTCGCX7PG 1560 

GCACTTAACC ACAaXTQGT TTTGTGTAGC CGCCAGCTCT CTTCTGGTTG QGCCTTTGAA 1620 

AGGCTCAGCC TCCXMTCTG CAGTCCTTGG GTTTGGAGCT TATTTGAATG GAAGAGGTCA 1680 

45 

GTrTGTTCCT GGCTCTCCAT TTCTGGCCTC AGTTGTCTAC AGGACAGTC3G TCAGGGATGC 1740 

CTGGAGGCAT ATATCCAGCT GCCACCAAGG GGCACTGTTT GTTCCCACTT ATGTGAGTGA 1800 

50 CCCCATCCAT CCATGACCAG AGGATTATTT TCCTGCCTTG GCAGAGGAGG AGGAGTCAAG 1860 

GGAGCAGGGC AGCTCTACCA GGCAAGGTGT TTCCCCAGCA TAGGCGCAGA CAGTTGGGAC 1920 

GAAACTTCAG AGCCCAGGCA GTCCCTGAAT GACCAGGCCA GTGTTGTCAC TGAGTGGTCC 1980 

55 

CCTGCTGGTT GGGAGTGAAG AGAATCCAGG CTGGCAGAGC TGGAGCCAGT TGGGGAGCAC 2040 

GGTTCTGGGA GCTCTGCAAA ATCAGTAGCA AGTGCTGGAA AAGGCACATG CCGAAGATAC 2100 

60 TCAAGAGCTC CCAAGATTTG CTTGAGGCTA GCCCAGTGAA RAAAACCAGA GACTCATGTT 2160 
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TCCAGGGGTC AGTCTGTCAG GCAGGAAGGA CCCAGGATTT GAACCCAGCT TCAGTGTGCA 2220 

GGCTCTGAGG CTGCCCAGGA CX3GGAAAGTC CAAGQAAGGG GCCTGGTGGT GCTCCACTTC 2280 

5 

CAGTTCTTTA AAGAATGCOX* CTTTrrATTC TCCTAACCCT TTCAAGTGGG TGCAGACTTC 2340 

TCGTTAGCAG CTGGAAGACA TTCCTCCCAC ACTTTTCCCT OXXTOGCCCA AGAGAGCATC 2400 

10 CAGAAGGCAG TAGGACCTGG TTTTTCAGGT ACTGGGAGCC GGGGGCTCAC TGCTTGCACT 2460 

GTGCTTAGGG TAGQGATGGT AAATATCCTC CCTGCATQGC TTTATCCTCC CTCTCATCCC 2520 

AAAQCAGGTA TCTTCTQGTT GTCACAGAGT TTCATTGAGT CCAGCTGCAS CCACGTOQCC 2580 

15 

ATCTQGAQCT GGTQCTATAG GTGACCATCT GGTACATTGA GGGGAOCTGT TTGCCTCCTC 2640 

CACTCTATAA GCAGTCATCT TGQGAGACCG GGAQGAGAAG GTGGTGGGCT AGTCCTGTGT 2700 

20 CCTCCTCCAC TTCCCATGCC TCTATGTTAC CXATCTGTGT CTCCTGTGCA GAAQGAGAGG 2760 

AAGGGGCATT AAGAGATGAA GGGTGATTAT GTATTACTTA TCCATTTCTG AATAAACATT 2820 

TGTTATTCCT AAAAAAAAAA AAAAAAAACT CGAGGGGGGG CCCGGWACCC AWATCOXSK 2880 

AAAGT6AG 2888 



25 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 62: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

CACTAGTATA ATTTATAATT ATAAOCTATT CTGATTTCTT TTCAAATATT AGGTGTCCTA 60 

GTTGCCTATG AAGGTTTGCC ACTTCATCTT GCACTGTTCC CCAAACTTTG GACTGAGCTA 120 

45 TGCCAGACTC AGTCTGCTAT GTCAAAAAAC TGCATCAAGC TTTTGTGTGA AGATCCTGTT 180 

TTCGCAGAAT ATATTAAATG TATCCTAATG GATGAAAGAA CTTTTTTAAA CAACAACATT 240 

GTCTACACGT TCATGACACA TTTCCTTCTA AAGGTTCAAA GTCAAGTGTT TTCTGAAGCA 300 

50 

AACTGTGCCA ATTTGATCAG CACTCTTATT ACAAACTTGA TAAGCCAGTA TCAGAACCTA 360 

CAGTCTGATT TCTCCAACCG AGTTGAAATT TCCAAAGCAA GTGCTTCTTT ^ AAATGGGGAC 420 

55 CTGAGGGCAC TCGCTTTGCT CCTGTCAGTA CACACTCCCA AACAGTTAAA CCCAGCTCTA 480 

ATTCCAACTC TGCAAGAGCT TTTAAGCAAA TGCAGGACTT GTCTGCAACA GAGAAACTCA 540 

CTCCAAGAGC AAGAAGCCAA AGAAAGAAAA ACTAAAGATG ATGAAGGAGC AACTCCCA1T 600 

60 
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AAAAGGCGGC GTGTTAGCAG TGATGAGGAG CACACTGTAG ACAGCTGCAT CAGTGACATG 660 

AAAACAGAAA CCAGGGAGGT CCTGACCCCA ACGAGCACTT CTGACAATGA GACCAGAGAC 720 

TCCTCAATTA TTGATCCAGG AACTGAGCAA GATCTTCCTT CXTCCTGAAAA TAGTTCTGTT 780 

AAAGAATACC GAATGGAAGT TCXATCTTCG TTTTCAGAAG ACAIGTCAAA TATCAQGTCA 840 

CAGCATC3CA6 AAGAACAGTC CAACAATQ6T AGATATGACG ATTGTAAAGA ATTTAAAGAC 900 

CTCCACTGTT CCAAGGATTC TACCCTAGCC GAGGAAGAAT CTGAGTTCCX: TTCTACTTCT 960 

ATCTCTGCAG TTCTGTCTGA CTTAGCTGAC TTGAGAAGCT GTGATOGCCA AGCTTTGCCC 1020 

15 TCCCAGGACC CTOAGGTTGC TTTATCTCTC AGTTGTGGCC AOTCCAGAOG ACTCTTTAGT ' 1080 

CATATGCAGC AACATCACAT TTTAGATACC CTGTGTAGGA CCATTGAATC TACAATCCAT 1140 

GTCGTCACAA GGATATCTGG CAAAGGAAAC CAAGCTGCTT CTTGACATTA GGTCrTAGCAT 1200 

20 

GTCTACTTTT AAGTCCCTCA CCCCCAACCC CCATGCTGTT TGTATAAGTT TTGCTTATTT 1260 

GTTTTTGTGC TTCAGTTTGT CCAGTGCTCT CTGCTTGAAT GGCAAGATAG ATTTATAGGC 1320 

25 TTAATTCTTG GTCAGGCAGA ACTCCAGATG AAAAAAACTT GCATCTTCAG TATACTTCCT 1380 

AAAGGQCAAT CAGATAATGG ATATGTTTTA TGTAATTAAG AGTTCACTTT AGTQGCTTTC 1440 

ATTTAATATG GCTGTCTGGG AAGAACAGGG TTGCXTTAGCC CTGTACAATG TAATTTAAAC 1500 

30 

TTACAGCATT TTTACTGTGT ATGATATGGT GTCCTCTGTG CCAGTTITCr ACCPTATAGA 1560 

GGCAGATTGC CTCCGATCGC TGrrQGTTCTT ATTATCAAAA TTAAGTTTAC TTGTATACGG 1620 

35 AACAACCACA AGAAATTTGA TTCTOTAAAG AATCCTCTTT AGCTGTGGCC TQGCAGTATA 1680 

TAAATGGTGC TTTATTTAAC AGAATACCTG TGGAGGAAAT AAAGCACACT TGATGTAAAA 1740 

ATAATTGrrr TA1TTTTATT GACATGACTG ATTGATTGCT ATTCTGTGCA CTTAATTAAA 1800 

CTGATTGTGA TGACTTWWAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 1851 



40 



45 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3542 base pairs 
50 (B) TYPE: nucleic acid 

(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

55 

TCCAATGCTG ATGAGCGTCT TCGCTGGCAG GCCAGCTCCT TGCCTGCTGA TGACCTTTGC 60 
ACAGAAAATG CCATCATGCT GAAACGATTC AATAGGTATC CGCTGATCAT TGACCCCTCT 120 
60 GGACAGGCCA CAGAATTCAT TATGAATGAA TATAAGGWTC GTAAGATCAC ACGGACCAGC 180 
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TTCCTGGATG ACGCCTTCAG AAAGAACTTA GAGAGTGCAC TGAGATTCGG TAACCCCCTT 240 

CTGGTCCAGG ATGTGGAAAG CTACGATCCA GTTTTGAACC CGGTGCTGAA CCGTGAAGTG 300 

5 

CGGCGAACAG GGGQGAGAGT GCTGATCACT CTCGGGGACC AGGACATAGA CCTGTCGCCA 360 

TCGTTTGTCA TCTTCCTGTC CACCCGQGAT CCAACTGTCG AGTTCCCAOC AGATCTCTGr 420 

10 T0CXX3GGTTA CTTTTCTAAA CTTCACAGTT ACCCGTAGCA GTTTACAAAG CCAC3TGTCTA 480 

AATGAAGTAC TTAAAOCAGA AAGACCTGAT GTGGACGAGA AACGATCTGA TCTTCTTAAA 540 

CTTCAAGGGG AATITCAGCT COGTTTGCGT CAGCTGGAAA AATCTCTACT ACAAGCTCTG 600 

15 

AACGAGGTGA AAGQGCGCAT TTTGGATGAC GACACGATCA TAACX^CTCT GGAGAACCPG 660 

AAGAGAGAGG CTGCAGAGGT CACCAGGAAA GTTGAGGAGA OGGACATTGT CATQCAGGAG 720 

20 GTOGAGACCG TGTCCCAGCA GTAOCTCCCG CTCTCCACCG CCTGCAGCAG CATCTACTTC 780 

ACCATGGAGT CCCTCAAGCA GATACACTTC TTGTACCAGT ACTCCCTCCA GTTTTTCCTG 840 

GACATTTATC ACAACGTCCT ATACGAGAAC CCGAACCTGA AGGGTGTCAC CGACCACACA 900 

25 

CAGCGCCTGT CCATTATAAC AAAGGACCTC TTCCAGGTGG CXHTTAACCXS AGTGGCTCGA 960 

GGCATGCTGC ATCAGGACCA CATTACCTTT GCCATGCTGC TGGCAAGAAT CAAACTGAAG 1020 

30 GGCACXXSTQG GGGAGCCCAC CTACGATQCA GAATTCCAGC ACTTCTTGAG AGGAAA3GAG 1080 

ATTGTCCTGA GTGCTGGCTC CACXXXXAGG ATCCAGQGCC TGACTGIGGA GCAGGCX3GAG 1140 

GCGGTOGTGA GGCTGAGCTG CCTTCCCGCX3 TTTAAGGACT TGATTGCAAA GGTTCAGQCA 1200 

35 

GACGAGCAAT TTGGCATCTG GCTGGACAGC AGCICCCCGG AGCAGACTffP GCCCTAOCTC 1260 

TQGAGTGAAG AAACACCTGC AACACCCATT GGCCAGGCCA TCCACCGCCT GCTCCTGATC 1320 

40 CAGGCTTTCC GGCCCGATCG CCTGTTGGCC ATGGCCCACA TGTTTCTTTC AACAAACCTT 1380 

GGGGAGTCTT TCATGTCCAT CATGGAGCAG CCGCTCGACC TGACXTACAT TGTGGSCACA 1440 

GAGGTGAAGC CCAACACTCC TGTCTTAATG TGCTCTGTGC CTGGTTATGA TGCCAGTGGA 1500 

45 

CATGTCGAGG ACCTTGCAGC CGAGCAGAAC ACGCAGATCA CTTCAATTGC AATCGGCTCT 1560 

GCAGAAGGCT TTAACCAAGC AGATAAOGCA ATAAACACCG CTGTAAAGTC GGGCAGGTGG 1620 

50 GTGATGCTGA AGAATGTGCA TCTQGCCCCA GQGTGGCTGA TGCAGCTG6A GAAGAAGTTG 1680 

CATTCCCTGC AGCCGCATGC CTGCTTCCGA CTCTTCCTCA CCATQGAGAT CAACCCCAAG 1740 

GTGCCTGTGA ATCTGCTCCG TGCGGGCTGC ATCTTTGTGT TCGAOOCACC GCCAGGGKTG 1800 

55 

AAGGCCAACA TGCTGAGGAC GTTCAGCAGC ATTCCCGTCT CACGGATATG CAAGTCTCCC 1860 

AACGAGCXTTG CCCGCTTGTA CTTCCTGCTG GCCTGGTTTC ATGCGATCAT CCAAGAACGC 1920 

60 TTACGATACG CACCACTGGG GTGGTCAAAG AAGTATGAAT TTGGAGAGTC TGACCTGCGG 1980 
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TCANVTTGCG ATACGGTGGA CACGTGGCTG 
TCACCGGATA AGATCCCGTO GTCTGCACTA 

5 

GQGCXXX3TGG ACAAC3GAGTT TGACCAGCCT 
ACAACCAOGA GTTTCGACAG TGAGTTTAAG 
10 ATTCAAATGC CAGATGC3CAT GCAGGCGAGA 
CGACACCXy^G ACGCCCTCCT GGCTGGGCCT 
CACACAGQGT GTGGACATGA TCAGTAAAAT 

15 

CGACXrrGGCC TACGCAGAGA CTGAGAAGAA 
OXTGCCTGG ATQCX3GACAC TGCACAOCAC 
20 GACGCTGAGC CACCTCAAGC GCACCGTGGA 
TGAGAGAGAA GTTGAAGATGG GCGCAAAGCT 
TGTCGTCCAG GTGTGCGAAG GAAAGAAGAA 

25 

CGAGCTAGTG AAAGGGATCT TGCCTCGGAG 
GACCGTCATC CAGTGGGTGT CCGACTTCAG 
30 ACTGGCAGCT GCATCTGGTG GCX3CCAAGGA 
CCICTTCGTG CCTGAGGOGT ACATCACTGC 
CTGGTCCCTG GAGGAGCTCT GCCTGGAAGT 

35 

TGACGCTTGC AGCTTCGGAG TCACXX3GTTT 
CAAGCTGTCA CTGTCCAATG CCATCTCAAC 
40 CAAGCAGACA AACACCX5AGA AGAAGGCCAG 
CACCCGTQCA GACCTCATCT TCACCGTGGA 
CAGCTTCTAC GAGCGGGQTG TCGCAGTCTT 

45 

TTTCTGTAAT AGTGAAAGTT GGTATTTAAC 
CTGAGCTTGT GAAAAGAAAG TGGTTGGTCT 
50 GGTTGGGAGT GGTX3GAAATT GGAAGGATAC 
GCTCCTTTGA GGAAATAAAA CACTAAGCAT 
GG 

55 



GATGACACGG CCAAGGGCAG GCAGAACATC 2040 

AAGACCTTAA TGQCXX3WGTC CATTTATGGC 2100 

CTQCTCAACA CCTTCCTGGA GCGCCTGTTC 2160 

CTGQCATGCA AGGTCQACGG ACATAAAGAC 2220 

GGAGTTTOTG CAGTGGGrTGG AGTTGCTCCC 2280 

GCCCAACAAC GCCGAGAGAG TCCTCCTTAC 2340 

GCTGAAGATG CAGATGTTGG AGGATGAGGA 2400 

GACX3AGGACA GACTCCACGT CCGACGGGCG 2460 

CGCGTCCAAC TQGCTGCACC TCATCCCCCA 2520 

GAATATCAAG GATCCTTTGT TCAGGTTCTT 2580 

QCTTCAQGAC GTTCGCCAGG ACCTTGCAGA 2640 

GCAGACCAAC TACTPOCGCA CXXTTGATCAA 2700 

CTGGTCCXIAC TACACGGTCC CTGCCGGCAT 2760 

OGAGAGGATC AAACAGCTGC AGAACATCTC 2820 

GCTAAAGAAC ATCCACGTGT GCCTGGGTGG 2880 

CACCAGGCAG TATGTGGCCC AGGCCAACAG 2940 

CAAOSTCACC ACCTCACAGG GCGCCACCCT 3000 

GAAACTTCAA GGGGCCACGT GCAACAACAA 3060 

OGCCCTTCCC CTGACGCAGC TGCX3CTGGGT 3120 

TGTQGT7UVCC TTACCTGTCT ACXTTGAACTT 3180 

CTTCGAAATT GCTACAAAGG AQGATCCTCG 3240 

GTOCACAGAG TAAACTTTTC TAGCTGCXXC 3300 

ATTTATTCAT TTTTAAAATA TTTQGAAGGT 3360 

GAGGTTGGAG GAAGCTGAAT GGAATCTGAC 3420 

CAGGAGGTAT TTGGGAAGGC CAATGGCGTG 3480 

GAAAAAAAAA AAAAAACTTA CAANCCNCAA 3540 

3542 



60 



(2) INFORMATION FOR SEQ ID NO: 64: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEZmSS: double 
5 (D) TOPOLOGY: linear 



(xi) SEQUQICE DBSCRIPTION: SEQ ID NO: 64: 



AGGTGATTTT AATGATAGGT GTCATATATA QGAOGGATAA TCTGTTTACA TTCTCTTCTT 60 

10 

CTCGATGCAC TCACAAGCGG GTAACTAGGT GACAAGAAAA CAAAGATCTT ATTCAAAAGA 120 



GGTCTTACAG CAACCCAACG TCTCATCTTC CCATAGTAAA GATGACQGCG CCTTGAGGTA 180 



15 AGCTACAGGC AACACCACTT CCGCGTTTCT CTTGCGCCCT GGTCCAAGAT GGCGGATGAA 240 



GCCACGCGAC GTGTTGTGTC TGAGATCCCG GTGCTGAAGA CTAACGCCGG ACCCCGAGAT 300 



CGTGAGTPGT GGGTGCAGCG ACTGAAGGAG GAATATCAGT CCCTTATCCG GTATGTGGAG 360 

20 

AACAACAAGA ATGCTGACAA CGATTGGTTC CGACTGGAGT CCAACAAG6A AGGAACTCGG 420 



TGGTTTQGAA AATGCTQGTA TAT0CAT6AC CTCCTGAAAT ATGAGTTTGA CATCGAGrTTT 480 



25 GACATTCCTA TCACATATCC TACTACTGCC CCAGAAATTG CAGTTCCTGA GCTQGATGGA 540 



AAGACAGCAA AGATGTACAG GQGTGGCAAA ATATGCCTGA CGGATCATTT CAAACCTTTG 600 



TGGGGCCAGG AATGTGCCCA AATTTGGACT AGCTCATCTC ATGGCTCTGG GGCTGGGTCC 660 

30 

ATGGSTGGCA GTGGAAATCC CTGATCTQAT TCAGAAGQGC GTCATCCAAC ACAAAGAGAA 720 



ATGCAACCAA TGAAGAATCA AGCCACTGAG GCAGGGCAGA GQGACCTTTG ATAQGCTACG 780 
35 ATACTAWTTT CCTGTGCATC ACACTTAACT CATCTAACTG TTCCCCQGAC ANCCTCCACT 840 



CTAGTTGTTA CTAAGTANTG CAGTAGCATT NTGGGGAAGA ACA 883 



40 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1541 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

50 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GGCACGAGGT GGCCTCTACC CTQGGCTCAT CTGGCTACAC AQGGACTCTA AACGCTTCCA 60 



GATTCCCTGG AAACATGCCA CCCGGCATAG CCCTCAACAA GAAGAGGAAA ATACCATTTT 120 

55 

TAAGGCCTGG GCTGTAGAGA CAGQGAAGTA CCAGGAAGGG GTGGATGACC CTGACCCAGC 180 



TAAATGGAAG GCCCAGCTGC GCTGTGCTCT CAATAAGAGC AGAGAATTCA ACCTGATGTA 240 



60 TGATGGCACC AAGGAGGTGC CCATGAACCC AGTGAAGATA TATCAAGTGT GTGACATCCC 300 
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TCAGCCCCAG GQCTCGATCA TTAACCCAGG ATCCACAGGG TCTGCTCCCT GGGATGAGAA 360 

GGATAATGAT GTQGATOAAG AAGATGAGGA AGATGAGCTG GATCAGTCGC AGCACCATGT 420 

TCCCATCCAG GACACXTTTCC CCTTCCTGAA CATCAATGGT TCTCCCATGG CXXXaOOCAG 480 

TGTQGGCAAT TGCflGTGTGG GCAACTGCAG CCCGGAGGCA GTGTGGCXrCA AAACTGAACC 540 

CCTGGAGATG GAAGTACCCC AGGCACCTAT ACAGCCCTTC TATAGCTCTC CAGAACTC3TG 600 

GATCAGCTCT CTCCCAATGA CTGACCTGGA CATCAAGTTT CAGTACCGTG GGAAGGAGTA 660 

COQGCAGACC ATGACCGTGA GCAACCCTCA GGGCTGCXrGA CTCTTCTATG GC5GACCTGGG 720 

TCCCATGCCT GACCAGGAGG AGCTCTTTGG TCCCGTCACai CTGGAGCAGG TCAAATTCCC 780 

AGGTCCTC5AG CATATTACCA ATGAGAAGCA GAAGCTGTTC ACTAGCAAGC TGCTGGAOGT 840 

CATGGACAGA GGACTCATCC TGGAGGTCAG CX3GTCATGCC ATTTATGCCA TCAGGCTGTG 900 

CCAGTCCAAG GTGTACTGGT CTGGGCCATG TGCCXXATCA CTTGTTGCTC CCAACCTCAT 960 

TGAGAGACAA AAGAAGGTCA AGCTATTTTG TCTGGAAACA TTCCTTAGCX3 ATCTCATTGC 1020 

CCACCAGAAA GGACAGATAG AGAAGCAGCC ACCX3TTTGAG ATCTACTTAT GCTTTGGGGA 1080 

AGAATQGCCA GATGGC3AAAC CATTGGAAAG GAAACTCATC TTGGTTCAGG TCATTCCAGT 1140 

AGIGGCTCGG ATGATCTACG AGATGTTrTC TGGTGATTTC ACACGATCCT TTGATAGTGG 1200 

CAGTGTCCGC CTGCAGATCT CAACCCCAGA CATCAAGGAT AACATCGTTG CTCAGCTGAA 1260 

GCAGCTGTAC CGCATCCTTC AAACCCAGGA GAGCTGGCAG CCCATGCAGC CCACCCCCAG 1320 

CATGCAACTG CCCCCTGCCC TGCCTCCCCA GTAATTGTGA ATGCXATCTT CTTCCTTCTC 1380 

TTTTTTATAA TATTGTACAT ATGGATTTTT TTATTGTTTA GATTTAACCA GCTTTTAAAT 1440 

CTCTGTTTTC TGTGACAGTG TTAGAAGTTT GTGATTCTCC AAATATGCCT AGATTTAAAG 1500 

CTGATTTAAT TTATGGAAAA AAAAAAAAAA AAAAAAAAAA A 1541 



45 



50 



55 



60 



(2) INF0BMATIC»7 FOR SBQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 66: 
AGAAAATGAA TGTTAGAAGG TGCCTGCCGA GGCGGGACAG AGTGTTTGCT CGCGCIGGAG 
AAGGCTCTGC TCAGCCCTGA GAGTCCCTTC CTGCCCCACC GATACTGGCA CTTTAAAAAG 



60 
120 
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GAAGCK5ACC GCACAGTGTC CAGACGAATT QGCCCXrCAGA AGATGGGGAG TlCTGTCCTG 180 

CCCTTCTGTG TCTGCGTGAC CTCACCCAGC CTAQGAiQGGA GGTGCATTCA GGGTAGATTT 240 

GCCTCTCATT CAAAGTTCTG GGGCTTTGGG CGGAAAACAG CXrAGCTITOG CXXnGTTGGG 300 

GAGACTCCTC CAGACCAGGA ACCCCAGAAG GAGACAGAGC CTQCCACATC CTCCCACGCC 360 

AGGCCXrrGGG CCAQGGTGAT TQGACTGAGA ATTTGGCCAC AACCAAATTG ATGCTGGCTG 420 

GAACCAGAGG CCAGAAAGCC TQGCCTTGTC CCCATGTC3GG AGCCCTGTCC TCAGCCCTCT 480 

TCTCCCCTTC AGCTCAGTGA ATTCCCACCA GGTGCCCACA GCTCCTQGAC TTCAAATTCT 540 

15 ATATATTCAG AGAGTTGGAG AGTATATCAG AGATATTTTT GGAAAGGAGT TGGTCTATQC 600 

AATCTCAGTT TOGAATCTTC TTGAAAGTTT AATQTTTTTA TTAGGAGATT TAAAC3AAAAT 660 

AAAGGTCTAC AATATCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAT^AAAAAA 720 
AAAAAAAAAA AA 



20 
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25 



35 



45 



55 



(2) INFORMATION FOR SBQ ID NO: 67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 67: 

TTAAGGAATT CGGCMCGATC CCGGCAAGTA ACATGACTAA AAAGAAGCGG GAGAATCTGG 60 

GCGTCGCTCT AGAGATCGAT GGGCTAGAGG AGAAGCTGTC CCAGTGTCGG AGAGACCTGG 120 

40 AGGCCGTGAA CTCCAGACTC CACAGCCGGG AGCTGAGCCC AGAGGCCAGG AGGTCCCTGG 180 

AGAAGGAGAA AAACAGCCTA ATGAACAAAG CCTCCAACTA CGAGAAG6AA CTGAAGTTTC 240 

TTCGGCAAGA GAACCGGAAG AACATGCTGC TCTCTGTGGC CATCTTTATC CTCCTGACGC 300 

TCGTCTATGC CTACTCGACC ATGTGAQCCT QGCACTTCCC CACAACCAGC ACAGGCTTCC 360 

ACTTOSCCCC TTGGTCAGGA TCAAGCAGGC ACTTCAAGCC TCAATAGGAC CAAGGTGCTG 420 

50 GQGTGTTCCC CTCCCAACCT AGTGTTCAAG CATGGCTTCC TQGCQGCCCA GGCCTTGCCT 480 

CCCTGGCCTG CTQQGGGGTT CCGGGTCTCC AGAAGGACAT QGTGCTQGrrC CCTCCCTTAG 540 



CCCAAGGGAG AQGCAATAAA GAACACAAAG CTGAAAAAAA AAAAAAAAAA AACTCGTAGG 



600 



GOGGGCCCGT ACCCAATCGC CCTNTCGTG ^29 
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(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBCNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

CTGCTAGCCG GCCGGCGCAG GCTGCCGAGC GGGTGAQCGC GCAGGCCAGG CCAAAGCCCT 60 

GGTACCCGCG CGGTGCGGGC CTCAGTCTGC GGOCATGQGG GCGTCCGCGC GGCTGCTQOG 120 

AGCGGTGATC ATC30GGGCCC CGGGCTCGGG CAAGGGCACC GTGTCGTCGC GCATCACTAC 180 

ACACTTCGAG CTGAAGCACC TCTCCAGCGG GGACCTOCTC CGGGACAACA TCXTTGCGGGG 240 

CACAGAAATT GGCGrTGTTAG CCAAGGCTTT CATTGACCAA GGGAAACTCA TCCCAGATGA 300 

TGTCATGACT CGGCTGGCCC TTCATGAGCT GAAAAATCTC ACCCAGTATA GCTGGCTGTT 360 

GGATGGTTTT CCAAQGACAC TTCCACAGGC AGAAGCCCTA GATAGAGCTT ATCAGATCGA 420 

CACAGTGATT AACCTGAATG TGCCCTTTGA GGTCATTAAA CAACOCCTTA CTGCTCGCTG 480 

GATTCATCCC GCCAGTGGCC GAGTCTATAA CATTGAATTC AACCCTCCCA AAACTGTGGG 540 

CATTGATGAC CTGACTQGGG AGCCTCTCAT TCAGCGTGAG GATGATAAAC CAGAGACGGT 600 

TATCAAGAGA CTAAAGGCTT ATGAAGACCA AACAAAGCCA GTCCTGGAAT ATTACCAGAA 660 

AAAAGGQGTG CTGGAAACAT TCTCCGGAAC AGAAACCAAC AAGATTTQGC CCTATGTATA 720 

TGCTTTCCTA CAAACTAAAG TTCCACAAAG AAQCCAGAAA GCTTCAGTTA CTCCATGAGG 780 

AGAAATGTGT GTAACTATTA ATAGTAAGAT GGGCAAACCT CCTAGTCCTT GCATTTAGAA 840 

GCTGCnTTC CTAAGACTTC TAGTATGTAT GAATTCTTTG AAAATTATAT TACTTTTATT 900 

TCTACTGATT TTATTTTGGA TACTAAGGAT GTGCCAAATG ATTCGGATAC TAAGATGCAT 960 

CGTTTGAAAT CATCTAGTGT GTTGTATGCA GTTATCCTCA AAAACATCAG CGATGTCTGA 1020 

ACCTTTAAAA CATCTGTTAG AGCAAAATTA AAAGAGCATT TGGTAGTAAT CTAACTTTTT 1080 

GTTCAGTTAA TAAGTQGTTG ATAAAGTTTC CATATTTTTC TGGAAAAGTT AAAAAAAGTT 1140 

ACATGTCATT TGGAGAAAAT ACGTAATCAG AAATTTGTGC ATAGATTGAT GCCAAAAAAG 1200 

ACATTTCCAG CATTGTGGAA CATOGTGAGA CACTATATAA AATTCCAGAA AGAAACCAAC 1260 

TGGATTTACA GATTTATTGT GAGACACAAA TTCACTQCTG CCTTTACACT AAGAAATGTA 1320 

TATGTTAACC ATATATGCTG TATTTATTTT GTCGTTAAGC ATACTTTCAG TTTACTCAGA 1380 

ATTTTCAATT TGCTATAAAG ATGTATCAAT TAGCATATAG AAAAATATTA CTTTAAGATG 1440 

ACTTGTTTCC TTTGAAAATA CCTGTGTACT GAGGGTTATG ATTTGTGTCA AAAATTGACA 1500 
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TAWJiGcrrr TACAAGCACC AAAGTTCSAAT GAAOTTTCAA CAAAATGTAA TTAAAGTCTA 1560 
TGTTrrCACT TATGACTCAG GTTAAGAAAT GTOrrTTAGG ATCTACTTQC TQGTTTTTCT 1620 
TTTTCATCCA AATOTOTGAT CTGCCCTCAT AAATAACAAG TTATNCyTACC ATCTCCCCCX5 1680 
CCAATAAAAA AAAAAAAAAA AAAAAAAAAC TCGftGGC5GGG GCCCGGTACC CAATTCTOCG 1740 
NAATAGGNAG T 

(2) INFORMATION FOR SEQ ID NO: 69: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 
20 (D) TOPOLOGY: linear 



1751 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

GGCACGAGAT TATCTATTAA AATCTTTTTG AATTGTGAAA TATTAGAATA TTGTTACTAT 60 

TEGACCCAAC TCAAAATCTC CATQQGAAAA TACCTGTCGA TACCCACAGT ATTCTTGAAA 120 

ATAATCAGAT GCAOTATCAC AGCTGTGTCA GACTCTAGTA CCAGTTC3GGC AATCAAGGCA 180 

30 CAGCTAAAAA TTGAAAACAA AGATCTGGAC AACAAAACAG CCAAAGGTGG GGGTCAAGAA 240 

GCTCTGACGT GTACCTAGCT GTAGAATGCT ATGCACACCT QCCAGGTGTA GTGTGCATAT 300 

CCAGGAAAAA CIGCAGAGAG CCCCAC?rCTT CACCTCTGGT TGACCATGAG CTCTGTGTAA 360 

GCAGGAAGTC AAGGCTAAGG CAGATTTAAG CTCTGAAAGC ATTCCACAAC ATACACACAA 420 

ATCGTCCAAA GCATTAAGGA AATCTTGTTA CTGCTAAGTG TTGCTGACCC AGGAACAACT 480 
40 CCTACTCAGC TGGACTTAAA AATAAAAA 



508 



45 (2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 245 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
55 TACATAGAGC AAAGAGAAAT TTCCAGAATT TCTARAATTC TGGAAAGAGA ATTTTCCTGA 
GATTOCAGAT TTCCTTGTGT CCTCAGGTCA TGATGAGGGC TGTTTTCCCC TGTTGTCCTT 
TCCTCACACT CATCCTTCCT CTCCTAGAGT GTCTGGTTGG CATGATCATG TGCTACCTAG 

60 



60 

120 
180 
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GCATTTCTTT CACTGATACA AGGAAAACTG CAGGGTTAAA AAAAAAAAAA AAAAAAAAAA 240 
NCNCG 245 



(2) INFORMATION FOR SBQ ID NO: 71: 

10 (i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE INSCRIPTION: SEQ ID NO: 71: 
ATGTTCCTCA TGAGGATGCA CTTGTGCTTC TGCAAGTATT GCTGCAGCTT CATAGTGACT 60 
20 CCCACCAGCA CCAGCAATAC AGCTAGCTAC CTGTGGCCTT QGATCTCAGC CAGCATGGCT 120 
GGGAGAGGGA GCAGCTGGGC ATGTACCCTA AATGCTGTTA CCAOGGAAGG ACTCCCAGAG 180 
TGAAGACAAG TAGGGACTTC CTGCACAGGT GGTACATGTG CTCTCTQTAT CCATACTTTT 240 

25 

TTTTTTTTTT TTTTGAGATA GAGTTTCACC CTTGTTGCCC TQGCTGGfiGT GCAATOGTGC 300 
GATCTCAGCT CACTGCAACC TCTCTGCCTC CCOGGTTCAA GTGATTCTCC TGCCTCAGCC 360 
30 T 361 



35 (2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEimSS: doxible 

<D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

45 AGGATCACAC AATAGAGAAC ACTGTAGTAA CATTTCGGTC TGCTCACAAG ACCCAGAACA 60 

TTGATCAGTT TTTGTTGTTG GTTTATTATT TTTCTGTTAA AAAATTGTGA AAAGTTTGTT 120 

TTAGCTAGAT GATATTTTAA TAGCTQOGAG TGCTTTGGAA CTATAAAGAT GTCACTACTT 180 

50 

AACACACATA CCTTATGTTT TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG 240 

TTAGCCAACC ATCTAGCATT TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGATGT 300 

55 TCCCATAACA GCAACAAAAC AGCAACAAAA ACATITCATA AATATCACTT GATAGACTGT 360 

AAGCACCTGC TTAACTTTGT GTCCCAAATA TTTAGTGTGT ATATATATAT ATATATATAC 420 

ACACACACAC ACATATATAT TCAACAAATA AAGCAAAATA TAACATGCAT TTCACATTTT 480 

60 
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GTXrrTTCCCT GTTACGATTT TAATAGCftGA ACTGTATGAC AAGTTTAGCT GATCCTAGCA 540 

TATGTTAAAT TCAAATTAAT GTAAAACftGA TTAACAACAA CAAAGAAACT GTCTATTTGA 600 

5 GTGAAGTCAT GCTTTCTATT ATAATAACTT GGCTTCGGTT ATCCATCAAA TGCACACTTA 660 

TACTGTTATC TGATTGfTTTA TAATAAAGAA TACTGTACTT ATAAAAAAAA AAA 713 



10 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 73: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GAAAGTCAGA GCTGTCCAAT CCCTCAGCAC CTTTTAGATT TGCTCCAAAT TAGAAACGTC3 60 

GGGACTATCT GTTCTGGGCA ATCACAGGTC TGGAAAATGG CTCTGCAGGC TCTTGATAGT 120 

GAGACAGTOG TCATCTTACC AGACATQCAT CTGATTTTAA GCCTCAGGCT AATCCACAAT 180 

GCTCGGCCAT GCCTATGATT AACAAACAAA AGCAAAATCT GCTTTTATAG TTTAGGAAAC 240 

30 CTGGATAGAA CAGTATTTTT CAGCATTCTT QGATAAAGCA GTTCTGCATT TTTAAATTQG 300 

GACTGCAGAA GTGACTGTCT ATAGTTGTGA AATACAAAAA ATQGTATGTT TGATCAGAAA 360 

AGGAAGCCCG TGCCTGGCAC TTGGAAAGAT ACT6AGCATC ATAACGCEAA TGAGAAAATG 420 

TAGGCTCTGT GAATGTTAAC TACAAATCAG GTTAGGAAAG CATATGACAC CCTTTGTCAA 480 

ACTAAGCTTC ACTAGGAQGA CCTGTGCTCA TAGAAGAATA TGCTTTAAAA GTATCAATTT 540 

40 TCCACAGTCG AT6ATGGAGA AAAGTTCATT TGCACCAGAA TGCTGATAGT CACAATACAC 600 

ACX:CIGACAT ATATAACAAT ACAGTTTTCT GTAAACAGAA GTTCTTCCTC TTCCAATTCA 660 

GGAGTCAGTC AGAGCATAAA TATTGCATGT TTCACTTTAG AAACTGATTC ATTTTAGAAA 720 

GCAGATCTGG ATTATTTTGC AGGGTAGAAA TGAAGGCTAT TTCTGGCATT CTTGCTCAAA 780 
AAGTCAATAT ATGTACATTA AGTATAAAAA AGGGTCTCTT TCACCTCTTT TGTTTCGTAG 



840 



50 CATTQGCTAC ATAACTCGTG CC 



55 (2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4602 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANI^DNESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 74: 

5 GCGAQGC3GGC GKGGGGAGCA GCGCCGARGC CGCCC3CCTCC GCCTCCGCCG CCTAGGACTA 60 

GGGGCTC5GGG GRCGGACAAG CCCCGATGCC GC3C3GGWCACG C3AAGAGCCGA GACCCCCGGA 120 

QCAGCAGGAC CAGGAAGGQG GAGAGGOSQC CAAGQCGGCT CCGGAGGACC CGCAACAACG 180 

GCCCCCTGAG GCGGTCGCGG CQC3CGCCTGC AGGGACCACT AGCAGCCGCG TGCTGAGGGG 240 

AGCTCGGGAC CGAGGCCGGG CCGCTGCGRC CGCCGCGCMG CAGCTGTGTC CCGCCGGAGA 300 

15 AGGCCGAGTA TCCCCGCCGG CGAQGAGCAG CCCCAGCGCC AGGCCTCCCG ACGTCCCCGG 360 

GCAGCAC3CCC AGGCCGCGAA GTCCCCGTCT CCAGTTCAGG GCAAGAAGAG TCCGCX3ACTC 420 

CTATCCATAG AAAAAGTAAC AACTGATAAA GATCCCAAGG AAGAAAAAGA GGAAGAW3AC 480 

20 

GATTCTGCCC TCCCTCAGGA AGTTTCCATT GCTGCATCTA GACCTAGCCG GGGCTGGCGT 540 

AGTAGTAGGA CATCTOTTTC TCGCCATCGT GATACAGAGA ACACCCGAAG CTCTCGGTCC 600 

25 AAGACCGCTT CATTGCAGCT CATTTGCAAG TCAGAACCAA ATACAGACCA ACTTGATTAT 660 

GATCTTOGAG AAGAGCATCA GTCTCCAGGT GGCATTAGTA GTGAAGAGGA AGAGGAGGAG 720 

GAAGAAGA6A TCTTAATCAG TCAAGAGGAG ATAOCATTCA AAGATGATCC AAGAGATGAG 780 

30 

ACCTACAAAC CGCACTTAGA AAGGGAAACC CCAAAGCCAC GGAGAAAATC AGGGAAGGTA 840 

AAAGAAGAGA AGGAGAAGAA GGAAATTAAA GTGGAAGTAG AGGTGGAGGT GAAAGAAGAG 900 

35 GAGAA-KSAAA TTAGAGAGGA TGAGGAACCT CCAAGGAAGA GAGGAAGAAG ACGAAAAGAT 960 

GACAAAAGTC CACOTTTACC CAAAAGGAGA AAAAAGCCTC CAATCCAGTA TGTGCGTTGT 1020 

GAGATCGAAG GATOTGGAAC TCTCCTTGCC CATCCTCGCT ATTTGCAGCA CCACATTAAA 1080 

40 

TACCAGCATT TGCTCAAGAA GAAATATGTA TGTCCCCATC CCTCCTGTGG ACGACTCTTC 1140 

AGGCrrCAGA AQCAACITCT GCGACATGCC AAACATCATA CAGATCAAAG GGATTATATC 1200 

45 TOTGAATATT GTGCTCGGGC CTTCAAGAGT TCCCACAATC TQGCAGTGCA CCGGATGATT 1260 

CACACTGGCG AGAAGCATTA CAATGTGAGA TCTGTQGATT TACTTGTCGA CAAAAGGCAT 1320 

CTCTTAAITC GCACATGAAG AAACATGATG CAGACTCCTT CTACCAGTTT TCTTGCAATA 1380 

TCTGTCGCAA AAAATTTCAG AAGAAGGACA GCGTAGTGGC ACACAAGGCA AAAAGCCACC 1440 

CTCAGGK5CT GATTCCAGAA GCTCTGGCTG CCAATGCAGG CGCCCTCATC ACCAGCACAG 1500 

55 ATATCITCGG CACTAACCCA QAGTCCCTGA CGCAGCCTTC AGATGGTCAG GGTCTTCCTC 1560 

TTCTTCCTCA GCCCTTGGGA AACTCAACCT CTGGAGAGTG CCTACTGTTA GAAGCTGAAG 1620 
GGATGTCAAA GTCATACTGC AGTOGGACGG AACGGGTGAG CCTGATGGCT GATGGGAAGA 1680 



10 



20 



30 



40 
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TCTTTOTOGG AAGCGGCAGC AGTGGAGGCA CTGAftGGGCT GC3TTATC3AAC TCAGATATAC 1740 

TCGGTCCTAC CACAGAGGTT COOATTGAAG ATTCAGACTC TGCXXMACCT TAGTGGACAG 1800 

5 GAAGACTTGG GGCATOQC3AC AXTCAGACT TrOTATTTAA AAGTTAAAAA GGACAAAAAA 1860 

AAAATCTAAA GCATTTAAAA TCTAGTGAAA TAACTOAAGG QCCTGCTCTT TCCATTGTGG 1920 

ATCACAGCAC ACACATACAT ACACCCTCCA CCTCCCX3VTC CaTOnTCTC CCTCTC31TGC 1980 

TCCCCTTATA AAATTCATTCT TCTCTXTACC AGAAAGGTAG ACAAAAAAGA AOCAGCAGCA 2040 

GCTCTTAAAG TGAGGGTTAT TCTCATACTC GGTTCCAGCC ATCAGCAGAC TTCCTGCTCA 2100 

15 TCGGCAGATC CCCCTTTCCA ACCTGTAACT CTGATGTGCT CTGGATCAGC TTTTAACTTT 2160 

TAATCATATA TTACTCTCTT CTAAATCCCT TCTCCnCCTC TACTGCIGCC CTATGGTTCT 2220 

GGCTOCTACC CCCTCCGGCA CACTTATCTT CAAATACCAT ASAATTCTAA TCTCIOAAAT 2280 

CATAGCTCTC CAGTCGCTTT TAAAGAAAGC TGGTCCTCAG CACTAACAAA ATCACTACAA 2340 

TAGCCTAGTG CTTTrTTGGA AGCXTTTTTTA GQGAAGAATG TTAGGTTCAT GGTAACTAGT 2400 

25 ATCCICTTTG AGATTTTTAC AGTCTTGAAA CTTAAGAATT TTGAGAGGGT GAGGAGQGTT 2460 

CTTCAGAATC TAAATTACAG ATAGATGATT GrTTTCTTOTG AATTTGTTTC TTrTCCTTTT 2520 
nTTTCTCCC TACCATTTCC TTACATTTCC CTTGGGGCCC ATCTCTGGCT CCTrGCTTTT 2580 
TGTTKnTGC TTTGCTTTAT CAGTTCATTC CAGCTCCCTG TTAGTGAAGG ACACTGCTGT 2640 
TAGTCAAGGA ACAAAOTCTA TGAGTCCTAA AATTTTAAGT CAAAGAAAAC TGCTCTGTTT 2700 
35 CCCCTPTAGT AACACTTCTC AAGAGGAAAA ACTTCAATAG CCAAAGTTAA TAATCCTATA 2760 
TAATAATTGC TTTOGCTTTC ACCTAAAATT CTGGGCATCA CAATTTCCTT GGGATAGAGG 2820 
TTCTOTTOGG GAATAGATTG CTTATTGCTG TTCACTGGAG AGAAAAGGTA GTGTTTTTGT 2880 
ACAAGGTCAT ACXTGCCAGAA QCCCCAAATC CTATTTTGGC TCATCTTCAG GTAAAGflGTA 2940 
ATTCCTATCC TOTCTOXTC AGAAGCTAGA ATCGAAGGCT TACCCTATTC ATTGITTATr 3000 
45 GTCAGAAATG CATCATGGCT CTTGGAAAGA ATGACGTTTT GCTQGAAAAA AAAAAAARAA 3060 
CMGTTTOTGT TTCACAAACA TGGCTTATCA ATTTTTTCAA AGAATTCTTT TTTCCCAAAA 3120 
AGAGGAGTAA CAAAATOTCA TTTCTGAAAG AGGCTTACTT TATACCAACT AGTGTCftGCA 3180 

50 

nTGGGATGC CAGGGAACAG AGAGTGAGAC ACCTACAATC ACCAGTCTCA AATGCGCTAT 3240 
TGTTTCmT CAGAGTOTTG CAGATTTGCC ATTTCTCCAT AATATGGGGA TAGAAAATGG 3300 
55 AATAAAGATA GAAGGGATOT AGAATATGCT TTCCTGCCAA CATGGTTTGG AGTCGACTTT 3360 
GGTATAITCA CTAGATTTGA AAATACAAGA TTGATTAGAT GAATCTACAA AAAAGTTGTC 3420 
CTCCTCTCAG GTCCCTTTTA CACTTTTTGA CTAACTAGCA TCTATATTCC ACACTTAGCT 3480 

60 



wo 98/39448 



PCT/US98/04493 



297 



10 



15 



20 



25 



30 



35 



TrnTOTCAC ACTTATOTTT TGTCTCCOTA AATTTCATTT GCAGTGGTTA GTCATCAGAT 
ATTTTAGCCA CCTACACAAA AGCAAACTGC ATTTTTAAAA ATCTTTCTGA GATGGGAGAA 
AATOTATTCT CCTTTCCTAT ACCGCTCTCC CAACAAAAAA ACAACTAGTT AGTTCTACTA 
ATTAGAAACT TOCTCTACTT TTTCTTTTCT TTTAQGGGTC AAGGACCCTC TTTATAGCTA 
CCATTTOCCT ACAATAAATT AITOCAGCAG TTTGCAATAC TAAAATATTT TTTATAGACT 
TTATATinT CCTTTTGATA AAQGGATGCT C5CATAGTAGA GTTGGTGTAA TTAAACTATC 
TCAQCXXTTTT CrCTOCTTTC OOTCTGCTC CATATGCCTC ATTCTTCCTTC CftGGGAGCTC 
inTAATCTT AAAGTTCTAC ATTTCATGCT CTTAGTCAAA TTCTGTTACC TTTTTAATAA 
CTCTTCCCAC TGCATATTTC CATCTTGAAT TGGTX3GTTCT AAATTCTGAA ACTGTAGrTG 
AGATACAGCT ATTTAATATT TCTGGGAGAT GTGCATCCCT CTrCTrTGTG C3TTGCCCAAG 
GnOTTTTGC GTAACTGAGA CTCCTTGATA TGCTTCAGAG AATTTAGGCA AACACTGGCC 
ATGGCCGTGG GACTACTGGG AGTAAAATAA AAATATCGAC3 GTATAGACTA GCATCCACAT 
AGAGCACTTO AACCTCCITr GTACCTGTTT GGGGAAAAAG TATAATGAGT GrTACTACCAA 
TCTAACTAAG ATTATTATAG TCTGGTTGTT TGAAATACCA TrTTTTTCTC CTITrGTCTT 
TTTCCX^CTT TCCAATOTAC TCAAGAAAAT TGAACAAATG TAATGGATCA ATTTAAAATA 
TTPrATTTCT TAAAAGCCTT TTTTGCCTGT TGTAATGrTGC AGGACCCTTC TCCTTTCATG 
GGAGAGACAG GTAGTTACCT GAATATAGGT TGAAAAQGTT ATGTAAAAAG AAATTATAAT 
AAAAGGGATA CTTTOCTTTT CAAATCTTTG rTTTCTCTTA TTCTAGGTAA QGCATATTAA 
AAATAAATAT GTAAAGAAGA AAAATAAAAG TTGTCTTCAT GG 



3540 
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(2) INFOroiATION FOR SEQ ID NO: 75: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: X255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
CGCGCCCCGG GCCGGCGGGT TTCTCTAACA AATAAACAGA ACCCGCACTG CCCAGGCGAG 
CGTTQCCACT TTCAAAGTOG TCCCCTGGGG GAGCTCAGCC TCATCCTGAT GATGCTGCCA 
AGGCGCACTT TTTATTTTTA TTTTATnTT ATTTTTrTTT TAGCATCCTT TTGOGGCTTC 
ACTCICAGAG CCAGTTTTTA AGGGACACCA GAGCCGCAGC CTQCTCTGAT TCTATQGCTT 
GCTTTGTTACT ATAAGAGTAA TTGCCTAACT TGATTTTTCA TCTCTTTAAC CAAACTTGTG 



60 
120 
180 
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TCATAAATCT AGAAAATCCA TICATACAAT TACSGAATTC AA&TATTOCA AAAGGATCTG 
20 TOICTTTCTC CCCGAGCrcC CCTOTrcCCC TTCArTGAAA ACCACCACGG TGCCATCTCT 
TOICTATCCA GGGCTATOCA CCTCCACSGCA CGTGTGTATG CACTCCCCGC CTC3TGTTrAC 
ACAAGCrore GQCTCTIftCG CATCCCTOCT TITrrCACrr AATAATACAG CTTGGAGAGA 
rmTOTATC ACATTATAAA TCCCaCTCGC TCTTTTTGAT GGCCACKTAA TAACTACTGC 
ATAATATCSA TACGCCTTAT TOSATTTAAC TAGITCCCTA ATGftTCCSACT TrTAAGTTCT 

30 TTCcrrmT irrcrmTr gctactgcaa acgatgctat aataaatgtc cttatcaaaa 

AAAAAAAAAA AAAAAAAAAA AAAAAANCXT NtSGGQOQGGG CCCOGGGAflC NCAAT 



25 



35 



(2) INFORMATION FOR SBQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENC3TH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOSy: linear 

45 (xi) SEQUENCE I^SCRIPTICM: SBQ ID NO: 76: 

GGCACGftGAG AAATOTITCA TTCTCTITCC TATTITAAGG GATCTTCTCT CTTGTTGATG 
rraAAAACTT ACCTTAGIGA AGATOTCTTT CAACATGCTG TPGTCdTrA CCTGCATAAT 
CACAGCTATC CATCTATTCA AAGK3ATGAT CTC3TC3CWATA GmTAATGA GGTCACAAftC 
CAAACACr AG ATOTAAAGAG AATCATCAAA ACCTGGACCC TGCAGAAAGG AmCCTITA 
55 GTCACreTTC AAAAGAAAGG AAAGGAACTT TTTATACAAC AAGAGAGATT CTTTrTAAAT 
ATCAAGCCTC AAAITCAGCC TICftGATACA AGGTACATGC CCTCTITCrr TTCATGCCAT 
CTCTTTTCCA CTCTCAGGTC GAAATATTTT GAAGTGTTTT ATAATCATAA GTTCTTGTGA 



360 
420 
480 
540 
600 
660 
720 



GCCAAAAGAT ATTTGACCCyr ITCCAAAATr CAGATTCTGC CTCTGCQGAT AAATATTTGC 
CACGAATCftG TAACTCCTCT CACCftCTCTG AftGGTCCAGA CACSAflQGTTT TGACACATTC 
^ CTACXaCTOA ACICCTCTCT GATCTftGGAT GATCIOTrCC CCCTCTOGAT GAACATOCTC 
TGATCATCAA GGCTCCCAGC AQQCTACm GAAOQCaACA ATCftGATGCA AftAGCTCTTO 
10 GCTCTTTATP TAAAATACTA CTCTCACITP CTGAGTACCC GCCGCITCaC AC5C3CIGAGTC 
CAQGCCroro TGCTTTOTAG AGCCAGCTGC TTGCTCACAG CCACATTTOC AtTTGCATCA 
TTACTCCCrr CACCTCCATA (JKaCICm TGATCCTGGG GAACCAAAAT QCJTGATGATA 
TATAGACm ATGTATAGCC ACAeTTCATC CCCAACOCTA GTCITOGAAA TGTTAATATT 780 

840 

900 
960 
1020 
1080 
1140 
1200 
1255 
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AAOCTAACAA GATTATCCCT TCCTAAGAAT ACTTAACCIT CCTACCAAAT TAAAA 475 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CKARACTERISTTICS : 

(A) LENGTH: 465 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 



15 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 77: 

TTCTCTCTGC TCTTCGACTG CACCGCACTC GCGOGTGACC CTGACTCCCC CTAGTCAGCT 60 

CAGCQGTGCT GCCATGGCGT GGCGGCGGCG CGAACCRQCG TCGGGGCTCG CGGCGTGTTG 120 

20 GCTCTGGCGT TOCTCGCCCT GGCCCTGTGC GTGCCCGGGG CCCGGQGCCG GGCTCTOGAG 180 

TGGTTCTCGG CCGTGGTAAA CATCGAGTAC GTGGACCCGC AGACCAACCT GACGGTGTGG 240 

AGCGTCTCGG AGAGTC3GCCG CTTCGGCGAC AGCTCGCCCA AGGAGGGCGC GCATGGCCTG 300 

CGTGGGCGCC CGGCGGAGAM CTCGARGGCT KCGOGCCCGA CACGCGCTTC 360 

AGCCCGGCGG CCGAGGGGCC GCC3CCCTQGG TCGCCCTGGT GGTCGTGGGG 420 

30 GCTGCACCTT TCAAGGftCAA AGTGCTGGTG GCGGCGCNGA ANGAA 465 

35 (2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1907 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

45 ACATQCAGCC CAACTACAGA TTCTTATGGA ATTCCTCAAG GTTGCAAGAA GAAATAAGAG 60 

AGAGCAACTG GAACAGATCC AGAAOGAGCT AAGTGTTTTG GAAGAGGATA TTAAGAGAGT 120 

GGAAGAAATG AGTGGCTTAT ACTCTCCTGT CAGTGAQGAT AGCACAGTGC CTCAATTTGA 180 

AGCTCCTTCT CCATCACACA GTAGTATTAT TGATTCCACA GAATACAGCC AACCTCCAGG 240 

TTTCAGTGGC AGTTCTCAGA CAAAGAAACA GCCTTGGTAT AATAGCACGT TAGCATCAAG 300 

55 ACGAAAACGA CTTACTGCTC ATTTTGAAGA CTTGGAGCAG TGTTACTTTT CTACAAGGAT 360 

GTCTCGTATC TCAGATGACA GTCGAACTGC AAGCCAGTTG GATGAATTTC AGGAATGCTT 420 

GTCCAAGTTT ACTCGATATA ATTCAGTACG ACCTTTAGCC ACATTGTCAT ATGCTAGTGA 480 

60 



50 
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TCTCTATAAT GCTTCC^GTA TAGTCTCTAG TATTGAATTT GACCGGGATT GTGACTATTT 
TOCGATTOCT GGAGTTACAA AGAAGATTAA AC3TCTATGAA TATGACACTG TCATCCAGGA 
TOCAGTGGAT ATTCATTACC CTCAGAATGA AATGACCTGC AATTCGAAAA TCAGCTGTAT 
CAGTTCGAC?r ACTTACCATA AGAACCTGTT AGCTAGCAGT GATTATGAAG GCACTGTTAT 
TTTATCOGAT GGATTCACAG GACAGAGGTC AAAGGTCTAT CAGGAGCATG AGAAGAGGTG 
TTCGAGTCTT GACTTTAATT IXSATCGATCC TAAACTCITC GCrTCAGGTT CTGATGATGC 
AAAACTX3AAG CTCIX3GTCTA CCAATCTAGA CAACTCAGTG GCAAGCATTG AGGCAAAGGC 
TAATOTOTGC TCTOTTAAAT TCAGCCCCTC TTCCAGATAC CATVTGGCTV TCGGCTGTGC 
AGATCACTGT GTCCACTACT ATCATCTTCG TAACACTAAA CAGCCAATCA TGGTATTCAA 
AGGACACCGT AAAGCAGTCT CTTATGCAAA GTTTGTGAGT GGTGAGGAAA TTGTCTCTGC 
CTCAACAGAC AC?rCAGCTAA AAdGTGGAA TCTAGGGAAA CCATACTQCC TACGTTCCTT 
CAAGGOTCAT ATCAATCAAA AAAACTTTOT AGGCCTGGCT TCCAATGGAG ATTATATAGC 
TTCTGGAAGT GAAAATAACT CTCTCTACCT GTACTATAAA GGACTTTCTA AGACTTTGCT 

AACirrrAAG tttcatacag tcaaaagtct tctcgacaaa gacogaaaag aagatgatac 

AAAIXSAATTT GTTAGTCCTC TOTGCTOGAG GGCACTACCA GATQGGGAGT CCAATGTGCT 
GATTGCrcCT AACAGTCAGG GTACAATTAA GGTC3CTAGAA TTGGrTATGAA GGGTTAACTC 
AAGTCAAATT GTACTTGATC CIGCTCAAAT ACATCTGCAG CTGACAATGA GAGAAGAAAC 
AGAAAATCTC ATCTCATCTC TCTCCCCAAA GTCATCATCG GTTTTGGATT TGTTTTGAAT 
ATTTTrrrCT TTmrCTTT TCCCTCCTTT ATGACCTTTG GGACATTGGG AATACCCAGC 
CAACTCTCCA CCATCAATGT AACTCCATGG ACATTGCTGC TCTTGGTGGT GITATCTAAT 
rrrTOTGATA GGGAAACAAA TTCTTTTCAA TAAAAATAAA TAACAAAACA ATAAAACrrr 
ATTCAGCCAC ACTTGAGCTT GGAAAGTTTr TGTCAAATGC NGCAAGAGAT AACTCITITr 

angaagtagc atatctcaac tataatgtaa cagtgaataa tttgtaaagt tcgtatttcc 

CAACCTCTTT GQGAATTACA CATATCAATA TAAACAAAAT ATAAAGT 
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(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1168 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIjOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
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GCTGQGGTGT CCCXICCSGCC ACCATCGTCA TCGCTTACTT GATGAAGCAC ACTCGGATGA 60 
1 

CCCATCACTG ATGCTTATAA ATTTOTCAAA GGCAAACX5AC CAATTATCTC CCCAAACCTT 120 

5 

AACTTCATGG GGCAGTTGCT AGAGTTCGAG GAAGACCTAA ACAACGGTGT GACACCGAGA 180 

ATCCTTACAC CAAAGCTGAT GGGCGTGGAG ACGGTTGTGT GACAATGGTC TGGATGGAAA 240 

10 GGATTCCTGC TCTCCATTAG GAGACAATGA GGAAGGAGGA TGGATTCTGG TTTTTTTTCT 300 

TTCTTTTTTT mTGTAGTT GGGAGTAAGT TTGTGAATGG AAACAAACTT GTTTAAACAC 360 

TTTATTTrTA ACAAGTGTAA GAAGACTATA ACTTTTGATG CCATTGAGAT TCACCTCCCA 420 

CAAACTGACA AATTAAGGAG GTTAAAGAAG TAATTnTTT AAGCCAACAA TAAAAATATA 480 

ATACAACTTC TTTCTCCXrCC TTTTCCTTTT AAGCTATTTG TAGAGTTTAT GACTAAATAG 540 

20 TCTGTGCAGG TTCATAGACC GAAGATACTA CACACTTTAA ACCAATTAAA AAGAACCAAA 600 

AGTAAATAGA AAAGACATTG AATCACCAAG GCCTGGGATC AACCTGGGCT GTCCACACAG 660 

AAAACAAAAA CCTAACCAAA CCAAGCCCTG TTGTGCTCAC TQGTGCAAAG AGAAGATCAG 720 

25 

GGCAGCTTAA GTCGTCTAAG RATCCTTCAG GCATTCTTTA AGGAGAAAAA GGATACCTTT 780 

GATTTTGTGT GTTTCATGCT CTGGATTTTT TTTTTTTTTC CTTCTCTGGG TTTAAGAGAT 840 

30 TTTTTTTGAA ATAGTGAGGA ACTGACCATT ATATGCCTTC ACTGGCTTCT TGTGCAATAA 900 

TATCATGTTT TAAGTGTGCA AACAAGTTAG AGCTGGCAGC TGAATGATAG ACAAATAGTG 960 

CAAATTTOCC AGCTTGGAGA TAGAAAGGAA TTCAACAATA TATCAAATAC TTTCCTTCCC 1020 

ACXTTTTTCC ' riTlTr rTTT TTTTTTCTGA TTTGATTCTG GTTACAGTGC CATAAACCTT 1080 

GTTACATATG TATATCAGAA TGTAAGAAAA AAAAATTTAT TTAAAAATAT TTTTCGCAAA 1140 

40 AAAAAAANNA AAAAACTCGA GGGGGGCC 11^8 



45 (2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEimSS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI : SEQ ID NO: 80: 
55 AGAAAATCAC ATCCTAACAA AGAAGTCTGT CTAAGACAGT ACATCTCCTG TTGAACTTGC 60 
ATCTTTCCAC AGGACTTTCT GTTTTTAGGG ATGAGACTAT TCTCTGCTTC ATCAAGGAAA 120 



60 



GAGAAATCTT CAGGGTTCTA GGGATGGCAC ACTTATTAGT TCTGCCTGTC TGAAAGGTTC 



180 
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CroCAGCSACA CTITCCyiCAG AGCTCCAATT CTTWHOCAT GGTCTAA.TC3C TTC5ACTATCT 

ciTCTrrccc TrrccrercT casgaatcag creaGAATTC ATTCcaTrGT catccctcta 

GCCCCTTACT CSTCATITCTr GOrTCCACTT TCATrTQCTT TAC3TTCTAGA ATCACCTGTT 
GACTCCTCAG ACTICACCTA ACTTTOGAAA CTCTCTTITC GAGCSCTTCTC ATITCCCCCT 
AATICrereC TCCCTGAGCC CTAGAATTTT CCCACCAACG AATTATTCCA GGTAGATCCT 



40 



(2) INFORMftTION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRMIDEENESS : double 

(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTIC»l: SEQ ID NO: 81: 

TCrcCAGCCC CAATTTCTAC GCGCACCGGA AGACQG»£3GT CCTCmCCT TGCCTAACGC 
AQCCATCGCT COro^TCCCA AGABGCATCT GAAGCGOeTO GCAGCTCCAA AGCATTGGAT 
GCTOGATAAA TTCACCGC^rG TOTrTOCTCC TCCTCCATCC ACOGGTCCCC ACAAGITCAG 
AGAGTCTCTC CCCCTCATCA TmCCTGAG GAACAGACTT AAGTATGCCC TGACAGGAGA 
60 -raAAOTAAAG AAGAnTCCA tGCAGCGGTT CATTAAAATC GATGGCAAGG TCCGAACTGA 



240 
300 
360 
420 
480 



AflCTTCCTCG ATCTAJyiTCA TATTTAAACA ATATCTAGTT GATATTTCTC ATrCAGCTGG 540 
ATCCflGAAAC CACTATCICT NAAAAACAAC CTCTCATACC TTGrGGACCT AATTITGICT 
15 GCGIGIX3ICT GimXXSCAT ATOPATATAG ACAGGCftCAT CnrmTACT TTrGTAAAAG 
CTTATCCCTC TTTOtWATCT ATATCTOTGA AAGTTrTAAT GATCTGCCAT AATGTCTTGG 
QGACCTITOr CTTCTOKn-A AKTOGTACTA GAGAAAACAC CTATATTATG AGTCAATCTA 

OTIGOTITrA TTCGACATCA AGGAAATTTC CAGATAACAA CACTAACAAA CTCTCCCrTG 840 

ACTAGGGGGA CAAAGAAAAG CAAAACTCAC CATAAAAAAC AATTACCTGG TGAGAAGTTG 900 

25 CATAAACAGA ATTAGGTAOT ATATKSAAGA CAGCATCAIT AAACAGTTAT GrTGrrCTCC 960 

TTCCAAAAAA CATCTACTCA CTTCCCGTTG AGTAATGCCA AGTTGTTm TTTATTATAA 1020 

AAcrrecccT TCATTACATC TTTCAAAGTC GTGTGGTGGG CCAAAATATT GAAATGATQG 1080 

^ AACTCACIGA TAAAGCTOTA CAAATAAGCA GTOTGCCTAA CAAGCAACAC AGTAATGTTG 1140 

ACATGCTPAA TICACAAATC CTAAmCAT TATAAATTGT TITCCTAAAA TACACTITGA 1200 

35 AACTATmr CTOTATTCCA AGAQCTCAGA TCTTAGATTr TATGTAGTAT TAAGTGAAAA 1260 

1285 

AATACGAAAA TAATAAACAT TGAAG 



600 
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TATAACCTAC CCTGCTGGAT TCATGGATGT CATCAGCATT GACAAGACGG GAGAGAATTT 360 

CCX3TCTGATC TATGACACCA AGGGTCGCTT TGCTGTACAT CGTATTACAC CTGAGGAGGC 420 

5 

CAAGTACAAG TTGTGCAAAG TGAGAAAGAT CTTTGTQGGC ACAAAAGGAA TCCCTCATCT 480 

GCyTGACTCAT GATGCCCX3CA CCATCCGCTA CCCCGATCCC CTCATCAAGG TGAATGATAC 540 

10 CATTCAGATT GATTTAGAGA CTGGCAAGAT TACTGATTTC ATCAAGTTCC ATTCACCCAG 600 

CCAGGTQGTC TCGTCACCTC AGAGGCTCCG CAGACTCCTG CCCAGGCCAG GACTGAGGCA 660 

AGCrrCAAGG CACTTCTAQG ACCTGCCTCT TCTCACCAAG ATGAACTCAC TGGTTTCTTG 720 

GCAGCTACrc CTTTTCCTCT GTGCCACCCA CTTTGGGGAG CCATTAGAAA AGGTGGCCTC 780 

TCTCGGGAAT TCTAGACCCA CAGGCCAGCA GCTAC3AATCC CTGGGCCTCC TGGCCCCSGG 840 

20 G6AGCAGAGC CTGCXX3TGCA CCGAGAGGAA GCCAGCTGCT ACTGCCAGGC TGAGCCGTCG 900 

GGGGACCTCG CTOTCCCCGC CCCCCGAGAG CTCCGGGAGC CCCCAGCAGC OXSGCCTGTC 960 

CX5CCCCCXyVC AQCCGCCAGA TCCCCX3CACC CCAGGGCGCG GTGCTGGTGC AGCQGGAGAA 1020 

GGACCTGCCG AACTACAACT GGAACTCCTT CGGCCTGCGC TTCGGCAAGC GGGAGGCGGC 1080 

ACCAGGGAAC CACGGCAGAA GCGCTGGGCG GGGCTGAGGG CGCAGGTGCG GGGCAGTGAA 1140 

30 CrrcAGACCC CAAAGGAGTC AGAGCATGCG GGGCX3GGGGC GGGGGGCGGG GACGTAGGGC 1200 

TAAGGGAGGG GGCGCTGGAG CTTCCAACCC GACGCAATAA AAGAAATGTT GCGTAACTCA 1260 

AAAAAAAAAA AAAAAAAANC TCGGGGGQGG 1290 

35 



40 



(2) INFORMATION FOR SEQ ID NO: 82: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOUOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TTTATTGTAT TCTOTAACTA TAGAACTTCT ATTTWATTCT TTTTTGGACT TGCT7\AGTTG 60 

TCTTTOATGG TTTTWAGTTC CATQCTGAAG TTTTCAGTAT TGACTTATCC CCTTGAACAT 120 

GAGTICTTTT ATAGACTCTR ATGATTCAAA AATCTTACAT CmTGGTAG TCTCTTTCAT 180 

55 TTGTYCACTG TTTCTGTTGA TTCTWACTCA TGGTATTTTA ATTCTTCGTT WTTTTTTTTC 240 

TGTTWAGAWA CATTCTTTGA AAAATAATTT GGAGGAATAT TTGATTCTTA TGAACAAGGC 300 

ATTACTCACC AGAGAAGATT TTTTTGTTYT ACCARGTGCC TARGAATGCT AACAGTCTGG 360 



wo 98/39448 PCT/US98/a4493 
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GAMCACATAG AMCACCAGGT GAOXSAGACAA TCCTGQGART CCTCTTTTAC TVTCGSCCAT 420 

crrrrcTCcc aaccctgtgg gaatartcat ycatatccta rctocaggct araaggtggt 480 

5 TTATCAGAGC CCAACTTCGA GGGCTCTGGG CTTTAGCTAC TCTCACCCCA TCATAACTGA 
GCTTCATOGA TTGATTCTCT TTTTATCTTT CAGATTTTCT TTTAAAAATC TTTGTrTTOT 



540 
600 



TTTITCTrCC GAAAGATTCC CCCAACATTA CCATTCXXXA CCTTCCGTTG AATTrnTTG 660 

10 



GCTCTCATTT TGAATTTTTC AAGA 
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(2) INFORMATION FOR SEQ ID NO: 83: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2024 base pairs 
20 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : dotible 

(D) TOPOLOGY: linear 



(xi) SE3QUENCE DESCRIPTION: SEQ ID NO: 83: 

CTGCAGGAAT TCGCCACAGC TGCGCTQGAG GCTTCATCTT TC3CCGCCGCT GCCGTCGCCT 60 

TCCTGGGATT GGAC3TCTCGA GCTTTCTTCG TTCGTTOGYC GGCGGGTTCG CGCCCTTCTC 120 

30 GCGCCTCX3GG GCTOCGAGGC TGGGGAAGGG GITGGAGGGG GCTGTTGATC GCCGCGTITA 180 

AOTTCCGCTC GGGGCGGCCA TCTCGGCCGG CGAGC3TCGAG CGCCTAGTC3T CGGAGCTGAG 240 

CGGCGGGACC GGAGGGGATG AGGAGGAAGA GTGGCTCTAT QGCGATGAAA ATGAAGTTGA 300 

AAGQCCAGAA GAAGAAAATG CCAGTGCTAA TCCTCCATCT QGAATTGAAG ATGAAACTGC 360 

TGAAAATCGT GTACCAAAAC CGAAAGTGAC TGAGACCGAA GATGATAGTG ATAGTGACAG 420 

40 CGATGATGAT GAAGATGATG TTCATCTCAC TATAGGAGAC ATTAAAACGG GAGCACCACA 480 

GTATGGGAGT TATGGTACAG CACCTGTAAA TCTTAACATC AAGACAGGGG GAAGAGTTTA 540 
TGGAACTACA GQGACAAAAG TCAAAGGAGT AGACCTTGAT GCACCTGGAA GCATTAATGG 
AGITCCACTC TTAGAGGTAG ATTTGGATTC TTTTGAAGAT AAACCATGGC GTAAACCTGG 
TGCTGATCTT TCTGATTATT TTAATTATGG GTTTAATGAA GATACCTGGA AAGCTTACTG 

50 TGAAAAACAA AAGAGGATAC GAATGGGACT TGAAGTTATA CCAGTAACCT CTACTACAAA 780 
TAAAATTACG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 840 
TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 900 
GAGATTACCT GGGGCAATTC A1CTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 960 

CAGGCGACGG GCAAATGAGA ACAQCAACAT ACAGGTCCTT TCTGAAAGAT CTGCTACTGA 1020 

60 ACn-AGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACTCA 1080 
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anrccACCT cctccatttc ttccacctcc tccgactgtc agcactgctc cacctctgat 

TCCACCACCG GCTTTrCCTC CTCCACCAGG CGCTCCACCT CCATCTCTTA TACCAACAAT 
AGAAAGTCGA CATTCCTCTG GTTATGATAG TCGTTCTGCA CGTGCATTTC CATATGQCAA 
TGTTGCCriT CCCCATCTTC CTGGTTCTGC TCCTTCGTGG CCTAGTCTTG TGGACACXy^ 
CAAGCAGTCG GACTATTATG CCAGAAGAGA GAAAGACCGA GATAGAGAGA GAGACAGAGA 
CAGAGAGCGA GACCCnGATC GGGACAGAGA AAGAGAACGC ACCAGAGAGA GAGAGAGGGA 
GCX?rcATCAC AGTCCTACAC CAAGTOTrTT CAACAGCGAT GAAGAACGAT ACAGATACAG 
GGAATATCCA GAAAGAGGTT ATGAGCGTCA CAGAGCAAGT CX3AGAAAAAG AAGAACGACA 
TAGAGAAAGA CGACACAGGG AGAAAGAGGA AACCAGACAT AAGTCTTCTC GAAGTAATAG 
TAGACGTCGC CATCAAAGTO AAGAAGGAGA TAGTCACAGG AGACACAAAC ACAAAAAATC 
TAAAAGAAGC AAAGAAGGAA AAGAAGCX3GG CAGTGAGCCT GCCCCTGAAC AGGAGAGCAC 
CGAAGCTACA CCTGCAGAAT AGGCATGCHT TTGGCCTTTT GTGTATATTA GTACCAGAAG 
TAGATACTAT AAATCTTCTT ATTTTTCTGG ATAATGTTTA AGAAATTTAC CrTAAATCTT 
GrrCTCTITG rrACTTA-TCAA AAGTTAACTT TTTTTCCAAA ATAAAAGAGT GAATTTTTCA 

TOTTAAcyrrA aaaatctttg tcitctacta tttcaaaaat aaaaagacag caatgacttt 

ATATCCAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAGGGC GGCC 
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(2) INFOPMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LE^DGTH: 931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOlJOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CQCGCCMATA GCCGGACGGG GATCTCAGCT QGCAGGATGA ATGTGGGGGT GGCACACAGC 
GAAGTAAACC CCAACACCCG AGTGATGAAT AGCCGAGGCA TCTGGCTGGC CTACATCATC 
rrOGTAQGAT TOCTGCATAT GGTTCTACTC AGCATCCCCT TCTTCAGCAT TCCTGTTGTC 
TCGACCCTGA CCAACGTCAT CCATAACCTC GCTACGTATG TCTTCCTTCA TACGGTGAAA 
GGGACACCCT TTCAGACTCC TCACCAAGGA AAGGCTCGGC TACTGACACA CTGQGAGCAA 
ATCGACTATG GGCTCCAGTr TACCTCTTCC CGCAAGTTCC TCAGCATCTC TCCTATTGTG 
CTCTATCTCC TGGCCAGCTT CTATACCAAG TATGATGCTG CGCACTTCCT CATCAACACA 
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(2) INFORMATION FOR SEQ ID NO: 85: 



600 
660 



GCCTCATTCSC TAAC3TOTACT GCTGCCGAAG TTGCCCCAC3T TCCATCCSGCST TCGTGTCTTT 480 
GGCATCAACA AATACTCAGG GATGGGTTTT GQGACflGCTC CATGC3GCATG GGGAAGGCAC 540 
5 TGAAACAGAG GACTATAAAA CATCCTTCTC TTATTCTCCA TACTGTCTTC TACACCTTTA 
AACX:CTGAGA ACTATACAAC CTTTCCCAGA CTCCCAAGAA GAGAAGAGAT TGGCAAATGG 
GGCTCCTCGG CCCAGTCCTC CTAGTGGCAA GTTTCTTTGA ATCAGGAAGG CAGGTGAGGT 720 
AAGGGCCAAA TCACTCTCCT CCMAGCAGG AAGCCATTTG QGCAGCTCCT TTGGTGATTA 780 
CATCITTCCA TATCTTTTAC ACTTACCACC TTCCAGCTCT GTTTrGCTGT GTATTTTTCT 
15 TACAATAATT TTTTTCAGCT ATAGCTGCAG TTTAATCAGG ATGGGTAGAG AGCTGTCCTC 
ATAAGGCTGG GGGTGGGAAG ATGGAATACT G 



840 
900 
931 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 825 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

CX3QGGCCGGC GGGGTCTTCA GQGTACCOGG CTQGTTACAG CAGCTCTACC CCTCACGAGG 60 

CAAACATCGC AGCGCAGAAG GACCAGCAGA AAGATGCCGA QGCQGAAGGG. CTGAGCGGCA 120 

CGACCCIGCT GCCGAAGCTC ATTCCCTCCG GTGCAGGCCG G6AGTGQCTG GAGCGGCGCC 180 

GCGCGACCAT CCGGCCCTGG AGCACCTTCG TGGACCAGCA GCGCTTCTCA CGGCCCCGCA 240 

40 ACCTOGGAGA GCICTQCCAG CGCCTCGTAC GCAACGTGGA GTACTACCAG AGCAACTATG 300 

TCyrTCGTCTT CCTGGGCCTC ATCCTGTACT GTGTGGTGAC GTCCCCTATG TTGCTGGTGG 360 



420 



CICTGGCTGT CTmTCGGC GCCTCTTACA TTCTCTATCT GCGCACCTTG GAGTCCAAGC 
TTCTGCTCTT TGGCCGAGAG GTCAGCCCAG CGCATCAGTA TGCTCTGGCT GGAGGCATCT 480 
CCTTCCCCTT CTTCTGGCTG GCTGGTGCGG GCTCGGCCGT CTTCTGGGTG CTGGGAGCCA 540 
50 CCCTGGTCGT CATCGGCTCC CACGCTGCCT TCCACCAGAT TGAGGCTGTG GACGGGGAGG 
AGCTOCAGAT GGAACCCGTC TGAGGTGTCT TCTGGGACCT GCCQGCCTCC CGGGCCAGCT 
GCCCCACCCC TGCCCATCCC TGTCCTGCAC GGCTCTGCTG CTCGGGCCCA CAGCGCCGTC 720 
CCATCACAAG CCCGGGGAGG GATCCCGCCT TTGAAAATAA AGCTGTTATG GGTGTCATTC 780 
AQGAAAAAAA AAAAAAAAGG GGGGCCCCTC TAGGGGTCAA AGTTA 
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(2) INFORMATION FOR SEQ ID NO: 86; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANIHENESS: doiible 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 
CATGTAAAAG GATGAAATGT GACTTCTGGT GTTTTTTTAT TTCTATQGAG GGACTTTCTG 60 

15 GGGACGGTTT CTGGCTCTCA GGCTCTGAGA AGCTGCAGTT TATGAGTQGC TCTGTGTGTG ' 120 

CTGCCACCTA CTGGAGAAGC CATAAGCTGC AGCTTTAGGA AAAGGGAACC OGGGGCAGAG 180 

TGTCGGGAAG TGGGATGGCA GCATGGCAGG GCTTTGGAAA ATGAGAGGTG AGAGTKTKTG 240 

CAGGAAGGGT GTAAGGAGAG GATGGATCCT GATACATGGA TTCAGGATCA TTAGGGTCCT 300 

GTCTGGGACA CTGGCCTTCC TGCTTACCTG CTCTTTCCTT CCTCCTTGGT CGGAGGAGGG 360 

25 GCTGGCTCAC TGCTCTQGCT TCATTTTCCA GAGCTGCCTG CTGCAGTCAC ACTTAGGTCA 420 

TCTTCTCTCA CTTTTCTCCT TTTGCCGATT AGTQGACGTG ACAGAGATGT GAATGGGGCA 480 

GGGATGTCCT TTGATGGCAT CAAGACTTTA GCTTCTGGTG CGCTGTGTCC CAGCTCTGAT 540 

TTCAGTTGCA GCCGTGATGG AMAGTTTTCCA TGGAAGCTGA GACTCTCACT GACAGTGAAA 600 

CCCTCAAATG AACACAATCC CTGCTTTCCT GCCAAGGATC CTTGTAGQGT NCCCCCAGCT 660 

35 TOCCCACTTT TTTTCTGTGT CCTGACAAAG AAACACAGAG TAACTTGATT GCCCTGTGAC 720 

CTQGCCAGTT GCATTTCCCC TGCAGGCTTG AGCCCAAGCC AGAGCCTTGA AAAGGTATTC 780 

AGGTTGTTGC CCAAAACACT GAAAAAAACT GCCCTGGCCC TGAACCAAAT ACCTTGAACC 840 

CTCGTAAACT CCATACCCTG ACCCCCTTGT TTTCGATATA CCCAGGTAGA ACAACTCTCT 900 

CTCACTGTCT GTTGTGAGGA TACGCTGTAG CCCACTCATT AAGTACATTC TCCTAATAAA 960 

45 TGCTTTGGAC TGATCACCCT GCCAGTCTTT TGTCTTGGGC AATCTATACT TTTNCTCAGA 1020 

GGTTCCCAAG GCCTACTGAA GQGACTTAAC ATACTCTTAA TGQCTTTCCT CTCTCTTGTT 10 80 

TTACCTTATG CCCTCACTTC CTGAGTTAAC CTCCCAAATA CAGGATTCAC CTGTACCCAA 1140 

GCCCTTAGCT TCAAGAATAC AGGATCACCT GTACCCAAGC CCTTAGCTCA AGCTCTGCTT 1200 

TGGAAGAACC CAAACTAAGA CAGTGCTCCT GGTGCCCT 1238 



55 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTI(»I: SBQ ID NO: 87: 

ATIXSCCTTCT QOrCCCTGGT GACACTGGGG TCATCCTTCA TCCCCGGAGA GCATTTCTGG 60 

CTCCICCTCC TCACCCGGGG CCTGGTGGGG GTCGGGGAGG CCAGTTATTC CACCATCGCX5 120 

CCCACTCTCA TTGCCGACCT CTTTtSTGGCC GACCAGCGCG ACCXX3ATGCT CAGCATCTTC 180 

TACTTTGCCA TTCCGGTGGG CAGTGGTCTG GGCTACATTG CAGGCTCCAA AGTGAAQGAT 240 

ATGGCTGGAG ACTGGCACTG GGCTCTGAGG GTGACACCGG GTCTAGGAGT GGTGGCCGTT 300 

CTGCTGCTOT TCCTGGTAGT GCGGGAGCCG CCAAGGQGAG CCGTGGAGOS CX:ACTCAGAT 360 

TTGCCACCCC TGAACCCCAC CTCGTGGTGG GCAGATCTGA GGGCTCTGGC AAGAAATCCT 420 

AGTTTCGTCC TGTCTTCCCT GGGCTTCACT GCTGTGGCCT TTGTCACGGG CTCCCTGGCT 480 

CTCTGGGCTC CGGCATTCCT GCTGCGTTCC CX3CGTGGTCC TTGGGGAGAC CCCACCCTGC 540 

CTTCCX:GGAG ACTCCTGCTC TTCCTCTGAC AGTCTCATCT TTGGACTCAT CACCTGCCTG 600 

ACCGGAGTCC TGGGTGTGGG CCTGGGTGTG GAGATCAGCC GCCGGCTCCG CCACTCCAAC 660 

CCCCGGGCTG ATCCCCTGGT CTGTGCCACT GGCCTCCTGG GCTCTGCACC CTTCCTCTTC 720 

CTCTCCCTTG CCTCCGCCCG TGGTAGCATC GTQGCCACTT ATATTTTCAT CTTCATTGGA 780 

GAGACCCTCC TGTCCATGAA CTGQGCCATC GTGGCTGACA TTCTGCTGTA CGTQGTGATC 840 

CCTACCCGAC GCTCCACCGC CGAGGCCTTC CAGATCGTGC TGTCCCACCT GCTGGGTGAT 900 

GCTCGGAGCC CCTACCTCAT TQGCCTGATC TCTGACCGCC TGCGCCGGAA CTGGCCCCCC 960 

TCCTTCTTGT CCGAGTTCCG GGCTCTGCAG ITCTCGCTCA TGCTCTGCGC GTTTGTTGGG 1020 

GCACIGGGCG GCGCACTTCC TGGGCACCGC CATCTTCATT GAGGCCGACC GCCGGCGGGC 1080 

ACAGCTGCAC GTCCAGGGCC TGCTGCACGA AGCAGGGTCC ACAGACGACC GGATTGTGGT 1140 

GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC CGTGGCCAGT GTGCTCATCT GGAGAGGCTG 1200 

CCGCTCACCT ACCTGCACAT CTGCCACAGC TGGCCCTGGG CCCACCCCAC GAAGGGCCTG 1260 

GGCCTAAACC CCTTGGCCTG GCCCAGCTTC CAGAGGGACC CTGGGCCGTG TGCCAGCTCC 1320 

CAGACACTAC ATX3QGTAGCT CAGGGGAGGA GGTGGGQGTC CAGGAGGGGG ATCCCTCTCC 1380 

AACAGGGGCA GCCCCAAGGG CTCGGTGCTA TTTGTAACGG GATTAAAATT TGTAGCCAGA 1440 

AAAAAAAAAA AAAAAAAAAA 1460 



60 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQ3GTH: 1395 beise pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTKMa: SEQ ID NO: 88: 

CAGGTCCAAA GTCGGAAGTC TCAGTCCTCA GTCTTGQGCT ATTCGGCCAC GTGCCTGCCG 60 

GACATOGGAC GCTGGAGGGT CAGCAGCGTG GAGTCCTGGC CTTTTGCGTC CACGGGTGGG 120 

15 AAATTOGCCA TTGCCACGGC GGGAACTGGG ACTCAGGCTG CCCCCCGGCC GTTTCTCATC 180 

COTCCACCGG AYTCGTOGGC GCTCGCACTG GCGCTGATGT AGTTTCCTGA CCTCTGACCC 240 

GTATTOTCTC CAGATTAAAG GTACGACATT TGC3flGGCCCC AGCGAGAAAC GTCACCGGGA 300 

GAAACGTCAC CGGGCGAGAG CGGKCCCGCT GTGTGCTCCC CCGGAAGGAC AGCCAGCTTG 360 

TAGGGGGGAG TCCCACCK5A AAAAAAAATT TCCAGGTCCC CAAAGGGTGA CCGTCTTCCG 420 
25 GAGACAQCGG ATCGACTACC ATGTGGGTGC CCACAAAAAT TYCACCTYTG AGTCCTCAAC 
TGCTCACCCC GQGGTCAGTT CCAGAGAGAA GGACTCCCTC CTGCTTGGAA GAGACCTCAC 
ACCGTCATCA CGATGCCAAC GGCTCTGAAG GTGGATGGCA TTCCTGCGTG GATTCATCAC 

TCCCGCATCA AAAAGGCCAA CRGAGCCCAA CTAGAAACAT GGGTCCCCAG GGCTGGGTCA 660 

GGCCCCTTAA AACTGCACCT AAGTTGGGTG AAGCCATTAG ATTAATTCTT TTTCTTAATT 720 

35 TTOTAAAACA ATGCATAGCT TCTGTCAACT TATCyrATCTT AAGACTCAAT ATAACCCCCT 780 

TCyiTATAACT GAGQGAATCA ATCATTTGAT TCOCCAAAAA CACAAGTGGG GAATGTAGTG 840 
TCCAACCTGG TTTTTACTAA CCCTGTTTTT AGACTYTCCC TTTCCTTTAA TCACTCAGCC 



480 
540 
600 



900 



TTCTTTCCAC CTGAATTGAC TCTCCCTTAG CTAAGAGCGC CAGATGGACT CCATCrTGGC 960 



1020 



TCTTTCNACT GGCAGCCGCT TCCTYCAAGG ACTTAACTTG TGCAAGCTGA CTCCCAGCAC 

45 ATCCAAGAAT GCAATTAACT GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA 1080 

ATTCCyrCCAA TTGATTACAC CCMAAAGCCC CGCGTCTATC ACCTTGTAAT AATCTTAAAG 1140 

CCCCTCCACC TGGAACTATT AACGTTCCTG TAACCATTTA TCCTTTTAAC TTTTTTGCCT 1200 

ACTTTATTTC TCTAAAATTG TTTTAACTAG ACCCCCCCTC TCCTTTCTAA ACCAAAGTAT 1260 

AAAAGCAAAT CTAGCCCCTT CTTCAGGCCG AGAGAATTTC GAGCGTTAGC CGTCTCTTGG 1320 

55 CCACCAGCTA AATAAACGGA TTCTTCATGT GTAAAAAAAA AAAAAAAAAA CTCGGAGGGG 1380 

GGGCCCGGTA CCCAA ^^^^ 
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(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GGCACGAGCC GGCAAGCCGA GCTAGGGTGA AAACTGOGGG CGCACCAGGA TGTNNGACAG 60 

AAAAGCAGAA GATGAGACTC TGTTCATTCA CTTTTCCTAG GCCCATCCTG TGGTCATCIT 120 

15 

TCCCCCTCCC ATCATACCTC CTCCTTCCTG GAGCCTCTGC CGGCTTQGCT GTAATGGTGG 180 

CACTTACCTG GATATTTCAG TGGGAGGATG AAAGGCGAGA CTCACCCTAC GCGGTGGGAC 240 

20 AGATGGGGAG AGGAAAAAGG CAGAGATGGC CAGGAGAGGG GTGCAGGACA AACCAGAGAG 300 

GTTGGGTCAG GGGAAAAGGG TGGGGAGAAA GAGGGGTGCA GGCCCTGCAG GCCGGTTAGC 360 

CAGCAGCTGC GGCCTCCCCG GGCCCTTGGC ATCCAACTTC GCAGACAGQG TACCAGCCTC 420 

25 

CTGGTGTGTA TCATAGGATT TGTTCACATA GTGTTATGCA TGATCTTCGT AAGGTTAAGA 480 

AGCCGTGGTG GTGCACCATG ACATCCAACC CGTATATATA AAGATAAATA TATATATATA 540 

30 TGTATGTAAA TTATGGCACG AGAAATTATA GCACTGfiGGG CCCTGCTGCC CTGCTGGACC 600 

AAGCAAAACT AAGCCTTTTG GTTTGGGTAT TATGTTTCGT TTTGTTATTT GTTTGTTTTT 660 

GTGGCTTGTC TTATGTCGTG ATAGCACAAG TGCCAGTCGG ATTGCTCTGT ATTACAGAAT 720 

35 

AGTGTTTTTA ATTCATCAAT GTTCTAGrrA ATGTCTACCT CAGCACCTCC TCTTAGCCTA 780 

ATTTTAGGAG GTTGCCCAAT r iTOmViT CAATTTTACT QGTTACTTTT TTGTACAAAT 840 

40 CAATCTCTTT CTCTCTTTCT CTCCTCCCCA CCTCTCACCC TTGCCCTCTC CATCTCCCTC 900 

TCCCGCCCTC CCCTCCTCCC TCTGGCTCCC CGTCTCATTT CTGTCCACTC CATTCTCTCT 960 

CCCTCTCTCC TQCCTCCTGC TGCCCCCTCC CCAGCCCACT TCCCCGAGTT GTGCTTGCCG 1020 

45 

erCCTTATCT GTTCTAGTTC CGAAGCAGTT TCACTCGAAG TTGTGCAGTC CTGGTTGCAG 1080 

CTTTCCGCAT CTGCCTTCGT TTCGTGTAGA • TTGACGCGTT TCTTTGTAAT TTCAGTOTTT 1140 

50 CTGACAAGAT TTAAAAAAAA AAAAAGGAAA AAAAAAAAAA AAAAAA 1186 



55 (2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1821 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS : double 
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(D) TOPOLOGfY: linear 
(xi) SEQUENCE DESCRIPTICafl: SBQ ID NO: 90: 

AAAACATQCT TTCAQGGCGT CCCCTATGTA TTCGGGGGGC CX^^CGGACAC TCAGGCTGGA 60 

KATCCGTCCT CACTGCGCTC AAGATGGCCT CAGCAGACAC CAGTTACCCA GCTGAAAGTC 120 

ACAATCCCTC CTAGAAGTCT CXXAACACTA GrTGCTGACCA GAQGTGGGGC TCTCAGGCTA 180 

GGAGTTTCAC ACACAATGAC AGGCTGCTGG GGGACATTGC AGGACCCCTT TTCCTYTCCT 240 

CrcCATGCTA GAAGCCAGCC CTAGGMAGCT QCAGTTACTC CCTGTGACTC AGCAGCAGGC 300 

15 TGATTCAACA CAGCTGCCCA CACAAAGCCA G?rGGTAATAC ATCTGTTTAC CTTTCCCTAT 360 

CACCCAGACA CAAGCCCCTT TCCCAGGTCA AACCACAGGC CGATGCATCT CCAGTTTGAC 420 

AC?rCAAATCA CTACTTCCAT TGCTACTTTA GATCAGCCAA AGTGGTGACT GCTGCAGTGT 480 

GTCGCTATCC CTACAAGGCC CACCCAAGGG ATGCCCAAAG CCCAACCTTC TCCAGGGCTG 540 

CAGCAGNAGC AACCCCACCA GCCTAAGTCC AGCAGAGGAC CTCCCACCCA ATGTCTTGTT 600 

25 CTAATTAGAA GGGGAAGTTA GCCACAGAAA ATCAACTTAT CTATAATTAC AAAATTCTCT 660 

TCACTCACCT TAAAGTTCCT ATTGACATCT ACTGCTTTTA AACCTATTTG AAAACTCTGA 720 

TACTAAAACA AATOACACTC TAAGAAAGTT TGQGAGCCCC ATGCTGAGAA CXATTTCTGT 780 

GCAGTCAGGA TGTTTCCAGA AGCTACTTAC CTACATGTGA ATGTGCXATT TTCTTTCCTT 840 

TTGTAGAGAA AATCCCCTTT ACnTTTTGGA ACAGTAATGG CAGCTTCTAG TACAGCCATT 900 

35 ACAGTTTCAT ATGAGAAAAA TTAAGAATAA CTATAAAATT GTTAAAATAT CCAATAATGG 960 

ATAA-KSATGG CCAGAAGATT TAACATACAA AGTAATTCTC AATGTAAAGC TATTCAGCTC 1020 

TTCCAGGTTG AAIXSCCCTGT AACCCACCCT GACCTTCCAC ATCATCTTCA AAAAGCAGTT 1080 

TCTCTCTTCC CCATGATTCT CCTATAAGGT AACTCTTTAG TCCTCCATTT AGCACATTTT 1140 

AAATCCrcCA AAGAATAAGT ATCATGTGAT TATTTTAGCT TTACAAAAAA AAAGTTGAAT 1200 

45 GGCX?rrTTAT TTTCATQGCC TATAAGCAGG TACCTTAGTA GGGCAGATAT AGGAAAAACA 1260 

AATTAGAGCA AAACAAATCC TCTACAAATC CAAGGCAGGA AAAGTGG?rGG CAGAGTGACT 1320 

CATTCTCCTG TCCCTCCCAT CAGGTCAAAT CAGGAGGCTG CAGTGAATGC CTGTTCTTTG 1380 

AATGTCTAGC AGTTGTTCCT GTAACTCTTT AAAACTTGGC TATAGGCTGT TTAGCACAGT 1440 

ACAGATTAAA GATACAGTTA CGTAAACAGC AAAGTAATTT TATAGTGCTT CATCCATTTA 1500 

55 TCATGCTTTC GTTTGCTAAT TTTTTCACAT ACCTTTTTCT ATCACAGTCT GTTGCTTTTG 1560 

TACACATTTC TCATATTGGG GTTCGACAGG TAAACACAAA CTGCTATTTC AGTAGAAAAA 1620 

GTTATTGTTA TGGAATATTA AACCCAATAA ATTGTATAAA GGGTAAAAAA AAAAAAAAAA 1680 
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40 
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AAAAAAAAAA AAAAAAAAAA AAAAAAATTC CTGOGGQCCG CM3GCTTTTT CCCTTTQGGT 1740 
GAQGGGTTAT TTTNQGCTTG GGCACTGQGC CCTTCGTTTT TACAACGTCG TGANGGGGGG 1800 
5 AACCCGGGGG GGGTITCCCC C 1821 



10 (2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 862 base pairs 
(B) TYPE: nucleic acid 
15 (C) STRANDEt»^ESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

20 TGCCCTTTTT CCCACCGATT CGGGGCNTGG TGAAGGTGGG AGATGTGAAC TCCAATTAAG 60 

GGACTGGAGA GAGGTGAAGA ATTTTGCAGG TGGGAGATTT GGATTTGAAT GTGGACTTGT 120 

AAATGACTTG ACCTTGCCAT CTGTGTTCAA GGTCACGGTT TGCTGTGGGG 1TCCTGGGAG 180 

25 

AGCTTACTCA CCCCGGAGTC TTTTCTTTCT CTTGCTCCAA GAAGAGCCCT GTTGGTGCTT 240 

TACCACCGCT TQGAGTCTCC CGAGGACACA AACAGGCAGA GAGGGACGTG TAGGGAGAGT 300 

30 TCTTTCCTGT TTTCTGTGCT TTCCTTTTTA CAGGACTCCC GGAAGGCCAC TCATGGCCAT 360 

GCCAGGAGCT TTCTCAGAAA CAGTCATAAA CGATCTCTTG AGTCTCTTTC TTGTCCTCCC 420 

AQCTGAGCTT TCTTATTCCA CCCrTTCTGG TGTCTATAGG AATC3CATGAS AAGACCCTGG 480 

35 

GACGTTTTTC TGCTCTCTTC TGGCCCTCCA TGGAGCCATG GGCCTCGGCC TCGGCGGCTC 540 

CTCACCCTCA CAATTTATTT CCTCCTCCCG TGCCAGCCCT TCTTTTGTGT CTGAAACCGG 600 

40 TTTTAAAATG TGACTCTCCC AGAGAAGAAG CCGCTGGCTG TATGAAACTT GACGGCGCTT 660 

TTGTAAGGTG CCACCCCCAA ACTTTAAGGT AGCTAAACCA ATTTTTAAAA GATTCAATGG 720 

CTTGTTCATC CTCCAGATGT AGCTATTGAT GTACACTTCG CAACGGAGTG TCTGAAATTG 780 

45 

TGGTGGTCCT GATTTATAGG ATTTCATAAT TAAAATGTCT GCTGAATAAA AAAAAAAAAA 840 

AAAAACTCGA GGGGGGCCCG GT 862 

50 



(2) INFORMATION FOR SEQ ID NO: 92: 

55 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 696 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) T0POL£)GY: linear 

60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

CTGAGGCGAG TGAAGTGGAC TCTGAGGGCT ACCGCTACCG CCACTGCTGC GGCAGGGGCG 60 

TGGAGGGCAG AGGGCCGCGG AGGCCGCAGT TGCAAACATG GCTCAGAGCA GAGACGGCGG 120 

AAACCCGTTC GCCGAGCCCA GCGAGCTTGA CAACCCCTTT CAGGACCCAG CTGTGATCCA 180 

GCACCGACCC AGCCGGCAGT ATGCCACGCT TGACGTCTAC AACCCTTTTG AGACCCGGGA 240 

GCCACCACCA GCCTATGAGC CTCCAGCCCC TGCCCCATTG CCTCCACCCT CAGCTCCCTC 300 

CTTGCAGCCC TCGAGAAAGC TCAGCCCCAC AGAACCTAAG AACTATQGCT CATACAGCAC 360 

15 TCAGGCCTCA GCTGCAGCAG CCACAGCTGA GCTGCTGAAG AAACAGGAGG AGCTCAACCG 420 

GAAGGCAGAG GAGTTGGACC GAAGGAGCGA GAGCTGCAGC ATGCTGCCCT GGGRGGCACA 480 

GCTACTCGAC AGAACAATTG GCCCCCTCTA CCTTCTTTTT GTCCAGTTCA GCCCTGCTTT 540 

20 

TTCCAGGACA TCTCCATGGA GATOCCCCAA GAATTTCAGA AGACTGTATC CACCATGTAC 600 

TACCTCTQGA TGTCCAQCAC GSTGGNTCTT CTCCTGAAYT TCMTCGSCTG CCTGGCCAGT 660 

25 TCTGTGTGGA AACCAACAAT GGCGAGGCTT TGGGTT 696 



30 (2) INFX)RMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1886 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

40 CAGGCCACTG AOGCTTCTTT GCGAGGGATG CAGGAGGTCC TACAGAGAAA GGCGCTTCTT 60 

GCATKTCAGA GQGCCCACAG CCTGTCACCC ACAGATCACC AAGCAGCTTT CTACCTGGCT 120 

CTGCAGCTTG CCATCTCCAG ACAGATCCCA GAGGCTCTGG GGTATGTCCG CCAAGCTCTT 180 

45 

CAGCTTCAAG GTGACGATGC CAACTCCCTG CACCTCCTTG CCCTCCTGCT GTCAGCACAG 240 

AAGCATTACC ATGACGCTCT GAACATCATC GACATGGCCC TGAGTGAATA CCCAGAAAAT 300 

50 TTCATACTAC TGTTTTCCAA AGTGAAGTTG CAGTCACTCT GCCGAGGCCC GGACGARGCA 360 

CTGCTGACTT GTAAGCACAT GCTGCAGATA TGGAAATCCT GCTACAACCT CACCAACCCC 420 

AGrrGATTCTG GACGTGGGAG CAGCCTCTTA GATAGAACCA TTGCTGACAG ACGACAGCTT 480 

55 

AATACAATTA CTTTGCCAGA CTTCAGCGAT CCCGAGACAG GCTCCGTCCA TGCCACATCG 540 

GTAGCAGCCT CAAGAGTGGA GCAQGCACTG TCGGAAGTGG CTTCGTCTCT GCAGAGCATG 600 

60 CCCCTAAGCA GGGCCCGCTG CACCCCTGGA TGACGCTGGC ACAGATCTGG CTCCATQCAG 660 
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CTCAAGTCTA TATCGGCATC GGGAAGCCTG CAGAAGCCAC AGCCTGTACC CAAGAAGCTG 
CCAACCTCTT CCCAATGTCC CACAATGTCC TCTACATGCG CGGCCAGATT GCTGAGCTCC 
GGGGAAGCAT GGACXSAGGCG CGGCGGTGGT ATGAAGAQGC CTTAGCCANT CAGCCCCACC 
CACGTCAAGA GCATGCAGCG ACTTQGCCCT GATCCTTCAC CAGYTAC3GCC GYTACAGTVT 
GGCGGAGAAG ATCCTCOGGG ACGCGGTGCA GGTGAACTCG ACAC3CCCACX3 AGGTCTQGAA 
CGGC3CTGQC3C GAGGTCCTCC AAGCTCAC3GG CAACGATGCG GCGGCTACX3G AGTGCTTCCT 
GACAGCCTTC GAGCTGGAGG CCAGCAGCCC CGOCGTGCCC TTCACCATCA TCCCCCXXCT 
GCTCTCAGCA GGCGCCTGCC AGCCTCACCT GCCQCTCAGC CHjCAGAGGC CXTTGCCGGQC 
ACCAGGGCTT GTGCCATCGC CCCAAGQGGA TGAATCTGCC GCACTGAGGC CAGGGACGAG 
TGTTCAGTGG GCCACAGTGA ACCAACCAAA CCAACCCCGA ATCATCGCTC TCGCCATGTG 
COTTTCrcTT GTTTmTTG CCAGCCCAAT GGTAGTTTCT GAACCTATTG ACATTGTTCA 
AAATGGATCA TGTGCCATAT TTTGTTAGTT GACATCTGAG TTTTCAGTAA AATGATTATG 
GAATTAATCA GCAAATOTAG AAGAATATAT TCAAAGTTAA AATTCAGTGG CAGCACAGAT 
TATTTTTATC AGAGCTCTAA AGAAAACAAC TGTCCTTTTC TCCCCACCAC CCCTCCTGCC 
CCACTTTCGC CCAGAAACCA AATGTGAACT TCCTGTCTCC CACCTCAGCA CTAGTCCATG 
CCAQGACACC AGCTGACAAT TTCTTGGTTT TACTGTCAAT AATTGTACCA TGTGATCAAT 
TACTOTCCTC ACTTAGAACA AAGCCTGAGT CCGAGAATAT TTATATTTTA CCAATATATG 
CCTGTTACAA GAGAAGGAAA TATGAGTTAT TTAAGTTTAA CTTTTTTATG TGAATTCAGA 
GTTTATTTAT CGAGGGAAAT ATGTACAAAG AAGCTTCAAA TGGAATATTT ACCGACATTC 
CTTATACATG ACAGACACTT GGCTACATGG GAAGATGATG TTAATAATAA AATGATTTTT 
AAATGGAAAA AAAAAAAAAA AAAAAN 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1886 
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(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
CTCAGCTACC GTATACAGTA GGACATAACC CCATTTCACA TGCACTACAC TGAGACTTGC 
CCCACATTGA AGATGTTCTT TTTTCATAAC TATATACTAT TCCATTGCAT 



60 
120 
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GAATATTCTG TAATTTATTT AATCCCCTAT GGATTGATAA TTAGGTTCAT TATAGATAGA 180 

AGTGTAATTA ACATTCCTGT ACATGTATTT TGCTACTTGT GTQQGTATTT CTGTAGGATG 240 

5 AATAACTAGA AATTTATTGG ATCAGGTTTC ACATTTGCAG TTTTGAAAAC TACTACCAAA 300 

AAGATTTCAC CAATTTACAA CTCCATCATT AGTAAGAATG CXHOTTTGCC TATAGTCTGC 360 

CAACCCTGAA TCCTTAAAAA TTTTTGCCAA TCTGGTAGGC AAAATTTCIT TCTTTTCTTT 420 

GAATATTAAT GAGGAGGAAC ATCTITTCAT GTTTCTTGGC CATTTGCATT TCCTATTATG 480 

AATTCCTTTT GCXXATTTTC CTTTTTTTAA TTATGAAAGT CTAATGACTA CCTTCTCATT 540 

GTATAAAAAA CACAGTTCTT TGAATAGAGA GACCCTTTTC TCCAATGCTA CCAATCACAT 600 

TCCACTTACC ACAGTTTAAC ATACATCCTC TAGTCACCTT TCCGTACX3AA TATACATACA 660 

CATAAAAACA CTTTTTACAT AAATAGGATC TCATATTCTC^TAGCTTTTTA AAATTTTGGT 720 

CTCAAAAAAA GATAACAGGT CTTTAAATTT CTTTAATGGT TGAATATGAT TAAATACTAT 780 

GAAAATGCXA TTATTTATTC CCTTAATTTT TTTCCTCTCG CTATTACATT GCCAAAGTAA 840 

ACATCCTATT CAGATGTCTT TGTGCATGTG TGTGAATATT TCTTTAGTCT GGAGTCCAGT 900 

AAGGTGGATT TTTQGATCAA AGGGTTTGTT CTCTGTCCAC CTTCAGTCTT CCCAAAGGCC 960 

TTCATAACTG TATTTTCACC AAGTGTATGG AGAATGTTCA TTTCCCCATA TAACCATACC 1020 

TACACTTGAT AGTTTTTATC TGTTGGGCGA AAAAGAACCT TTTCTTATTT TGCATTTCCC 1080 

TGATTATAAA AAAAAATQGT GAGATTGGGG TTATTTTCAT GTTTATTGGC CATTTATAGT 1140 

TTACTGTGGA TTGTTTCTAT CCCTTACCTG CTTTCTATTG GGTTATGTGT GGATATATTG 1200 

TTTTTATTTG TTCAGCATCT CCTTCCCCAT CTTCTGGTAA CACAACCTTT ATTTATTTGT 1260 

GGGGAACCTA TTCCCTGTGG C1TAGGTGAG CATGTGACCA GGCCTGGCCT CCTGAGTCCC 1320 

ACAGCTTCCT AGCCACAGTG ATAAAAGAAT GGGTATATAA CTTAAGCCAG GCTAAGGAAA 1380 

GCCCTTAACA GAACTTCTGC TGGAACTACT GGAAAGAAGG CTTTATGGAG ATCCCAGGAA 1440 

CCAAGGACCA TGTAAGCCTG AATTTGTGCC ATGTGGAGAG AGTCTGTCTG AGGAGAAACT 1500 

CGGATGCTAG CAGAAATGGA AAGAGAACTA AGTTCTGATG TCATTTTTCT GGAGGCCCTA 1560 

GATCCAGCTG TGCCTAAAGC CTGCCCTACT CCGGACTTTA AAGTTTTGTG AGCCAATAAA 1620 

GTCCCTTTCT TGTTTAAGAT AATTGAATTG AGTTTCTGTT CTGATTAATA TAGGTTATTT 1680 

GTATTTTCTT ATTGATTTGT AGAAAACCTT TGTAATTTTA AATTCTAGAC TTTATGCACT 1740 

ATATAAGTTA ATAAAATTAG CATGGCCTTC CATG 1774 



(2) ItJFORMATION FOR SEQ ID NO: 95; 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2503 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: do\jble 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



10 GGCACGAGCG AAGGCAAGGG GGCACCAGCT CAGGACTGCA TCTGCCKXTC ATTTCCCTTC 60 



CACTCCTCCT TTCTGGAGTC TGACATTAGA AAGCCAGCGA GAAGGAAGAT TCAAACAACC 120 



AACCCTGATT TCCTGCTTCT CCTTTTCATG AGTGTTCCTG TGGTCTCTGC ACCTCCTTTC 180 

15 

TGTCCCCCGG CAGAGGGCAG TAGAGATGGC CGGCCCAAGG CCTCRGTGGC GCGACCAGCT 240 



GCTGTTCATG AGCATCATAG TCCTCGTGAT TGTGGTCATC TGCCTGATGT TATACGCTCT 300 



20 TCTCTGGGAG GCTGGCAACC TCACTGACCT GCCCAACCTG AGAATCGGCT TCTATAACTT 360 



CTGCCTGTGG AATGAGGACA CCAGCACCCT ACAGTGTCAC CAGTTCCCTG AGCTGGAAGC 420 



CCTGQGGGTG CCTCGGGTTG GCCTGGGCCT QGCCAGQCTT GGCGTGTACG GGTCCCTGGT 480 

25 

CCTCACCCTC TTTGCCCCCC AGCCTCTCCT CCTAGCCCAG TGCAACAKTG ATGAGAGAGC 540 



GTGGCGSCTG GCAGTGGGCT TCCTGGCTGT KTCCTCTGTG CTGCTGGCAG GCGGCCTGGG 600 



30 CCTCTTCCTC TCCTATGTGT GGAATGQGTC ARGCTCTCCC TCCCGGQGCC TGGGTTTCTA 660 



GCTCTGGGCA GCGCCCAGSC CTTACTCATC CTCTTGCTTA TAGCCATGGC TGTGTTCCCT 720 



CTGAGGGCTG AGAGGGCTGA GAGCAAGCTT GAGAGCTGCT AAAGGCTTAC GTGATTGCAA 780 

35 

GGGTTCAGTT CCAACCATGG TCAGAGGTGG CACATCTGCT CAGCCATCTC ATTTTACAGC 840 



TAACGCTGAT CTCCAGCTCC AGCGATGGAA CCCACTACAG AGGAGGTGGG GCCCCTGTGT 900 



40 CAAAGAGGCC GAGGQGCAGC AAGGGCAGMC AGGGCACCTG TGACTTCTTA GTACAAGATT 960 



GTCTGTCCTT CAGGACTTCC AAGGCTCCCA AAGACTCCCT AAACCATGCA GCTCATTGTC 1020 



ACACCAATTC CTGCTTTAAT TAATGGATCT GAGCAAATCT TCCTCTAGCT TCAGGAGGGT 1080 

45 

GGGGAGGGAG TGATTGCTGT CATGGGGCCA GACTTCCAGG CTGATTTGCC AAATGCCAAA 1140 



ATGAAACCTA GCAAAGAACT TACGGCAACA AACGAGGACA TTAAAAGAGC GAGCACCTCA 1200 



50 GTGTCTCTGG GGACATGGTT AAGGAGCTTC CACTCAGCCC ACCATAGTGA GTGGGCCGCC 1260 



ATAAGCCATC ACTGGAACTC CAACCCCAGA GGTCCAGGAG TGATCTCTGA GTGACTCAAC 1320 



AAAGACAGGA CACATGGGGT ACAAAGACAA GGCTTGACTG CTTCAAAGCT TCCCTGGACC 1380 

55 

TGAAGCCAGA CAGGGCAGAG GCGTCCGCTG ACAAATCACT CCCATGATGA GACCCTGGAG 1440 



GACTCCAAAT CCTCGCTGTG AACAGGACTG GACGGTTGCG CACAAACAAA CGCTGCCACC 1500 



60 CTCCACTTCC CAACCCAGAA CTTGGAAAGA CATTAGCACA ACTTACGCAT TGGGGAATTG 1560 
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TGTGTATTTT CTAGCACTTG TGTATTGGAA AACCTGTATG GCAGTGATTT ATTCATATAT 1620 

TCCTOTCCAA AGCCACACTG AAAACAGAGG CAGAGACATG TACTCTGGTG TGATCTCTTG 1680 

5 

TCCTCAGTGT CTCTTCTGGG CTCCTGTCCC TCTTGCTTTA TAGCTAGCTG CCCGGGGACC 1740 

AAGGTACAGG TGAAAGCAAG GTAGCAGCTT GCGGGAGGAG GCCTGTCTGG CTTACCAGTC 1800 

10 TATACACTGT GGCCTCAACC TCCCAGACAG GGCAGAGAAC TGTGGGCAGC TCGTTTGCTT 1860 

TCTAGQCTGG CTOGAGAGGT GGGAGCTCAT TGATAGACTC ATGATQGAAA CTATTTTTGA 1920 

AACAGGCTTC CTCCTTCAGG AGAGATCATG CGGACTAAAC TGTAGCAATT' CCAGTGCACC 1980 

15 

TQGCAGTGAT CCTTTTCTTT GCAAAGTACT GTCTCTTTGG TTCCAGTAAG TTGGACXZACC 2040 



ACATGACATY ATTTTCCCTG GAACCTGGTC ACTGACTAAC ACAGACAATT QQGACTCCAG 2100 
20 AGCCTCAAGA GCCAGGAGAG GGCACAGTAC ATACAGAGGG i^GTCAAATGG GATCTCATTT 2160 
TGAGTCCTGC CTTCCGCACA CTCAGAACGG CANCX:CX:AAG GCCCX3GAGTG TCCAGQGCTT 2220 



CTGGCCTGAG GTGAATCTGC CAGGCCCAAG AAGGCACAAA GGTAGGAGCA CAGAGAGCCC 2280 

25 

CATTCCCACA GGCGGKCGGC CCAGCAGCAC CAGTGGAAGC TCAGCTGTCC TCCAGCTGCT 2340 



CTCGGCAGAC AGTTCAGTGC ACAGTTTATG CCCTAGCTGA AAAAGATCTC CCGGACGTAT 2400 
30 TTCAGCACAT CCTCTTCCTC CTCCTCCTCA GGGCTCCTGC TACAGGCAGA GCTGGAACCC 2460 
CCCX5GCCTCT GGGAAGGGCT GAQGCCTQGA GYCAGTGCCT GTC 2503 



35 



(2) INFORMATION FOR SEQ ID NO: 96: 

(i> SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2801 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doilble 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

CTGGAAAGCC GAGGGTAGCC GAGCGGGGCG GGCGCTCTGG AGCGGCGGGT GCTCGGGCTG 60 

CCGTCCGCTC CGCCAGAAGC ACCGAGCAGC CGAGCCQGGG CCCGCCQCCC TCCTCCTCCA 120 

50 

TGAGGCCCGA GTGAGGCGCG GCGGCTATAG CCGACCCGCG GCGCCTTCCC CCCGCGTCCT 180 

ATCGCGAGCG CACGACMAGC GGCCCCTGGA GGAGGAQGCG GAGGAGGAGG AGCATGTCGG 240 

55 ACGGTTICGA TCGGGCCCCA GGTGCTGGTC GGGGCCGGAR CCGGGGCCTG GGCCGCGGAG 300 

GGGGCGGGCC TRAGGGCGGC GGTTTYCCGA AMGGARCGGR GCCTGCTGAG CGGRCGCGGC 360 

ACCAGCCGCC GCAACCCAAA GCCCCGGGCT TYCTGCARCC AMCGCCGCTG CGCCARCCCA 420 

60 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



GGACGACCCC GCCGCCAGGG GCCCAGTCCG AGGTCCCCXX: CAGCCCCCAG CGGCCTTCCC 480 

GQCCCGGQGC GCTCCCAGAG CAAACGAGGC CCCTGAGAGC TCCACCTAGT TCACAGGATA 540 

AAATCCCACA GCAGAACTCG GAiGTCAGCAA TGGCTAAGCC CCAGGTGGTT GTAGCTCCTG 600 

TATTAATGTC TAAGCTGTCT GTGAATGCCC CTGAA'TTTTA CCCTTCAGGT TATTCTTCCA 660 

GTTACACAGA ATCCTATGAG GATGGTTGTG AGGATTATCC TACTCTATCA GAATATGTTC 720 

AGGATTTTTT GAATCATCTT ACAGAGCAGC CTGGCAGTTT TGAAACTGAA ATTGAACAGT 780 

TTGCAGAGAC CCTGAATGGT TGTGTTACAA CAGATGATGC TTTGCAAGAA CTICTQGAAC 840 

TCATCTATCA ACAGGCCACA TCTATCCCAA ATTTCTCTTA TATGGGAGCT CGCCTGTGTA 900 

ATTACCTGTC CCATCATCTG ACAATTAGCC CACAGAGTGG CAACTTCCGC CAATTGCTAC 960 

TTCAAAGATG TCX3GACTGAA TATGAAGTTA AAGATCAAGC TGCAAAAGQG GATGAAGTTA 1020 

CTCGAAAACG AnTCATGCA TTTGTACTCT TTCTQGGAGA ACTTTATCTT AACCTGGAGA 1080 

TCAAGGGAAC AAATGGACAG GTTACAAGAG CAGATATTCT TCAGGTTQGT CTTCGAGAAT 1140 

TGCTGAATGC CCTGTTTrCT AATCCTATGG ATGACAATTT AATTTGTOCA GTAAAATTGT 1200 

TAAAGTTGAC AGGATCAGTT TTGGAAGATG CTTGGAAGGA AAAAGGAAAG ATGGATATGG 1260 

AAGAAATTAT TCAGAGAATT GAAAACGTTG TCCTAGATGC AAACTGCAGT AGAGATGTAA 1320 

AACAGATGCT CTTGAAGCTT GTAGAACTCC GGTCAAGTAA CTGGGGCAGA GTCCATGCAA 1380 

CTTCAACATA TAGAGAAGCA ACACCAGAAA ATGATCCTAA CTACTTTATG AATGAACCAA 1440 

CATTTTATAC ATCTGATGGT GTTCCTTTCA CTGCAGCTGA TCCAGATTAC CAAGAGAAAT 1500 

ACCAAGAATT ACTTGAAAGA GAGGACTTTT ITCCAGATTA TGAAGAAAAT GGAACAGATT 1560 

TATCCGGGGC TGGTGATCCA TACTTGGATG ATATTGATGA TGAGATGGAC CXTAGAGATAG 1620 

AAGAAGCTTA TGAAAAGTTT TGTTTGGAAT CAGAGCGTAA GCGAAAACAG TAAAGTTAAA 1680 

TTTCAGCATA TCAGTTTTAT AAAGCAGTTT AGGTATGGTG ATTTAGCAGA ACACAAGAGA 1740 

GCAAGAAAAT GTGTCACATC TATACCAAAT TRAGGATGTT GAGTTATGTT ACTAATGTAT 1800 

GCAACTTTAA TTTTGTTTAA CACTATCTGC CAAAATAAAC TTTATTCCCT ATAACTTAAA 1860 

ATGTGTATAT ATATATAATA CnTATTATG TACAGTTAAT TCTACTGTTT TGGCTGCAAT 1920 

AAAATCGATT TTGAAATAAA TGAAATGTTG AAAATTTTGC TAGTTGGTTA GATGCTTATC 1980 

CTTTAAATTC TACTTTTCTT GAGQGGAAAA AGTCTTCGTC TGGAAATACA TATTACTGCA 2040 

AAAATGTAGC ATCCTTTTTT AGGTAGGAGT ATTATAGCTT YCATTTTAGT TKGACATTTA 2100 

GTGTCCCAAT GAATTGAATT TCAAATATGA ATCATAATCT TGAAAATCTT TAGCACTAAA 2160 

GTCTTGGGAA TATATCAACA ACTGATTTAC ATATGCAGAT GCTATTTGNA TACCAAGGGC 2220 
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TTTTTAAATG TCATGGGGGG GAAAAACCCA ACTTGGTGGA ACTCCCAGCT AAACAACCAA 2280 



GACTTCACTG GAAGATTTAT 'TCCAATTCTA GGAATTGTTC TTTTTTATTT TTATTTTTTC 2340 



5 AACTGRCTAA CTTCATTACC TTAAAGCCTA GAACATTATT CTGCTTTATT TATATQGCTT 2400 



TCTCACTTTT ATTTTOTAGC AKGGGTTGCA TCGACTTTTT TACTAGAGAA TTTTACTAGA 2460 



TATTTGTCAT TCAAGTTTTC ATCTGCTTTA TAATTCATAC ACCTTGAGGG TXiACTTTTCT 2520 

10 

AATACTTTTA CTATAATGTG GTACCACCTC AGCCCTAATA AATAATATTT TTACCTAATG 2580 



TCAAATCTTT TTCCAGCTAA CTAAAAACTG TGTACAAAAG GATTGCTTGT AAATATGCAT 2640 
15 GTAAATAGTT CTGTTAATAA CCCACTGTTT TACATTTGGT ACATCTGTGT CTGCTAATAC 2700 



AGTTAGCTTT CTCACTTITC TGCTTGTTTG TTCAGTCTGA ATTAAAATTA GACTTTGAAA 2760 



ATAAAGCTTA AAAAAAAAAA AAAAAAAAAA AAAAACTCGA G 2801 

20 



(2) INFORMATION FOR SEQ ID NO: 97: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
|0 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 



ATGGAGCCAA AGACAATCAC TGATQCTTTG GCTTCTAGTA TAATTAAGAG TGTGCTGCCT 60 

35 

AATTTTCTTC CATACAATGT CATGCTCTAC AGTGATGCTC CAGTGAGTGA ACTGTCCCTC 120 



GAGCTGCTTC TGCTTCAQGT TGTCTTQCCA GCATTACTCG AACAGGGACA CACGAQGCAG 180 



40 TGGCTGAAGG GGCTGGTQCG AGCGTGGACT GTGACCGCCG GATACTTOCT GGATCTTCAT 240 



TCTTATTTAT TGGGAGACCA GGAAGAAAAT GAAAACAGTG CAAATCAACA AGTTAACAAT 300 



AATCAGCATG CTCGAAATAA CAACGCTATT CCTGTGGTGG GAGAAGGCCT TCATGCAGCC 360 

45 

CACCAAGCCA TACTCCAGCA GGGAGGGCCT GTTGGYTTTC AGCYTTACCG CCGACCTTTA 420 



AATTTTCCAC TCAGGATATT TCTGTTGATT GTCTTCATGT GTATAACATT ACTGATTGCC 480 



50 AGCCTCATCT GCCTTACTTT ACCAGTATTT GCTGGCCGTT GGTTAATGTC GTTTTGGACG 540 



GGGACTGCCA AAATCCATGA GCTCTACACA GCTGCTTGTG GTCTCTATGT TTGCTGGCTA 600 



ACCATAAGGG CTGTGACGGT GATQGTGGCA TGGATGCCTC AGGGACGCAG AGTGATCTTC 660 

55 

CAGAAGGTTA AAGAGTGGTC TCTCATGATC ATGAAGACTT TGATAGTTGC GGTGCTGTTG 720 



GCTGGAGTTG TCCCTCTCCT TCTGGGGCTC CTGTTTGAGC TGGTCATTGT GGCTCCCCTG 780 



60 AGGGTTCCCT TGGATCAGAC TCCTCTTTTT TATCCATGGC AGGACTGGGC ACTTGGAGTC 840 
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CTGCATGCCA AAATCATTGC AGCTATAACA TTGATGGGTC CTCAGTGGTG GTTGAAAACT 900 

GTAATTGAAC AGGTTTACGC AAATGGCATC CX3GAACATTG ACCTTCACTA TATTGTTCGT 960 

5 

AAACTGC5CAG CTCCCGTGAT CTCTGTGCTG TTGCTTTCCC TGTGTGTACC TTATGTCATA 1020 

GCTTCTGGTG TTOTTCCTTT ACTAGGTGTT ACTGCX3GAAA TGCAAAACTT AGTCCATCGG 1080 

10 CGC5ATTTATC CATTTTTACT GATGGTCGTG GTATTGATGG CAATTTTGTC CTTCCAAC3TC 1140 

CX3CX7VGTTTA AGCGCCTTTA TGAACATATT AAAAATGACA AGTACCTTGT QGGTCAACGA 1200 

CTCGTGAACT ACGAACGGAA ATCTGGCAAA CAAGGCTCAT CTCCACCACC TCCACAGTCA 1260 

15 

TCCCAAGAAT AAAGTAGTTG TCTCAACAAC TTGACCTTCC CCTTTACATG TCCTTTTTTG 1320 

TGGACTTCTC TCTTTGGAGA TTTTTCCCAG TGATCTCTCA QCX5TTGTTTT TAAGTTAAAT 1380 

20 GTATTTGACT TGTGTTCTCA GCATTCAGAG AGCAGCGGTG TAAfiATTCTC-CTGTTCTCCr 1440 

TGGATCTTCT GACATTACTG CTGTCTGAGA TTTGTATATG TGTAAATACA AGTTCCTTGA 1500 

TACCCTAAAA CCTTGGATTA AACAGAATGT GCATTGTACA TCTTTAAACA AAATGTATAT 1560 

TAATTTATTA AATCTAGTTG TCACTTTAAA AAAAAAAAAA AAAAAACTCG AGGGGGGCCC 1620 

GGTACCCAAA T 1631 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 98: 

35 U) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS : double 
(D> TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
CCGAGCTQQG CGAGAAGTAG GGGAGGGCAC GAGCCGCCGC GGTGGCGGTT GCTATCQCTT 60 
45 CGCAGAACCT ACTCAGGCAG CCAGCTGAGA AGAGTTGAGG GAAAGTGCTG CTGCTGGGTC 120 
TGCAGACGCG ATGGATAACG TGCAGCCGAA AATAAAACAT CGCCCCTTCT GCTTCAGTGT 180 
GAAAGGCCAC GTGAAGATGC TGCGGCTGGA TATTATCAAC TCACTGGTAA CAACAGTATT 240 

50 

CATGCTCATC GTATCTGTGT TGGCACTGAT ACCAGAAACC ACAACATTGA CAGTTGGTGG 300 
AGGGGTGTTT GCACTTGTGA CAGCAGTATG CTGTCTTGCC GACGGGGCCC TTATTTACCG 360 
55 GAAGCTTCTG TTCAATCCCA GCGGTCCTTA CCAGAAAAAG CCTGTGCATG AAAAAAAAGA 420 
AGTTTTGTAA TTTTATATTA CTTTTTAGTT TGATACTAAG TATTAAACAT ATTTCTGTAT 480 
TCTTCCAAAA AAAAAAAAAA AAAA 504 
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(2) INFORMATION FOR SEQ ID NO: 99: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANZ3EDNESS: double 
10 (D) TOPOLOGY: linear 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

QGCACGAGOG AGGGAGCCCT CTCCGTTGGG TGACTCTTGT GTGCCCTTTA GACAGGCTQG 60 

15 

CCTGCCGGTT CCACAGGGTA CAGTTAGGAC TTGAGTCTTT CTTTTTCTGT TTTGAGTTGG 120 

TGAGTGAGTG ATAGGGTAAC ATGGGCCTTC AGGATGACCC CTTGGAACTG TGCCGAGTTC 180 

20 CTTAAATCTC AGCTGGGATC CTQGACCTGG GAGGCCOCTG TGAGGGCCAG CTCTGGAAAA 240 

ACCTGGGAGT TGATGCCGGA GCTGTGGAAG AACTCTGCTC GAGQGCAGGG TGCCCTGGAA 300 

CACTGGTAGT TCTGGGGCTG GGAGGGAGAG GGGCTCCGGC TTTCTCTGAA ATGAACACTG 360 

25 

CTCTTCAGCA GTTCAAGTAC TTGTTCTCAA AACATTTTCT AATTGATTGG TAGGTTTTCA 420 

TAAGCATTGT TTCTTTAAGG CATGGAAAGG GAAGAATGCT CAAGCAAGTC ATGTTTGTTT 480 

30 TCAGTGGGAT GGGCCCGCGT TCTCACTGCT GGGGGCTTCC CCTTCATGTG GCACCTTTGT 540 

GCAGGGGCCA CCAGGCAGAC TCTTCCCACC TTCTCCCACT GAAGCACCAA GGGGCTTGGA 600 

ACCGTAATTT GGCTAATCAG AGGCATTTTT TTTGTCCTAG TATCTTTCAC ACTTGTCCAA 660 

35 

CCGTCTTATT TTTTTAAAAG TTCTGTTGCT TGTATTAACA CGAAACTAGA GAGAAATAGT 720 

TTCTGAAGCC AGTTTATTGT GAAGATCCCC AAGGGGAGGT TCGGTAGAGA AAAATAGTAA 780 

40 GCTGGTTTAG AAACTGACGA GQGCAAACAG CCAGGAOGCA TTGGAGAGGA ATTTGCCAAA 840 

GATCTACCCT GAGATAACGC CTGTCCAGTG TCTTCACCAC GTGAATAACC AGCGCTCCAA 900 

AGTGTTTTTC TGCTTTGAAA AAAAAAATTC CACAAGCTTT TAAAGGTGCA TTTAAGAATC 960 

45 

CATGTGACTT TAGAATGGAA CTGCCGGCCC TCXX:AACTGT CACGTGTGCT AGAAGGTTCG 1020 

ATGCCTCTGG AATGCATGTG ATACTCATCT CCATTTTGTT TCCTTGATTG CATTTTTGTT 1080 

50 CTTTTAGCAG ATCTGTCCCT GTGGGTGGTG TCTAAGAAGT CQGACACCTT GGTTTTTGTG 1140 

TTAGATTGAG CTGGGCAGCT GCAATCAGCT TCTTTATATG CAAATTAGGC ACGACCCATC 1200 

TGTGGTTCCT GGTTGGTGGC TAATGAAGTG AGGGGAGGGA GGGATGTCAC CCCAAAAGTA 1260 

55 

GGCCCrCCCA TTGGCTTTGG CCAGGCCAGA CACTTCACAT CGTTTACATG GTTCTGTGTA 1320 

ATTTTAAAGT TTATGTGTAT AAAGCGAAGC TGTTTCTGTG AAACTGTATA TTTTGTAAAT 1380 

60 AAATATATTG CTACTTGAAA AAAAAAAAAA AAAAAA 1416 
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5 (2) INFOHMATION FOR SEQ ID NO: 100: 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2847 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
15 GGCTAGGACA ATTTTGGTGC TTTACCTATC TCTGCAAAGA CTGGAGAATT TGGCATACCA 60 

TTAATTACAA CCACCAATCA TATCCAACAA AAGTACCCTA AAAGAAGGAC CAGTGGCCAC 120 

TCTCGAAAAA ATTTAAGTAT CAGAAGATTA AAAAGATTTT AGGATTTGGA AGCTTGTATT 180 

20 

GTCTTTCCCC AATAATCATT GTTTGATCTC CAAATAGTAG CCTTATATTA GCAATRGACA 240 

GATCATTGGT TCTCCATATC TGATCATATG TTACTACTTT GGAATCAGTA TTTGGGCAAA 300 

25 TTCAAGCATT TATCCAGTGG ATATAAATGG AAATATAAAA ATATTTGCCA ACCTGTCTCA 360 

GTAACTTATC ATATCTCTGT GNATCCTCAA GGAAAGCACT TTTGCTTTTA CTTAGAAAGC 420 

GTTTCAGATT TGCTTTATAG ACTCCTGCTG TCTTCAGTAC CTGATAAAAC TTTAACCAGG 480 

GAAGCATTAA ACACAGTGCA GCAGCTTTTG CCCAGGCTTC TAAGTTCCTG CCGGCAGCAT 540 

TTATCAATGT AAGAACTAGG ATGCTTCCTG CAGTGGCACT ACCTTCCCCT AGAGCTGGAG 600 

35 CATGCTGCTT GGCCTTAAGC CCCAGCATGA TGAGGCTTCC CTCCTGCCAG GTCAGTAAAA 660 

GTTAGAGAGC TCAGAATTGG GTCTTGCCTG GGTGCAGGTG GCAGGGTTTG CTGAAACCCC 720 

T7VAAGAGAAG TCACCAAGGG AGGCAGGTAA TGAATGTTTC CAGAATCAGT CKGATACTCA 780 

40 

TAGCAATTTC TX3GCTATCTT TCAAATGTTG AATTTCTGGA TGCTGAGAGG GACTTTGATT 840 

TGATATCATT AAATCCAGGA CAGTCCCAAG AAGTGCTTGG AGTCTCGGCT CTGACAGCCC 900 
45 AAQAAGGGAA ATAACTTGTA TTAAGGAACA ACTATGAGCC AGGCCCTGAG CTGTCTCTTA 960 

GATAATAAAA CAGATGGGGA GTGGAAGAGT CATTTGCTTC AAGTTATACA GCTAGGAAAT 1020 

ACTCAAGCCA AATCTTGAAC GCAGCTCCCC CTAATTCTGT GGACAGGCAC TTTGTACCAC 1080 

ACACCAltSGT CCACCTA7\AA ACAGAAGGAT AAAAAGACTT CAGGTTTTCC CACTGTGTGC 1140 

TGACCATCCC AATTTA3KSAA TCTTCTTCAA AATGACATTT CACAGTTATA GTTAGGGCTC 1200 

55 AGAAATOGCA TTH3AGGTAGC CTTATTTCTC CCCTTTAGCA GATGCTTTAA GTACACATTG 1260 

CTGACITCAG CCCACCCCCA GGAGTTAGGA GAACATTTCC TTTTTCATGC CATCTTCCAT 1320 

AAATAAGGTG TTTCTTGGCC TTCAAAGATA TAGAACTTTG CAGCAGTAGT AAAAGTGAAG 1380 



50 
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GGTGTTCTGC TCTCTACTCA ACTTTATTTG 
AGGAAATCTT CATATTTTAG TAAACTTAGC 
5 TCAAAGTCCA GTGAATCTQ6 CTCTCTTACT 
AGTGTGTACC TATATATAAA QGACAAGTGT 
CATGTCCACA CACACACACA CAATATTTGA 

10 

ATTATCTTGC GTACTACTTC AAAGATTTCT 
GTTGGGGCTT AGGTACTTGC TTACAGGAAG 
15 TAAGGCACTG AGTCAAAGTG ACAGCCCTGA 
TGTCTAATAA GATGGGAAAC TTGGATGCCC 
CTCCAGTTAG ACCTAAGGGC ACAAATGCAG 

20 

TAGGAAGTCC TCTCTCCCCA AGTAGAAAAT 
CATATAATGG CTGTGCAATA CATGCTTCTC 
25 TGCTGATCAC ATCAGATTTT TATGTTTAAA 
GCTCTAAGAG AAAAAGAAOG CCCATATGGG 
CCAGCAGTGC TTATRAAGCC CCCTACCCTG 

30 

GTTCTTGATT CTGGAGGCCT GCCTGGTAAG 
TACCAGAAAC AGCAGAACGA GGGCCAGAGC 
35 GGATACATTG GTGCAAAAAA AGCCACGGGS 
GACAGCAGAT TAATACTTAA TGAGGGTTAA 
ACACTGCATG AATGGGGAGA ACCAATGAAT 

40 

AGTCACATTC CCTCCTTAGG AATCTTCCCC 
TGAATCTTTC AAGGGAATTA CACGTTTGGG 
45 TAAATTATTT TGTAAGAGAG ATTTACTGCT 
TGCATTTOGA AATCAATAAA CTATTACTGG 
NAAAAAACTC GAGOGGGGCC CGTACCC 

50 



AAAATGTCTG CAGCTTCACT CCTGTAGAAA 1440 

CGCCAGTGTA CTCTGTGAGG ATGTGGCAAT 1500 

GATTCCTGGT TTTAGTGTGT GTGTCGGGGG 1560 

GATATGTGTG TATATGTATA TACATACATA 1620 

GAGCTAAGGA AAACTCAAAG CAGCXirCTTC 1680 

GTCAGCXXTA ATTACAAGTG TCACCATATA 1740 

AGCAATTCCC TAGCAAAGGT CATTAGCTCC 1800 

AGGAAATTGC ACTCCAGCCC TCCTCCAGGA 1860 

AGCCATTTTG GTGACCTGAG AGTCTAACTA 1920 

AATTCATGAC CTTGTAGTTG TGGCAGGGTC 1980 

ATTCrCTTGC CATTCCTGAA ATTCCACATT 2040 

AATAAGAAAA TTAACTQCAT GTTTACTGTG 2100 

AAAATCTCAT TATGGNTTGA GTCCAGCCCA 2160 

AGACTTCAGT CTCATTATTA TTGCCTTTAT 2220 

TCCCATTCCA GAAACCATAA GACTCAGGCA 2280 

ATAAGATAGT ATAATTTGGA ACTGAGAACA 2340 

AGAAAAATGA AAATAAGTGG AGACACTTAT 2400 

CCCATACTGG GCTTGATATG ACTTTGAGGG 2460 

ACCTGACCAG TCTTTCTACA GTGACAGGCC 2520 

CCATTGTCCT CTGCCTATTT TCCTGTGCAC 2580 

TTCCACXXnT TACATTAAAC AAGGGAACAC 2640 

TTAATGTTTC AGTATATCAT TTTCATACTG 2700 

ATCCCAGGAT GrTCGGACTT GGTGCCCCTG 2760 

AAATGCCAAA AAAAAAAAAA AAAAAAAAAN 2820 

2847 



55 



(2) INFORMATION FOR SEQ ID NO: 101: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
60 (D) TOPOLOGY: linear 
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Z5 



35 



45 



60 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
GAGATTGGTG GAQGAGAGTA AATAATCTAG AGGCAAGAGT TCAC?rGAGGG CCAAGGGGGA 

5 

CCCCCAGAAA AAGGTATGGA GCTAACTCAT CTCmTACA AGGGGTGGCC ATGACTTACT 120 

GTTCCAAAGT ACTCAGTGTA TATTTAATGT TGATTGTTGA ATTTTAGTTA CGAGAGQGAA 180 

10 GAACAATTTT ACTTCTGTCC TTATTTCACT TGCTGAAAAG CTGTGGGACA AAATGTATGG 240 

AATAGACAAG GCCACTTTCT TTCTCATTTC TGCTTTTCAT GCATATTATT TTATTTACCC 300 

ATAATTTCCA AGAQGTITOG CGTTCCGCTC TCCTGCTTTT TTCTTTCATC CACCCCITTC 360 

CTTTTTTTGG AAGGGGGTTA TATATGftGAG TTCATTGAAG AAGTCCAGTG AGGCTGAAGT 420 

AAAGGGGCAA GATAGGGCAG TTAACTAAAG AGCACTTTAT TTCTTTGAAG CCTTTCTAAG 480 

20 AAAGAAATGG GGGTCCGAGT GGCTTGAATC TCCCATGATG TTGGAGGGCA CTTAGTQGGG 540 

TTCAAGTATG ACATAATATT TCCCATTGGG GAAAGGAGAA TTTCTCTTAG AGGGTGGCAA 600 

AATGCCTTTC CCCAGTGTCC CTATTTTAGG CATCTTTTCC TTCCTTATTC CTTCCAGTCA 660 

GGGTGTGTCC TATACAAAAC TTCCCATCAG TTCTCCTCAA TATTCCCCAT TTGTAAATGA 720 

TCACTTCTCT TTTCTAAACC CTTTTCCTGT TCAGATCCAT ACAGGATTTG CAAGGGTAGG 780 

30 ATCATACATG CAAATGCCCC TTCTTCATCT GTGTCTTCTG CAAACTAGTC TCATGAAGAA 840 
TTCTGGCGTC CAGCAGGGTA GCTGAAGTTT GGGTCTGGGA CTGGAGATTG GCCATTAGGC 
NTCNCTCAGA TTCCAGCTCC CTTCCACCAA GCCCAGTCTT GCTACGTGGC ACAGQGCAAA 

CCTCACTCCC TrrGGGCCTC AGTTTCCCCT CCCCTTCATG AAATGAAAAG AATACTACTT 1020 

TTTCTTGTTG GTCTAGCATT GCTGGACACA AAGTGTAGTC ATTATTGTTG TATTGGGT6A 1080 

40 TGTG1H3CAAA ACTGCAGAAG CTCACTGCCT ATAAGAGGAA ATAAGAGAGA AAGTGGAGGA 1140 

GAGGGACAAA AGGAGTAATT ATTIGGTATA GATCCACCCA TCCCAACCTT TCTCTCCTCA 1200 

GTCCCTGCTC CTCATCTTTC TGGTrTGGTG AGTCCTTTGT GCCACCACCC ATAATGCTTT 1260 

GCATTGCrcC ATCCTGGGAA GGGGGTATAT GGTCTCACAA GTTGTTGTCA TTCTTTTTTT 1320 

GCATGCTTTC TTAATAAAAA AAAAAAAAAA ATGTTTANAG TTTTATCTTA AAAAAAAAAA 1380 

50 AAAAAAAAAA ACCC '^^^^ 



900 
960 



55 (2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CH/^CTERISTICS: 

(A) LENGTH: 794 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTI<»I: SEQ ID NO: 102: 

GGMRCGAGGC GGAGTAAAGG GACTTGAGCG AGCCAGTTGC CGGATTATTC TATTTCCCCT 60 

CCCTCTCTCC CQCCCCGTAT CTCTTTTCAC CCTTCTCCCA CCCTCGCTCG CGTACCATGG 120 

CGGAGCGTCX3 GCGGCCACTC AGTCCCATTC CATCTCCTCG TCGTCCTTCG GAGCCGAGCC 180 

GTCCGCGCCC GGCGGCGGCG GGAGCCCAGG AGCCTGCCCC GCCCTGGGGA CGAAGAGCTG 240 

CAGCTCCTCC TGTGCGGTGC ACGATCTGAT TTTCTGGAGA GATGTGAAGA AGACTGGGTT 300 

15 TGTCTTTGGA CACGCTGATC ATGCTGCTTT CCCTGGCAGC TTTCAGTGTC ATCARTGTGG 360 

GTTTCTTAMC TCATCCTGGC TCTTCTCTCT GTCACCATCA RCTTCAGGAT CTACAAGTCC 420 

GTCATCCAAG CTGTWCAGAA RTCAGAARAA GGCCATCCAW TCCAAAGCCT ACCTGGACGT 480 

20 

AGACATTACT CTGTCCTCAG AAGCTTTCCA TAATTACATG AATGCTGCCA TGGTGCACAT 540 

CAACAGGGCX: CTGAAACTCA TTATTCGTCT CTTTCTGGTA GAAGATCTGG TTGACTCCTT 600 

,25 GAAGCTGGCT GTCTTCATGT GQCTGATGAC CTATGTTC3GT GCTGTTTTTA ACGGAATCAC 660 

CCTTCTAATT CTTGCTGAAC TGCTCATTTT CAGTGTCCCG ATTGTCTATG AGAAGTACAA 720 

GACCCAGATT GATCACTATG TTOGCATCGC CCGAQATCAG ACCAAGTCAA TrCTTGAAAA 780 

GATCCCAAGC AAAA 794 



30 



35 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1544 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

45 

TTTGCTTGCT AGTCTGAACC AAAGAGTTGT TTGGGCATTT GCTGTGTTGG CCATTTCTGG 60 

AGCAAGAGGG TCTTCTTCCT CCTTCCCCCA GCCAGCCAGC TGTCCTGGGG CCAGGCTTTC 120 

50 CTGGGTGGAA AGAAGTATAC CTTTCCCTGG GGCCCTAGGA TAGCAAAGTG AGCCATAGTG 180 

GGCCAGGCTG CCCTCCATGC TGGGCCCCAG CCCAGGTCTG CACTCGCCTG GATCACCTTC 240 

TTTGAGCCTT AGCCATCTCC TGTCAGGTAG GAATGAACTT GCCAGCCTTC AGGYTCGTTC 300 

55 

AGCTATGACC ATCTGTGCGG TCAGGGTACA CTCAGCTCTC CTCCCCAACT CCAGCAGCCT 360 

TTAAGAAGTG TCCCTTTGGC GCCCCCTQGA GGCAGAGCAC TGAGCTGGAC CCTGGGTAGA 420 

60 CTCCCACAGG GAGGACGGAG CTGGCCTCAG GAGTGGGACA CCCAGACTTG GCAGGGCCTT 480 
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CAAGAGGCCr OTOreGGQGC CCCflGGRWC CTTAGCTGAA GCGOSGfiGAC TCACTCICCA 
TCTCAGGAAA TTCTAGCCCT TCCCCTCflGG GAGCCftCBGT TGAGGGTGAG GCCCAACACC 
^ TCCCTr«3C3G CCCreCJSraG GCAflGTCTGG GCCCTGGGCJP AGGGAGGGAG i«rrCAGGCCC 
;«:MrrroGCT ATmCTAAT TICAGACAAA CACACACTCA GCXSCGCACTC ACTGATTCCr 
10 ACACAITCCC AAGA-mCAC ACATOTCACC AGGGGCCACC AAAGTCCCTG TGACXnTTOT 

gactaggatc cTAA-mCTc TArmcrcc tgqctgcctg ggtctctctc acctggggca 

GICTGGATAA TOnTAOTTC T.JKaCACTC TTrTTTOGGG GTGGCACCTC GTTCrCCGAT 
GCCIOSGCTC Cn^nCAGGCC CAGGACTOTA OTGCTC^O CAGTAAAGCT CAGCTCTCTO 960 

1020 

1080 
1140 
1200 



540 
600 
660 
720 
780 
840 
900 



TAA-KASTGA -TCCTATCGCT TCCTCGIGTC TTA-reATCCA ATCOTmrr ACATCAGCCC 
20 TTOITITCTr TTAIGGCTAG TCTTATCTOG CCTOGTTATT TCCTTGCGGG GAGGAGAGGG 

rrrecTAAK: •toctcccagc ccaacctatt accaccocac ctcgctggga cctactgctc 

GGGAGQCAGC AGACAGOGAG CCACCAGCAG TCGCTTCCTC GCCCTCHGCT GGGGGTGGGG 
GGAAGCTOGGGGCaCAl^GCCCrTOCCTaCK^ 
C3OTCCCACATCATAAAAGAAAAGGGAGGTAa«a««^ 
30 GCTOCTTCOT ATCTOTAAAT CTTTAATAAA TATCTGAGCA TGTATCTATC AACGCCAAGA 
ATTTCAAAGT CTCCTTCAAC AATATCAGGC ITITAGGATG TITATATrCX: TTCATCCCTC 

TOnrrcccA GcnrrrccAG ggaaaaaaag TCTGGAKrrA tagatacagc ttattattaa 

35 1544 
ATTICTTCTT GCATAAAAAA AAAAAAAAAA AACNCNNCSQG GGGG 



1380 
1440 
1500 



40 



t2) INFOPMATIC»l FOR SEQ ID NO: 104: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
ACCCACGCGT CCGNCITCTC CACCCGGGGG CGTCGGAG^ AGGTACCAGA TTCAGCCCAT 60 

nx^cccGA cGCca^ix^rr ci^ggaatcc gggix^g gatigaggtc ccgcttccta 120 

55 AGCmSGGICG CTOICOVCCC GGGGGCOKSG GAGTX.AGGTA CCAGATIxrAG CCCAn^ 180 

CCCGACGCCT CI.;nrrCGG AATCCGGGIO CI«CGGATrG AGGTCCCGGT ^CTAACGGA 240 

CKSCAAGAIK. GAGGAAGGCG GGAACCTAGG AGGCCK^TT AAGAOK^GTCC ATCTACTX3GT 300 

60 
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CTTGTCAGGT GCCTGGGGCA TGCAAATGTG GGTGACCTTC GTCTCAGGCT TTCCTGCTTT 360 

TCCGAAGCCT TCCCCGACAT ACCTTCQGAC TAGTGCAGAG CAAACTCTTC CCCTTCTACT 420 

TCCACATCTC CATQC3GCTGT GCXTTTCATCA ACCTCTGCAT CTTGGCTTCA CAGCATGCTT 480 

GGGCTCAGCT CACATTCTGG GAGGCCAGCC AGCTTTACXTT GCTCTTCCTG AGCCTTACGC 540 

TGGCCACTGT CAACGCCCGC TGGCTGGAAC CXXX3CACCAC AGCTGCCATG TGGGCXrCTGC 600 

AAACCGTGGG AGAAGGAGCG AGGCCTGGGT GGGGAGGTAC CAGGCAGCCA ACAGGTTCCC 660 

GATCCTTAAC GCCAGNTGCG AGAGAAGGAC CCCAAGTACA GTGCTCTCCG CCAGAATTTC 720 

15 TTCCGCTACC ATGGGCTGTC CTCTCTTTGC AATCTGGGCT GCGTCCTGAG CAATGGGCTC 780 

TGTCTCGCTG GCXTTTGCXrCT GGAAATAAGG AGCCTCTAGC ATGGGCCCTG CATGCTAATA 840 

AATGCTTCTT CAC5AAAAAAA AAAAAAAAAA A 871 

20 



10 



(2) INFORMATION FOR SEQ ID NO: 105: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRiPTIC»I: SEQ ID NO: 105: 

GGCACGAGTT ATAGCATGGC ATTCATACTT TTGTTTTATT GCCTCATGAC TTTTTTGAGT 60 

35 

TTAGAACAAA ACAGTGCAAC CGTAGAGCCT TCTTCCCATG AAATTTTGCA TCTGCTCCAA 120 

AACK5CTTTG AGTTACTCAG AACTTCAACC TCCCAATGCA CTGAAGGCAT TCCTTGTCAA 180 

40 AGATACCAGA ATGGGTTACA CATTTAACCT GGCAAACATT GAAGAACTCT TAATGTTTTC 240 

TTTTTAATAA GAATGACGCC CCACTTTGGG GACTAAAATT GTGCTATTGC CGAGAAGCAG 300 

TCTAAAATTT ATTTTTTTAA AAAGAGAAAC TGCCCCATTA TTTTGGTGGG GTTGGTTTTT 360 

45" 

AATTTNTAAT NTGAAAAATT TTTTTGGGGT TTTTQGGGCC ATGG 404 



50 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEI»3ESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
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GTCAGACAGG TGC5AGCCGCC GGGGCAGGAG TCTCAAAGAG CCAGC5CTCCA GGAGAGGAAG 
GGCTCTRCGA GAGGAGAGAG GAGAGCGCTG GAGAGGAGAG GCTGGAGAGT CCTTAGCCAG 
GATGGAGGCT GTTGTGAACT TCTACCAAGA GGTGATGAAG CACGCAGATC CCCGGATCCA 
GGGCTACCCT CTGATX3GGGT CCCCCTTGCT AATGACCTCC ATTCTCCTGA CCTACGTGTA 
CTTCGrrCTC TCACTTGGGC CTCGCATCAT GGCTAATCGG AAGCCCTTCC AGCTCCGTGG 
CTTCATCATT CJrCTACAACT TCTCACTGGT GGCACTCTCC CTCTACATTG TCTATGAGTT 
CCTGATOTCX; GGCTGGCTGA GCACCTATAC CTGQCGCTGT GACCCTGTGG ACTATTCCAA 
CAGCCCTCAG GCACTTAGGA TGGTTCGGGT GGCCTGGCTC TTCCTCTrCT CCAAGTTCAT 
TGAGCTGATC GACACAGTGA TCTTTATTCT CCGAAAGAAA GACGGGCAQG TGACCTTCCT 
ACATOTCTTC CATCACTCIG TCCTTCCCTG GAGCTGGTGG TGGGGGGTAA AGATTGCCCC 
GGGAGGAATG GGCTCTTTCC ATGCCATGAT AAACTCTTCC GTGCATGTCA TAATGTACCT 
GTACTACQGA TTATCTCCCT TTGGCCCTGT GGCACAACCC TACCTTTGGrT GGAAAAAGCA 
CATGACAGCC ATTCAGCTGA TCCAGriTGT CCTGGTCTCA CTGCACATCT CCCAGTACTA 
CTTTATGTCC AGCIGTAACT ACCAGTACCC AGTCATTATT CACCTCATCT GGATGTATGG 
CACCATCTTC TTCAIXSCrGT TCTCCAACTT CTGGTATCAC TCTTATACCA AGGGCAAGCG 
GCK3CCCCGT GCACTTTCAGC AAAATQGAQC TCCAQGTATT GCCAAGGTCA AGGCCAACTG 
AGAAGCATGG CCTAGATAQG CQCCCACCTA AGTGCCTCAG GACTGCACCT TAGGGCAGTG 

TCCCTCACyrG CXXrrCTCCAC ctacacctgt gaccaaggct tatgtggtca ggactgagca 

GGGGACTGGC CCTCCCCTCC CCACAGCTGC TCTACAGGGA CCACX3GCTTT GGTTCCTCAC 
CCACTTCCCC CQGGCAGCTC CAGGGATGTG GCCTCATTGC TGICTGCTAC TCCAGAGCTG 
GGGGCTAAAA GGGCTGTACA GTTATTrCCC CXTTCCCTGCC TTAAAACTTG GGAGAGGAGC * 
ACTCAGGGCT GGCCCCACAA AQGGTCTCGT GGCCTTTTTC CTCACACAGA AGAGGTCAGC 
AATAATGTCA C1GTO3ACCC AGTCTCACTC CTCCACCCCA CACACTGAAG CAGTAGCTTC 
TGGGCCAAAG GTCAGGGTGG GCGGGGGCCT GGGAATACAG CCTGTGGAGG CTGCTTACTC 
AACTTCTGTC TTAATTAAAA GTGACAGAGG AAACCANAAA AAAAAAAAAA AAAAACTCGA 
GGQGGGCCCG TACCCAAATC GCCGGTATGA TCGTAAACAA TC 



60 
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55 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2327 base pairs 
60 (B) TYPE: nucleic acid 
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60 



(C) STRANDEa»JESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GGTAGCTCAN TC3CAGTGAAA TAGTCTTACT GGAAACAAAG CCCTTTATCA AGAATAATTA 
ACTCTTCCCT 'I TA V n Ti m GAGAGGTGCT TTGTTTCTGA TCGGACCATT TCACTGCAGC 
AAGCAACACA CTATTCTRAG CAGAAGATCG GGACTTGAGG CCATGTTGCG GAGGGCCAGT 
RACATTATCT GGACTCTGGA GTGTGAGGAA TATGGACTCC ACTCTTCACT ATATTCACAR 
CGATTCAGAC TTC3AGCAACA ATAGCAGTTT TAGCCCTGAT GAGGAAAGGA GAACTAAAGT . 
ACAAGATCTT GTACCTCAGG CGTTGTTAGA TCAGTATTTA TCTATGACTG ACCCTTCTCG 
TCCACAGACG GTTGACACTG AAATTGCTAA GCACPGTQCA TATAGCCTCC CTGC?rGTC3GC 
CTTGACACTC GGAAGACAGA ATTQGCACTG CCTCAGAGAG ACGTATGRGA CTYTGGCCTC 
AGACATGCAG TGGAAAGTTC GACGGAACTC TAGCATTCTC CATCCACGRG CTTGCAGTTA 
TTCTTCGAGA TCAATTGACA GCTGCAGATC TGGTTCCAAT TTTTAATGGA TTTTTAAAAG 
ACCTCGATGA AGTCAGGATA GGTGTTCTTA AACACTTGCA TGATTTTCTG AAGCTTCTTC 
ATATTCACAA AAGAAGAGAA TATCTTTATC AACTTCAGGA GTTTTTGGTG ACAGATAATA 
GTAGAAATTG GCGGTTTCGA GCTGAACTGG CTGAACAGCT GATTTTACTT CTAGAGTTAT 
ATAGTCCCAG AGATGTTTAT GACTATTTAC GTCCCATTGC TCTGAATCTG TGTGCAGACA 
AAGTTTCTTC TGTTCGTTQG ATTTCCTACA AGTTGGTCAG CGAGATGGTG AAGAAGCTGC 
ACGCGGCAAC ACCACCAACG TTCGGAGTGG ACCTCATCAA TGAGCTTGTG GAGAACITTG 
GCAGATCTCC CAAGTGGTCT GGTCGGCAAG CCTTTGTCTT TGTCTGCCAG ACTGTCATTG 
AGGATCACTC CCTTCCCATG GACCACTTTG CTGTGCATCT CATGCCGCAT CTGCTAACCT 
TAGCAAATGA CAGGGrTCCT AACGTGCGAG TGCTGCTTGC AAAGACATTA AGACAAACTC 
TACTAGAAAA AGACTATTTC TTGGCCTCTG CCAGCTGCCA CCAGGAGGCT GTGGAGCAGA 
CCATCATX3GC TCTTCAGATG GACCGTGACA GCGATGTCAA GTATTTTGCA AGCATCCACC 
CTCCCAOTAC CAAAATCTCC GAAGATQCCA TGAGCACAGC GTCCTCAACC TACTAGAAGG 

cttgaatck: ggtotcittc ctgcttccat gagagccgag gttcagtggg cattcgccac 

GCATOTGACC TCGGATAGCT TTCQGGGGAG GAGAGACCTT CCTCTCCTGC GGACTTCATT 
GCAGC?rcCAA GTTGCCTACA CCCAATACCA GQGATTTCAA GAGTCAAGAG AAAGTACAGT 
AAACACTATT ATCTTATCTT GACTTTAAKG KKWAWKMMWW KCTCAGMSRA TTATAMTTSW 
CWMMRARGSM WYMAAWSCTK SWGCTCYWCC KSRSTGRMKG MMRCTCTAGA AYTRGYRGAK 
CMYYYKSGCT KMWGGAAKKS GGCASGAGCC AGAGACCTGC ATTGCTTTCT CCrGGTTTTA 
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TTTAACAATC GACAAATCAA ATTCTTACAG CCTGAAGGCA GACGTGTGCC CAGATGTGAA 
MAGACCTTC ACTATCflGCC CTAACTCTTC TCTCCCfiGGA MGACTTGCT GGGCTCTGTG 
^ GCCAC5CTOK: CAGCCCAC3CC CRyTOTCTCA ATCGrTTTGTC AOyrGTGCAA ATC3C3GAAfiGG 

AGGGCTirrr acatctccta aacsgacxttoa tcccaacaca actaggatto acttaaactc 

10 TTAAQCGCAG CATATTCCTC TACACATITA CAGAATQGTT GCTGftGTGTC TOTGTCIGAT 
nmCATCC TOC3TCATCAC COGAAQGAAA rTTATTAGAC GTATAATGTA TGTCTOCSTOT 
-rmAACTTO ATCATCATCA GCTCTOAGGT GCAACTTCTT CACATACTGT ACATACCTOT 
GACCACTCTT GGGAGTCCTC CAGrCTTTAA TCATGCTGTT TAAACIOTTG TGQCACAAGT 2160 
TCTCITOTCC AAATAAAATT TATTAATAAG ATCTATAGAG AGAGATATAT ACACTTTrGA 2220 

20 ironTTCTA GATOTCTACC AATAAATOCA ATTTOTGACC TGTAAAAAAA AAAWAAAAAA 2280 
ACTCGAQGOG GGCCCGGTAC CCAAATCGCC GATATGATCT AANCATC 2327 



25 



(2) INFORMATION FOR SEQ ID NO: 108: 



(i) SEQUENCE CHARACTERISTICS: 
9|Q (A) LQ5GTH: 1062 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEtaJESS: double 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTIOl: SEQ ID NO: 108: 

GGCCGCCGAG GCGCAACAGC CCTTCTOICA GCTCTGGGTC CAACCGGACT AGCGAANATC 
TTCCTCATCC TCATCATCCT CTTCCTCATC CCGATCTCGG TCCAGGTCCC TCTCCCCCCC 
ACACAfiGAGG TCGCGAAGGT CCAGCTGTAG TTCCTCTGGA OGnTCTCGAA GATQCTCTTC 

crcmrrrcG tcatcatctt cctcttcgtc ttcctcatcc tcatcatcca orrcrcGAAG 

45 CCGCTCACGA ATCCCCATCC CCCCGCCGGA GRAAGTGACA GGflGGCGGCG GTACAGCTCT 
TATCGTTCAC ATCACCATTA CCAAAGGCAA AGAGTGCTAC AAAAGGAGCG TGCAATAGAA 



1740 
1800 
1860 
1920 
1980 
2040 
2100 



CAGAGGTTCT CCCTTmGG AGAGATIGAG GAGTGCACCA TCCACTTCCG TGTCCAAGGG 
GACAACTACG GCTTCCSTCAC TTATCGCTAT GCIGAGGAGG CATTTCCAGC CATTGAGAGT 
55 GGCCACAAGC TGCGGCAGGC AGATCAGCAG CCCnTCATC TCTGCTTTGG GGGCCGAAGG 
SWGOWCroCA AGAGGAGCTA TOTTCATCTr GACTCCAACC GGGAAGACTT TGACCCAGCA 
CCKWAAAGA GCAAATTTCA ITCTCPTOAC TTTGACACAT TGITCAAACA GGCCCAGAAG 

60 



60 
120 



rrmrnrn^z. OATQCTCTTC 180 

240 
300 
360 

GAAAGAAGGG TCCyTCTTCAT TGGAAAGATA CCTGGCCGCA TGACTCGATC AGAGCTGAAA 420 

50 ^^^^^ m^r-Tsr-'mv-nr- m^TCCAAGGG 480 



S40 
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AACCTCAGGA GGTAACCTTG GGCCCTTCCC TGCTATCCTT TTTCTCCTTT GGAGGTGCCC 
AACCTCCTCC ACCCCCTTCC CCTACTCTAG GGGAGAGAGC TGCTAGTGAG ATGACTGTTT 
5 TATAAAGAAA OXSGAAAAAAG TGAAATAAAA AATATGTTGA ATCAGATTTT TTAAAAGGGG 

TA-nTOmr TTTATAACAG GTATTGAAAC AAGTTAACTT GCATTCCTAT GTAAGATAGG 960 
AGGGGCTCAG GQGATCCCCA GTGTTTGGAA CATAAGTCAC TATQCAGACT AATAAACATC 1020 
AACTAGAGAG NAAAAAAAAA AAAAAAAAAA ATTTAAAAAA CT ^^^^ 



15 
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(2) INFOBMATION FOR SEQ ID NO: 109: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2539 base pairs 
20 <B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIiOGY: linear 



180 
240 
300 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
GAGAGACTCA CACTTCITTT CCATTATCAC TGACGATGTA GTGGACATAG CAGGQGAAGA 60 
GCACCTACCT GTOrTGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 
30 AGGCTTCCTC CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 
AACTCAGAAG TX3GGGATTAA ATATQGAGTA TTGTCGTGGC CAGGCTTACA TTGWCTCTAG 
TCGA-rrrrCT TCCAAAATCA AAGTrGTTGC TTCTAGACTT TYAAGMKMRA TVnCCCCCMAK 

35 

YWAVgCKGAAC AMAMKCIGSW CYTCCWSYGC SKTRRMKRYC GYKSTATRRC WARWKSAKYM 360 

CCYGKKMTCS RRGTAWYTSK TGCAYKAGGG AACAATTGAG GAAGTTTGTT CTTTTTTCCA 420 

40 TOGATCACCA CAACTCCTTT TAGAACTTGA CAACGTAA1T TCTGTTCnT TTCAGAACAG 480 

TAAAGAAAGG GC?rAAAGAAC TGAAGGAAAT CTGCCATTCT CAGTGGACAG GCAGGCATGA 540 

TGCTTTIGAA ATTTTAOrGG AACTCCTGCA AGCACTTGTT TTATGTTTAG ATGGTATAAA 600 

45 

TAGTGACACA AATATTAGAT GGAATAACTA TATAGCTGGC CGAGCATTTG TACTCTGAGT 660 



720 
780 



GCAGIGTCAG ATTTIGATTT CATTCTTACT ATTGTTGTTC TTAAAAATGT CCTATCTTTT 
50 ACAAGAGCCT TTOGGAAAAA CYYCMAQGGG CAAACCTCTG ATGTCTTCTT TGCKKMMSRT 

ARMTITrGAY ATRMARYACT RMMTKSAYrY AAYGRWGTGA CWSGAWAATA TTRAASTYTA 840 

TACAATKAAT YVnTOYTTSM KRMAGMYAAT COGAAAYTGr GGMAAMYAAA CTTGATATTC 900 

AAATCAAACT CCCTCGGAAA TTCCGCAGAG CTCACCAGGG TAACTTGGAA TCTCAGCTAA 960 

CCTCTCAGAG TTACTATAAA GAAACCCTAA GTGTCCCAAC AGTGGAGCAC ATTATTCAGG 1020 

60 AACTTAAAGA TATATTCTCA GAACAGCACC TCAAAGCTCT TAAATGCTTA TCTCTGGTAC 1080 
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CCTCAGTCAT GGGACAACTC AAATTCAATA CGTCGGAGGA ACACCATGCT GACATGTATA 
GAAGTGACTT ACCCAATCCT GACACGCTGT CACSCTGAGCT TCATTGTTGG AGAATCAAAT 
GGAAftCACAG GGGGAAAGAT ATAGAGCTTC CGTCCACCAT CTATGAAGCC CTCCACCTGC 
CTGACATCAA GTTTITICCT AATOTGTATG CATTGCTGAA GGTCCTGTGT ATTCTTCCTG 
TCATGAAGGT TCAGAATCAG CGC3TATGAAA ATGGACGAAA GCGTCTTAAA GCATATTTGA 
GGAACACTTT GACAGACCAA AGGTCAAGTA ACTTGGCTTT GCTTAACATA AATTTTGATA 
TAAAACACGA CCTOGATTTA ATGGTOGACA CATATATTAA ACTCTATACR AKTAMGTCAG 
MGCTYYCTAC AKAYRAYTCM SWAWMTGTGG AAAKVWSSTA I«a!SWGCWKK TAMMRRTMCXS 
GMWWTYYYMK RKTYGAYMYW YGCGWMCGAG AAAAAGCCGT AAGGTGTATG TAGACCACTT 
AATCACTAAA TATCTTTCCC TATAGGACTC CATTGAATAC ATTAGCCATT GATAATCTAC 
CTCTTTAAAT GGCCCCTCTT TCAACTCTCA AGCTTTGAAG ACCTACCTGT TCTTCCAGAA 
GAGAACOTTG AAACTTGCCAT GTTrCCrTTT GCGTGATCTC TGTTGATGGC ACTCTGGAAT 

rorrrccAGT ttaaktcatt ttagacatag catttattat cactgtggat ctctacttgt 

TCQGTCTTAT GAATTCTTTG AAGAATATAT TTPGAAGAGG TGTGGGAGGA AGGAATACAT 
TTTATAAAAT GTTOTACyrGA AGCCCACAAT TGACCTTKGA CTAATAGGAG TTTTAAGTAT 
GTTAAAAATC TATACTQGAC AGTTACAAGA AATTACCX3GA GAAAAGCTTG TGAGCTCACC 
AAACAAGGAT TTCAGTOTAG A-mTGTCrT TCTTGAACTT AAAGAAACAA ATGACAAAGT 
TTCAATGGAA AAGCCTGCTG TTGTTCCACA TCTCGTTGCT GTTTACATTC CTTTGTGGAG 
CCTACATCTT CCTAAGCTTT TTAGCAGGTA TATGTTGAAC ACTTCTGTTT CATGGTTGAG 
ACAGAATCAG AGGCCAIXSGA TACTCACAAC TGATTTGTCT G'mTiTXTC TCTCTCITrT 
TCCATCACTC TTATATACTG CCTCATCTTG ATTTATAAGC AAAACCTGGA AAACCTACAA 
AATAAGTCTT GTOCTTTATC TAGAAAAATA TGGAAAATAT TGCTGTTATT TTTGCTGAAG 
AAAATCAATT ITCTATACTT TATTTCAATC TAAATAAAAT GTGAATrTTG TTWWATTAAA 
AATTWGGSAC AAABTBGK3G GGGCTTCCAAA CHTWTCGHG KAAMTTCTCT WAARMATYTK 
ATAAACMSCT TCACAATTC 
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55 (2> INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1751 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS : double 
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(D) TOPOIXXnf: linear 
(xi) SEQUENCE DESCRIPTKM: SBQ ID NO: 110: 
5 AGCATCAAGC CQATCGCCCT GGTCCSCCWST ACCGfrCCTGG GCCTGC?rGCA AAACATCCGT 
GCCjnTOXSS GGATCCTCCT GC3TCC5TCTAC TACGTATTTG CCATCATTGG GATCAACTTG 
TTTAGAGGCG TCA-nXSTOGC TCTTCCTGCSA AACflGCAGCC TOQCXX:CTGC CAATQCSCTOG 
CSCGCCCTCTC GCSAiXTTCGA GCAGCTOGAG TACTQGC3CCA ACAACTTCGR TGACTTIGOG 
GCroCCCTCG TCACTCTOTO GAACTTCATC GTGGTGAACA ACTOGCAGGT GirTCTGGAT 
15 C3CATATCGGC GCTACTCAGG CCCGTCGTCC AAGATCTATT TTCTATTOTG GTC3GCTOGTG 
TCCyrcTOTCA TCTQQCTCAA CCICTTOTre GCCCTGATTC TGGAGAflCTT CCTTCACAAG 
TOC3GACCCCC QCAGCCACCT GCAGCCCCTT GCTGGGACCC CAGAGGCCAC CTACCAGATG 
ACK?roGAGC TCCTOTTCAG GGATATTCTG GAGGAGCCCG GGGAGGATGA GCTCACAGAG 
W3GCTCAGCC AGCACCCGCA CCTOTCGCTG TOCAGGTGAC GTCCXSGGCTG CCATCCCAGC 
25 MGGGCGGCA GOAGAGAGAG GCTCOCCTAA CACAGGTGCC CATCATGGAA GAGGOGGCCA 
TOCTOTCGCC AGCCAGGCAG GAAGAGACCT TTCCTCTGftC GGRCCACTAA GCTQGQGACA 
OSAACXAACT CCmOOeTO TOGCOCAACA ACCATCTACA GAACAQCTGC IGGTGCTTCA 
^® GGGAQGCGCC GTGCCCICCG CTTTClTrTA TAGCTGCTTC AGTGAGAATT CCCTCGTCGA 
CTCCACAGGG ACCTTTCAGA CAAAAATGCA AGAAGCAfiCG GCCTCCCCTG TCCCCTGCAG 
35 CTTCGGTCOT GCCTTTCCTC CCX3GCAGCCC TIGGGGACCA CAGGCCTGAC CAGGGCCTQC 
ACAGOTTAAC CGIGAGICTC TCTCATCTAT TCACAGCTGG GAATGATACT AATACCTCCX3 
A-nTTAGCXX: AGCACCACAG GCTAaSTrCC AGTITrrCTC TCmCCATA GCTGTAAGGC 

ccmrTOGG AA-TCGrrcrc ATrciccrrA ATCTATTArr oGGTCAGrrr tcctgcatct 

CCCCAGCCTC CCATCACreC CACCCACTCC CCACAGAGAT GCCCTGCTCA TCCGACTGGG 
45 GCTTTCACrc CCACACrore TACCCCrCrr GTGfTGGACGC CCTGCPGCCA AAACCrrCAG 
CAAACAGCrr TCCAAATCGA ACnTCTCACT GTCAGGCCTT TACAATCAGC AACAGCAAAA 
TCTACATOCT GCTCAGGGTC CTCCCTCATT AAGATGCAAT AAATATGTTAA GTACATAAAA 
ACAGCAATAG AAGAAAOOTA ATCCTTTATT CTCAAATATG ATOTCTACAT AGAAAAGCCA 
AAATTATTAA GAATAGTAAG AATTCACCCA GCACmGGG AGGCCGAGGC GGGTGGATCA 
55 IGAGGTCAGG AGATCGAGAC CATCCIX^ AACAGGOreA AACCCCGrTCT CTACTAAAAA 
TACAAAAAAT TCGCCGGGCG CAGTCGCGGG CGCCTGTGGT CCCAGCTACT GGGGAGGCTG 
ftGGCAGGAGA AIGGCGIGAA CCCOSGAAGC «5AGCrrGCA GTGAGCCGAG ATTCCGCCAC 

60 



PCT/OS98/04493 

WO 98/39448 

334 



O^^CAOrCCGC AOKXrAOCCT GOKX^AC^ OCOP^ OTC.^ A-A^AAAAAA 
AAAAAAAAAA A 

5 

(2) INFORMATION FOR SBQ ID NO: Xll: 

If) (i) SEQUENCE CHRRACTERISTICS: 

(A) i;eNG?IH: 1117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEraiESS: double 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
^^TOTKnXX. TC^rr ^AATT CTKATTATAG AGICTCrn^ «^1«r 

20 cxzATAW^A a«xx:aaaca ACATixTicrr rm^^ 

C,,^^ t^AATATC O^TAAGACGG CAGCrACAOC AGGCA^- OOC^^ 

^.^ac^'T ACC^Gcrrrr ArrcAi«:TA aacaacaata cati«agcag a«:cacccag 
;^,^a«:a TAACCGon. <^^«n«rcc AATCr^ 
«:,^A .«.c^ 

30 C^I^AACAC T«.I^T C^ATACCX^ ATAAAGA.OC Crr^^^ 

CAGGAGCK^T CACGOGAAOT CTmTAGaA TAAACOTi^ CCVOC^ CIX^.^ 

^cccn. c^^GGCAcrc ci^rACOAOO ccrocro.'^ ccattk^ 



1740 
1751 



60 
120 
180 
240 
300 
360 
420 
480 
540 



660 
720 
780 
840 



;«^;.CKn... ^C^C^ CAOGAAAC^ AACAG^^ 

TAAA«n«GA AGAOrOOAAA OGCACACTAC ^^CHTACTGA OCACCTCCCr GAOAAAA™^ 
40 AAA^ACnrr ACA«^T GAACCn^^OA A^-r^M OA^^ 

.CCO^AO AAACCCriCA C^AATAGATA AACAAOACAA GGACraA.AG r.C^ 

^c.x:a cio^ oaagooaoct occa^^ a^^- acaoacaccc 

cAcrcm^ c^"-'^- '^^^ 

ercT^ TTKmrrcT rrrrAAcrAA gaa«x»gct oi^actct CAcrrrAcrr 

50 A^AAAT rrAAATACAT ACI^AO.^ C^A^TAATCT A3CAATATAT «:AT«^^ 
ATATATCCAC CCACCTAGAT TTTAAOCAC^ AAATAAAAO. TTTCGCAA^. GATTAAA^ 

gaattitaca ottaaaaaaa aaaaaaaaaa aaaaaaa 

55 



960 
1020 
1080 
1117 



(2) INFORMATION FOR SEQ ID NO: 

60 
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10 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

GGCAGAGOTT TTCTTATATT TTAAGTAAAT TTAAAGTGGC TATCAGAATA TTTATTCTTG 60 

TTTGAGACTA CCAACATAAC TACGTGTTGA AGGTGCTTCA CAGAGAATAT ATTGCCTTTA 120 

ATOTGAAATA ATITrCACCA ATOITCCTAA CTTTAATAAA GTATAAAATT TGTAGAATAT 180 

15 TCAGTTAAGT ACTTCGTAAC CCTTTTCTAT TTTAGTAAAA CTTAATGCAT GTTTACTTTT 240 

TTITCAAAGA TGCAGACAAT CTCTTTGAAC ATGAATrGGG GGCTCTCAAT ATGGCTGCAT 300 

TACTACGAAA AGAAGAAAGA GCAAGTCTTC TPAGTAATCT TGGCCCATGT TGTAAGGCGT 360 

TOTGCTTCAG ACGGGATTCT GCAATTCGAA AGCAGCTTGT TAAAAATGAG AAGGGCACCA 420 

TAAAACAAGC TTACACGAGT GCTCCAATGG TAGACAATGA ATTACTTCGA TTGAGTCTTC 480 

25 GGTrATTTAA GCGGAAGACT ACTTCCCATG CTCCAGGACA TGAAAAGACT GAAGATAATA 540 

AACTTTCACA GTCCAGTATC CAACAGGAAC TGTGTGTGTC TTAAGACCGA AGTTACAATA 600 

TOGTATTTTT GGTACTOTCT TCCTTCAGCA GTGCATATTC TTTTGCAAAG TTCTTTGGTT 660 

TCACAAGCAT TAGTCACAAA GGCAGAAAAG ATTTATCAGC CATGCTAAAA GAGTGAAGAA 720 

mTCATCTT TAGAGACACT AGTTTTGGCC AACTTAAGAT TrTACGTTAA TTTTTACATA 780 

35 GTATITCACA CTCATGCAAA ATAATGTGAA AACATCTAGA TTTAGTAGTT TATTCTGCGC 840 

CTTTTOrrAA AACTGAAGAT TTIGGAAAAT GGTTGTCACT GCTCTTCCAG CCTATGAATA 900 

TTTTTOTGAA ATCGAACCAT GGATITATGT CTGGATCATC CATACAGAAC CAACAATTTT 960 

ATTCAAAAAC AATGTCTTCA TCAAAGTAAT TGCTCACATT GTGCAGTACT ATGTTGTACA 1020 

GACCACGIX3A AAGGGAATCC TOGTCTAGCT GGCGTGGTAT GTTTATAQGC GAATTTCAGC 1080 

45 AGAAGGAAGC CAAAATAGTT TTTTCCTTTT GAAAGTTTTT TAAAAATTAT TTCATGGGTC 1140 

TrmrrrAA ttaatatctg tgcattotta caatgtatgt tgggatgtct tttgacccta 1200 

AATGcrrrrr ttcttatcag agattgtgta ctatttttat ttttaataaa tgtatcttcc 1260 

CTTTTMAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 1313 



30 



40 



50 



55 

<2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1654 base pairs 
60 (B) TYPE: nucleic acid 



wo 98/39448 



336 



PCT/US98/04493 



600 
660 
720 
780 



(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
^ ACAGGGACAG AATACTTTCT TTCCITCCTT CAAGTACAAG AAGCSCTTTCT CTACCATTTG 60 
CGTCTACACT TTAiriTAAA AGCTATCCTT TTCTAGTAGT ATTTTATCAT GGCAATQC3CA 120 

10 TCATGACAAC AACAC3TCTTT CATTACAGAC TC5AAQGGAAG CATGTCCTTA CTTAAAATAG 180 
TTCTGCTACT TTCCCTCCTA TTATAAOGAA ATTITACAGA TTCTAAAAAT ACCTTAATTT 240 
TTCTITCATT •PITATTTTAC CAAGTCftCAA ATGrCTTTTT GATCTTTTGA CSAATTGrrTCT 300 
CATAGAATCA CAAATACTGA CATTTCATTA GATGATTATT TTCCTAGAAT CCCCAAAGAG 360 
CAGTGGCAGT CCNTGGCTTG GTTGAAGCTA GAAATTITCC TGCCCCTGGT GACCTGGTAA 420 

20 GCCTCCTGCT CGGAACCGTC TGAGTGGGTG AGGAAGATGA GAGATGGTCA GATGGAAGAG 480 
AGRAATACAT GAACTGCTCT GGCCTCTCTG GTTCTGTTCT TGGCCCAGAG TTTTTGAAAA 540 
GCAGCGGANA TNGACTGACT TCACATOCTC AGCTITCrCA GCCTTTTGTT TATrTTGTTG 

25 

TCCTTAGATT TCCCTGTTCT AAAAGGGGCA AGAAAAGTAA CTCATCATCT CTAACACACC 
ATGGCAGCTT AGCCAGGTAG TCTTAGTGGT GGTGTTTAGG CATAAGATAT GCT6ATCATC 
30 ACTCTCAGGC CACAGTTrCC TTCACTAATC GTCCAGCTTG AGTGTTCTGT TCTCTTCCTG 

CCCATTTCCT TGAACCTCCT GCTCTAGCCT TGGCGGAGGG AGAGTGCTAT TTGCTTTrGT 840 
TCTCCCTCTC TCTTAGGAAA AGCCATCTTT AATATAGTTC TTCACCACTG TTGGGGTIOT 

35 

TiTCTCATrr 'I'lrrmu rr ccgaagaact cctggttgtt attggatttt gtattttaat 

ACAAATTATT GAATTTTATA AGCTTGTACA CAATATTTAA TTAGTGTGAA AGGAAACAAA 

40 GAATGCAGGA AAAATAATTT AATATCAACC TCAGTTGACA AGGTGCTCAG ATTATTCAAT 1080 

TCGQGATCCT CCTTTTGTTA GGTTTTTGAG ACAACCCTAG ACCTAAACTG TGTCACAGAC 1140 

TTCTGAATGT TTAGGCAC?rG CTACn-AATTT CCTCGTAATG ATTCrGTTAT TACTTTCCTA 1200 

45 

TTCTrrATTC CTCTTTCTTC TGAAGATTAA IGAAGTrCAA AATTGAGGTG GATAAATACA 1260 

AAAAGGTAGT GTCATAGTAT AAGTATCTAA GTGCAGATGA AAGTGTGTTA TATACATCCA 1320 

50 TTCAAAATTA TGCAAGTTAG TAATTACTCA GGGTTAACTA AATTACrTTA ATATGCTGTT 1380 

GAAYCTACrc TGTTCCITCG CTAGAAAAAA TTATAAACAG GACTTTGTAG TrTGGGAAGC 1440 

CAAATTGATA ATATTCTATG TTCTAAAAGT TGGGCTATAC ATAAATTATT AAGAAATATG 1500 

GATTTTTATT CCCAGGATAT GGrrGTTCATT TTATGATATT ACGCAGGATG ATGTATTGAG 1560 

TAAAATCAGT TTTCTAAATA TGTAAATATG TCATAAATAA ACAATGCTTT GACTTATTTC 1620 

60 CAAAAAAAAA AAAAAATAAA NTTCGAGGGG GGGC ^^^^ 



900 
960 
1020 
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5 (2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOI : SEQ ID NO: 114: 
15 GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 



GGGTTGCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 180 

20 

CTGGMCCTCA TCTTCCTGCG CCGACCTGCG CQGGGTAAGG GGWAGTTTCA GACTGTGAAG 240 



GACGTCGTGC TGGACTGCCT GTTGGACTTC TTACCCGAGG QGGTGAACAA AGAGAAGATC 300 
25 ACACCACTCA CGCTCAAGGA AGCTTATOPG CAGAAAATGG TTAAAGTGTG CAATGACTCT 360 



GACCGATGGA GTCTTATATC CCTGTCAAAC AACAGTGGCA AAAATGTGGA ACTGAAATTT 420 

GTQGATTCCC TCCGGAGGCA GTTTGAATTC AGTGTAGATT CTTTTCAAAT CAAATTAGAC 480 

30 

TCTCTTCTGC TCTTTTATGA ATGTTCAGAG AACCCAATGA CTGAGACATT TCACCCCACA 540 

ATAATCGGGG AGAGCGTCTA TGGCGATTTC CAGGAAGCCT TTGATCACCT 1TGTAACAAG 600 

35 ATCATTGCCA CCAGGAACCC AGAGGAAATC CGAGGGGGAG GCCTGCTTAA GTACTGCAAC 660 

CTCTTGGTGA GGGGCTTTAG GCCCGCCTCT GATGAAATCA AGACCCTTCA AAGGTATATG 720 



TGTTCCAGGT TTTTCATCGA CTTCTCAGAC ATTGGAGAGC AGCAGAGAAA ACTGGAGTCC 780 

40 

TATTTGCAGA ACCACTTTGT GGGAATTGGA AGACCGCAAG TATGAGTATC TCATGACCCT 840 

TCATGGAGTG GTAAATGAGA GCACAGTGTG CCTGATGGGA CATGAAAGAA GACAGACTTT 900 

45 AAACCTTATC ACCATGCTGG CTATCCGGGT GTTAGCTGAC CAAAATG?rCA TTCCTAATGT 960 

GGCTAATGTC ACTTGCTATT ACCAGCCAGC CCCCTATGTA GCAGATGCCA ACTTTAGCAA 1020 

TTACTACATT GCACAGGTTC AGCCAGTATT CACGTQCCAG CAACAGACCT ACTCCACTTG 1080 

50 

GCTACCCTGC AATTAAGAAT CATTTAAAAA TGTCCTGTGG GGAAGCCATT TCAGACAAGA 1140 

CAGGAGAGAA AAAAAAAAAA AAAAAAAAAA A 11*71 

55 

(2) INFORMATION FOR SEQ ID NO: 115: 
60 (i) SEQUENCE CHARACTERISTICS: 
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15 



(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 115: 
OOTCTGCGCC GGAAGTGCAT GAGCTGCCGA TGTQCSTGCTr AGrGATTCfCG GTTTCGGTCG 
10 CTCTCCCGTG TTTCCCGGGC TC3GGTATTTG CCTCGCACCA TGGCGCCCAA QGGCAAAGTG 
GGCACGAGAG GGAAGAAGCA GATATTTGAA GAGAACfiGAG AGACTCTGAA GTTCTACCTG 
CGGATCATAC TCGGGGCCAA TGCCATTTAC TGCCTTGTGA CGTTGGTCTT CTTrTACTCA. 
TCTGCCTCAT TTrGGGCCTO GTTGGCCCTG GGCTTTAGTC TGGCAGTGTA TGGGGCCAGC 
TACCACTCTA TGAGCTCGAT GGCACGAGCA GCGTTCTCTG AGGATGGGGC CCTGATQGAT 
20 GC?rGQCATGG ACCTCAACAT GGAGCAGGGC ATGGCAGAGC ACCTTAAGGA TGTGATCCTA 
CTCACAGCCA TCCT3CAGC?r GCTCAGCTGC TTCTCTCTCT ATGrCTCSGTC CTTCrGGCrr 
CTQGCTCCAG GCCGGGCCCT TTACCTCCTG TGGGTGAATG TGCTGGGCCC CTCGTTCACT 
GCAGACA(M<5 GCACCCCAGC ACCAGAGCAC AATGAGAAAC GGCAGCGCCG ACAGGAGCGG 
CGGCAGATCA AGCGGTTATA GCCATTCACA TTGTGGCCAC AGGCCACTGG CCCTGGGTGG 
30 CTCTCTCAGG GTCCACAGCC CCTCATGCCT GGAGCAAT6A GQGTCTAGTC CAGGGGCCAA 
AAGCAGTCIG AGGTAITOGG TATACTTATA CTCTATAGGG TCGTTGAATA AATGGCTTAG 
AATCTCAAAA AAAAAAAAAA AAAAAACTCG AGGGGGQCCC GGTACCCAAT TTCNCCTANA 840 
AT 



25 



35 



40 



(2) INFORMATION FOR SEQ ID NO: 116: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1640 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
GGCACGAGGC GGCGGCAGCG GTCGCGGCGG CGCCCCCCGG CGGGAGCCGT TCCCTTTCCC 
GTCQGGGAGC GCGGGGYCGG GQCCCAGGGG ACCCCGGGCC ACGGAGAGCG GGAAGAGGAT 
55 GGATTCCCCG GCCCTCCCCC CCGGA-TOGAA GAAGGAGGAA GTGATCCGAA AATCTGGGCT 
AAGTGCTOGC AAGAGCGAT3 TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 
GCCTCAOTPG GCAAGGTACC TGGGAAATAC TGTTGATCTC AGCAGTTTTG ACTTCAGAAC 

60 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 



842 



60 
120 
180 
240 
300 
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10 



15 



20 



25 



30 



35 



40 



45 



TOGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTGCGAA ACGATCCTCT 360 

CAATCAAAAT AAGQGTAAAC CAGACTTGAA ATACAACATT GCCAATTAGA CAAACAQCAT 420 

CAATTTTCAA ACAACCX3GTA ACCCAAAGTC ACAAATCATC CTAGTAATAA AGTGAAATCA 480 

GACCCACAAC GAATGAATGA ACAGCCACGT CAGCTTTTCT QGGAGAAGAG GCTACAAGGA 540 

dTTAGTGCA TCAGATGTAA CAGAACAAAT TATAAAAACC ATGGAACTAC CCAAAGGTCT 600 

7t:AAGGAGTT GGrTCCAGTAG CAATGATCAG ACCCTTTTTAT CTGCTGTTGC CAGTGCTTTG 660 

CACACAAGCT CTGCGCCAAT CACAGGGCAA GTCTCCGCTG CTGTGGAAAA GAACCTGCTG 720 

TTTGGCTTAA CACATCTCAA CCCCTCTGCA AAGCTTTTAT TOTCACAGAT GAAGACTCAG 780 

GAAACAGAAG AGCGAGTACA GCAAGTACGC AAGAAAITGG AAGAAGCACT GATGGCAGAC 840 

ATCrrGTCGC GAGCTGCTGA TACAGAAGAG ATGGATATTG AAATGGACAG TGGAGATGAA 900 

GCCTAAGAAT ATGATCAGGT AACTTTCGAC CGACTTTCCC CAAGAGAAAA TTCCTAGGAA 960 

ATTGAACAAA AATGTTTCCA CTGGCTTTTG CCTGTAAGAA AAAAAATGTA CCCGAGCACA 1020 

TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTAGGACTT AAAATTAGTC TTTTGTAATA 1140 

TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGGAAATG 1200 

TAGCACTTAC GTAAAACATT TGTTTCCXrCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 

CTTTATTGAA AGGGGACACC TGTACATTCT TCCATCGTCA CTGTAAAGAC AAATAAATGA 1380 

TTATATTCCA CAGAAAAAAA AAAAAAAAAW MWSTYGARRR GSRGCMCRSW AYMMARWWCC 1440 

CCWMRTWRGS MKTCSTMrKA YTTACATTCA ACTCTGATCC CGGGGCCTTA GGTTTGACAT 1500 

GQGAGGTOGG AGGAAGATAG CGCATATATT TGCAGTATGA ACTATTGCCT CTGGGACX5TT 1560 

GTGAGGAATT GTGCTTTCAC CAGAATTTCT AAGGATTTCT GGCTTAAATA TCACCTAGCC 1620 

TGTGC?rAATT TTTnTCCCT ^^^^ 



50 (2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 952 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDE^ESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



60 



TGAATTTAGN AAACACTTTG GAAAACTCAT AACCTCATCA GAAACTGCCT TTAGCCACAC 



60 
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TCCTGACCTT CTAGATGAGT AACAAAAAAA TGAAATAAGT TCTTCGAAAT TAAGCCATTT 120 

ATTTTAATTT GCTATTTTTT TCAATGTTCT AGGTATCTTT AAA1TTGTTA TTGTGGAATC 180 

5 

ATTTTCCTGC CAGATACCTT TATCAAAATT ATTGGCCTCA TGAGAGCTGA AGTAAGTCAG 240 

CTTTTTGGTG AAC1TTAGTG GACTTCTCTG AGATrGTAGT TGTACTTTGT ATCTCTAAAT 300 

10 CTAAAGATAG TTmTAAAA CTOXAAAGA AAATCTGCTC TCCTTTCTGA TCTAAAAACT 360 

CATCITTGGG GTAAAGAGOT AAGTGTCCAA AGGTTGTCAC AGTTCATGAG GTCAGAGGGA 420 

GCTAGCCTGG CACCTGGACT CTGCCCATCC ACAGCTGACA GATTCCAACA GAAGTGrPATT 480 

15 

TAAATTCrCC AGTAGACAAT GCTGGGTAAG GGAGGGQGTA GGGCTGGGTT ATTAAGATAC 540 

AGGCTGCTGT ATTTTACATT GGTTGTGGGG GAAGGGGAGC CTGGAGAAAA CAAAGTCACT 600 

20 ATTCCCTTTT TTGAAACAGG AAAAAAAATT ATTTTTTGTT CAGTAAAAAT GGTAGAGAAT 660 

TCCAATGTCC CTAGCCACAA GGGACCAGTT CCACTGAGAA GTGAACAGTG GGAACTCAAA 720 

ATTTCAGAAA CATTGGGGGA AGGGAAAATT GGCTTTCTCT TAATTGGCAG ATGTTCCAGT 780 

25 

GGGGSGGGGG GGCTCTGTTT TTGTTGGGAT GTCTTATGTT GTATGTACGC ATATATGGAC 840 

CQGAGTCTGC TGAGTTTATA AGGTTCCAAA AATATGGTAA AATCTTGGTT TTTGTTAATT 900 

30 TATCTCAATA AAAGCCCACT GGRACTCCAA AAAAAAAAAA AAAAAAAAGA NN 952 



35 (2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1256 base pairs 
<B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

45 GACGTCATAG GTAAACAGGC TCTGTATCCG TGGCAGCGGC CGTGGCAGGC TGGCTGGGTA 60 

CCGGCTGTCG CTGACCCAGG AGAAGCTGCC TGTCTACATC AGCCTGGGCT GCAGCGCGCT 120 

GCCGCCGCGG GGCCGGCAGC TGAACTATGT GCTCTTCAGG GCGGGCACCG TGTTGCATTC 180 

50 

ATCTTTGTAC CCCCAGCATC TAGCAGTGTT GGCATGTAGT AGGCACTCAA GAAATGTGTG 240 

TTGAATGAAC GATGCCTGTG ACAAGCAAGC GGACTTTATT CTTTCCTGAC CCTTGCTCCT 300 

55 ATGACACACC TCCTCCTGAC TGCCACTGTC ACTCCTTCAG AGCAGAACTC CTCTAGGGAA 360 

CCTGGATGGG AAACAGCCAT GGCCAAGGAC ATCCTGGGTG AAGCAGGGCT ACACTTTGAT 420 

GAACTGAACA AGCTGAGGGT GTTGGACCCA GAGGTTACCC AGCAGACCAT AGAGCTGAAG 480 

60 
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GAAGAGTGCA AAGACTTTGT GGACAAAATT 
ATTGAGCTTG TTGATCAACT TGCA7VAAGAA 
5 GCTCGGAACT TQCTCAAATC TATAGCAAflG 
GCXXTTAATAG CAGAAAAGAA AATGCAGCTA 
TGTAAAGTAG AAGCAGAACA AAAT6AATTT 

10 

GAAAATTTCG CTTTTATAGT AGGAAGGCAA 
ACCTCTGTAG CATTCCAGCG GCTTGACCAA 
15 GAATGCAGCC CCCTGAAGAC AGCACTACAA 
ACAGCTGCTG CTGGTGGCCC TGCAGTGTAC 
AGCAGTTTGT AAACTTTCAT CCTTTTTTTT 

20 

AATAAATTTT TGTTTTCAAA TGGCTTGATG 
TTTAACTCCT CATGAGAGAA CCCTGGATTC 
25 CAGTGGTCAG CAGCTACXTTT TNATTTGGAT 



341 

GGCCAGTTTC AGAAAATAGT TGGTGGTTTA 540 

GCAGAAAATG AAAAGATGAA GGCCATCGGT 600 

CAGAGAGAAG CTCAACAGCA GCAACTTCAA 660 

GAAAGGTATC QGGTTGAATA TGAAGCTTTG 720 

ATTGACCAAT TTATTTTTCA GAAATGAACT 780 

AACAAAAAAA AGCCTCTCAA AACCAAAAAA 840 

TGACCTATGT CACAAGAGGT QGCGTGTAAG 900 

GTCTGGGGGA GCCAGTTTTA ACATCAGTGC 960 

GTTCTCACCT CTTATGCTTA GTTGGAACTA 1020 

GTAAATTCAC AAAGCTrTGG AAGGAGAAGC 1080 

TACCTTTTTT CCTGTTGCTC TTGAAATATG 1140 

TCTATCCXXT AGTCCACAAA ACAAACCAGG 1200 

CACACACGTG AGTCAGACAG TACCAC 1256 



30 (2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEO^ESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

40 GGCCGTAGCA GCCGGGCTGG TCCTQCTGCG AGCCGGCGGC CCGGAGTGGG GCGGCGGCAT 60 

GTACCTTCCA CATTGAGTAT TCAGAAAGAA GTGATCTGAA CTCTGACCAT TCTTTATGGA 120 

TACATTAAGT CAAATATAAG AGTCTGACTA CTTGACACAC TGGCTCGAGC AAACATGAAC 180 

45 

GTTGGAGTTG CCCACAGTGA AGTGAATCCA AATACCCGTG TCATGAACAG CCGGGGTATG 240 

TGGCIX3ACAT ATGCATTCGG AGTTGGCTTG CTTCATATTG TCTTACTCAG CATTCCCTTC 300 

50 TTCAGTGTTC CTGTTGCTTG GACTTTAACA AATATTATAC ATAATCTGGG GATGTACGTA 360 

TTTTTGCATG CAGTGAAAGG AACACCTTTC GAAACTCCTG ACCAGGGTAA AGCAAGGCTC 420 

CTAACTCATT GGGAACAACT GGACTATQGA GTACAGTTTA CATCTTCACG GAAGTTTTTC 480 

55 

ACAATTTCTC CAATAATTCT ATATTTTCTG GCAAGTTTCT ATACGAAGTA TGATCCAACT 540 

CACTTCATCC TAAACACAGC TTCTCTCCTG AGTGTACTAA TTCCCAAAAT GCCACAACTA 600 

60 CATGGTGTTC GGATCTTTGG AATTAATAAG TATTGAAATG TTTTGAAACT GAAAAAAAAT 660 
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TTTACAGCTA CTCAATTTCT TATAAGGAAG GAGTGGTTAG TAAACTGCAC TGTTTCTSTG 
ATAATOIX3AA ATCAGAAGTA TTTACArTGG AGGGCCAATG GCTGGTCCTT CAAGTGCTGT 
TTTGAACTGC AGATTTCCAT TAAATGATGC CTCTGTTTAA TACACCTGGT ACATTTCTGA 
AGAGGGGCTT TATAAGCAGG CTGGGCAGGC CCAGCTTATA AGTTAAAGGG CATCACAGTG 
10 AQQGTCTACT AGATAAATTC AAGGAAATAA GAGATTTGTA AGAAACTAGG ACCAGCTTAA 

CTTATAATCA ATOGGCATTO TGTTAAGAAA AGAACATTTC CAGTCATTCA GCTGTGGTTA 1020 
TTTAAAGCAG ACTTACATOT AAACCXK3AAT CCTCTCTATA CAAGTTTATT AAAGATTATT 1080 
•nTATTACCG TAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAANA 1140 
CAN 



15 



20 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 120: 



720 

780 
840 
900 
960 



1143 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LHJGTH: 1782 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 
CAGGCCCCGG CCCCCCACCC ACGTCTCCGT TGCTGCCCCG CCTGGGCCRG GCCCCAAAGG 60 
35 CAAGGACAAA GCAGCTGTCA GGGAACCTCC GCCGGAGTCG AATTTACGTG CAGCTGCCGG 120 
CAACCACAGG TTCCAA6ATG GTITCCGGGG GCTTCGCGTG TTCCAAGAAC TGCCTGTGCG 180 
CCCTCAACCT GCTTTACACC TTOGTTAGTC TGCTGCTAAT TGGAATTGCT GCGTGGGGCA 
n<;GCTTCGG GCTCATTTCC AGTCTCCGAG TGGTCGGCGT GGTCATTQCA GTGGGCATCT 
TCTICTrCCT GATTGCTITA GTGGGTCTGA TTGGAGCTGT AAAACATCAT CAGGTGTTGC 
45 TATmTTTA TATGATTATT dOTTACTTC TATTTATTGT TCAGTTTrCT GTATCTTGCG 
CTTCTTTAGC CCTGAACCAG GAGCAACAGG GTCAGCTTCT GGAQGTTGGT TGGAACAATA 
CGGCAA<?rGC TCGAAATCAC ATCCAGAGAA ATCTAAACTG CTGTGGGTTC CGAAGTGTTA 540 
ACCCAAATGA CACCTGTCTC GCTAGCTCTG TTAAAAGTGA CCACTCGTOC TCGCCATGTC 600 
CTCCAATCAT AGGAGAATAT GCTGGAGAGG TTTTGAGATT TGTTGGTGGC ATTGGCCTGT 660 
55 TCTTCAGTTT TACAGAGATC CTCGGTGTTT GGCTGACCTA CAGATACAGG AACCAGAAAG 720 
ACCCCCGCGC RAATCCTAGT GCATTCCTTT GATGAGAAAA CAAGGAAGAT TTCCTTTCGT 780 
ATTATCATCT TGTTCACTTT CTGTAATTTT CTGTTAAGCT CCATTTGCCA GTTTAAGGAA 840 

60 



240 
300 
360 
420 
480 
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343 



10 



15 



20 



25 



30 



GGAAACACTA TCTC3GAAAAG TACCTTATTG ATAGPGGAAT TATATATTTT TACTCTATGT 
TTCTCTACAT GirTTTTTCT TTCCCTTGCT GAAAAATATT TGAAACTTGT GGTCTCTGAA 
GCTCGCTGGC ACCTO3GAAT TTACTGTATT CATTGTCGGG CACTGTCCAC TGTGGCCTTT 
CTTAGCArrT TTACCTCCAG AAAAACTTTG TATQGTACCA CTGTGTTGGT TATATGC3TGA 
ATCTGAACGT ACATCTCACT GGTATAATTA TATGTAGCAC TGTGCTGTGT AGATAGTTCC 
TACTGGAAAA AGAGTGGRAA TTTATTAAAA TCAGAAAGTA TGAGATCCTG TTATGTTAAG 
GGAAATCCAA ATTCCCAATT TTTTTTGGTC TTTTTAGGAA AGATGTGTTG TGGTAAAAAG 
TCTTAGTATA AAAAIXSATAA TTWACTKGTA GTCTTTTATG ATWACACCAA TGTATTCTAG 
AAATAGTTAT GYCYTAOGAA ATTCTCGrTT AATTTTTGAC TTTTACAGGT AAGTGCAAAG 
GAGAAGTOGT TTCATGAAAT GTTCTAATGT ATAATAACAT TTACCTTCAG CCIKXATCAG 
AATQGAACGA CTTITCAGTA ATCAGGAAGT ATATCTATAT GATCTTGATA TTGTTTTATA 
ATAATTTCAA GTCTAAAAGA CTGCATrTTT AAACAAGTTA GTATTAATGC GTTGGCCCAC 
GTAGCAAAAA GATATTTCAT TATCTTAAAA ATTGTTAAAT ACCGTTTTCA TGAAAGTTCT 
CAGTATTCTA ACAGCAACTT GXyAAACCTA AGCATATTTG AATATGATCT CCCATAATTT 
GAAATTCAAA TCGTATTOK5 TGGCTCTGTA TATTCTGTTA AAAAATTAAA GGACAGAAAC 
C'PrTCTTTGT GTATGCATGT TTGAATTAAA AGAAAGTAAT GG 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1782 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 121: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



GATCTCTCAG ATGCACIGCT ACCTGGTACT GCTTTCAGTG TGTTCCCCCT CAGCCCTCCG 



60 

120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
GTK3GCTCCA GATTTGTCGT GCGTTCTGAG CCGTCTGTCC TGCGCCAAGA TGCTTCAAAG 
TA1TATTAAA AACATATOGA TCCCCATGAA GCCCTACTAC ACCAAAGTTT ACCAGGAGAT 
50 TTOGATAGGA AIGGGGCTGA TGGGCTTCAT CGTTTATAAA ATCCGGGCTG CTGATAAAAG 180 
AAGTAAGGCT TTGAAAGCTT CAGCGCCTGC TCCTGGTCAT CACAACCAGA TrTACTTGGA 240 
GTACATOTGA AAGAAAACC3T CAGTCTGCCT GTAAATTTCA GCAAGCCGTG TTAGATGGGG 300 
AGCGIGGAAC GTCACTCTAC ACTTGTATAA GTACCGTTTA CTTCATGGCA TGAATAAATG 360 



420 



60 GCGIGTCAGG CATACTCTGA GTAGATAATT TGTCATGCAG CGCATGCAAT CAGAATCTCA 480 
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CTCAGCCACC CATCATTGTG AAATAATTAC CTCAGTTCTA CAGGACTTGG TGATCAGGAT 540 
CCAGGCACTC ACTTGTATTC TACTGCTCAA TAAACGTTTA TTAAACTTGA AAAAAAAAAA 600 
AAAAAAAAAA 



10 



20 



30 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

GGTACGCCTG CAGGTACCGG TCCGGAATTC CGGGTCGCCC. ACGCGTCNGG CCACGCGTCC 60 

ACCCACGCGT CCGSCCACGC GTCGGAGCCG AGCCGGACTG GTCAGGATGA TCAOGGACGT 120 

Jt5 GCAGCTCGCC ATCTTCGCCA ACATGCTQGG OGTGTCGCTC TTCTTGCTTG TCGTTCTCTA 180 

TCACTACGTG GCCGTCAACA ATCCCAAGAA GCAGGAATGA AAGTGGCGCT TTCTCCGCCC 240 

CAQGGTTCCA GGACATAGTC TGAGGCAAGA TGGAGGGTAT GAGGGGCCTT CACACTTCAC 300 

TTCATCCCTT CTACCCATCA CAACATACAA AGCAACTACA CCTGGATTTT TCCAAACAAC 360 

TTTTATTTCC TCAGACKnT CCTTAATCCT ATGGAACAAG AAGCTGCCAC TGAATAGGGC 420 

35 CCAGTATAGG GGCTIGCTTT TCTACTCCCT CCCCCCAATA TAAAAATATA GACTTTTTAA 480 

AAAAAAAAAA AAAAANTTCG NGGQGGGSCC GGTACCCATC CCCCTA 526 



(2) INFORMATION FOR SEQ ID NO: 123: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 2081 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEM^ESS: double 
(D> TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

TGTACCGGTC CGGAAATTCC CGGGTCGACC CACGTCGTCS GGQGAACATG GCGGCTKCGG 60 

AGCCGGCGGT CCTTGCGCTC CCCAACAGCG QCGCCGGGGG CGCGGGGGCG CCGTCGGGCA 120 

CAGTCCCGGT GCTCTTCTGT TTCTCAGTCT TCGCGCGACC CTCGTCGGTG CCACACGGGG 180 

CQGGCTACGA GCTGCTCATC CAGAAGTTCC TCAGCCTGTA CGGCGACCAG ATCGACATGC 240 

60 ACCGCAAATT CCTTGCTGCAG CTGTTCGCCG AGGAGTGGGG CCAGTACGTG GACTTGCCCA 300 
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AGGGCTTCGC GGTRAGCGAG CX5CTGCAAGG 
CTACCCTGGG AAATCTTACA CCTTCAAGCA 

5 

GGTOCAGACC AGCCATCAAG TATTTTGGGG 
AAGGGGCCCG GATTTTAGGA ATTCCTGTTA 
10 GGAGCAOaGT TCAAGAAATT GATTTAACAG 
TTTCAATGGT ATTACCAGAA GTAGAAGCGG 
TTGTATTATT TGGAGTAGAA ACTCATGTGT 

15 

C5CCGAGGAGT CGAGGTTCAC ATTGTTGCTG 
GGATGTTTGC CCTCGAGCGT CTCGCTCRAR 
20 TTCTCCTTCA GCTQGTAGCT GATAAGGACC 
TTAAGGCGAG TGCTCCAGAG TCGGGTCTGC 
GTATGCTACT CACTGGTGAA GGACAGTCAG 

is 

CTTATCTCTA CTAGAATTAA AATGTTAAGT 
TGAAACTTAA CCAGCTAGAC CATTTGAGTA 
30 CCGGTGCTGC TTACCTTCCT nTTTGTTAA 
TGAAGATGCC TGTTTTGTCT CTACTGTGTA 
CTTGTGAAGT TTTCTTAAAT TGTTCACTTT 

35 

CTTTTATATT ACTGTAAGAT GTTATAATGT 
ArrGATTQGA ATAAGATTAT TGCATATGAA 
40 CACrcCCCTC ACAATGTTGT CCACTTAGTG 
GTTGAATAAT TACATATCTT TCTTGACTAT 
AATCTCTGTT AATATTCTCT CTTTTAACTG 

45 

CCATATTATT GGTGGAGGGC TGTTTTAACA 
ACCAACATCT TGAATATATA TTCTAGTGTC 
50 GTGGAATATC ATTTTAAAAT GTTCATGrTTC 
ATATTGTAAT GCAAAAAGTC TCAGTAATGA 
TCTCTTCGAT AAATTTATTT TCATTAAATA 

55 

ATATGTGAAA TGTGAAACTG CTGTCTTTTA 
ATTGAAATTA TTTTGNCXnx: CACAAGATGG 
60 ATTTATTTAA GGTNATAAAA TCTTGACATT 



TCCGCCTCGT GCCGYTGCAG ATCCAGCTCA 360 

CTGTGTTTTT CTGCTGTGAT ATGCAGGAAA 420 

ATATTATTAG CGTGGGACAG AGATTGTTGC 480 

TTGTAACAGA ACAATACCCT AAAGGTCTTG 540 

GTGTAAAACT GGTACTTCCA AAGACCAAGT 600 

CATTAGCAGA GATTCCC3GGA GTCAGGAGTG 660 

GCATCCAACA AACTGCCCTG GAGCTAGTTG 720 

ATQCCACCTC ATCAAGAAGC ATGATGGACA 780 

CCGGGATCAT AGTGACCACG AGTGAGGCTG 840 

ATCCAAAATT CAAGGAAATT CAGAATCTAA 900 

TTTCCAAAGT ATAQGACATr TGAAGAACTG 960 

GTGAAGGACT GTAAGCCCAC ACAAGCTCTT 1020 

CAAAAACGGC TCCTTTTTTG CGCCTCCTAG 1080 

CXTAGCATTTA GTTACAAACX5 TCAAAGGCTT 1140 

TGTGCTTrrA TTTATTAAAA AAAATTACAA 1200 

CTCTGATCGT ATCTTTCCAA AGTGCAGACT 1260 

AAAGAAAATG ACXTTACCAAC AATGATTTGG 1320 

TAATGTGGAT GTAGTGCTTT TACTTTACAG 1380 

TTTACCCACA GGACTCTGAA TCATGTTACC 1440 

AGTTGCATTG ATCTATCCGT ACCAAATGAT 1500 

ACTGATTTCT TATTTTGGTC ACTATTACTA 1560 

AAAAGGGATG GGATAGAAGG GTTTGCAATG 1620 

TCTTTGAAGT ATGGCTTGCT GAATATCTTT 1680 

CACAAGATTT AGCAAAAAGA TAAAGCTTGG 1740 

TGTTCTATAT TTTCTTCACC TACTCTCCAA 1800 

TTTQGTAGTA TTAATTTTGT GGTCATTGTT I860 

CTTRTTAGAG GGTTTTGAAA TGTTITTCAA 1920 

TATTAAAGTA ATTAAAGAAA ATGTATTGTG 1980 

CTCTATGAGT ATTCTTCCAG GGATTCTAAT 2040 

TATAATCTTT C 2081 
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5 (2) ItJFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17X7 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEDNESS: double 

(D) TOPOtiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

15 CCCCQGCGGA GCTGGACCCG CGGTGGGCTA GGGGCAGGGC CGGAGCCGCG GCGGCGGAGC 60 

TGTGGATCCT TCATGATGAG AGATTTGGGG ACACTTCTCT CTCCTGTGTG TAGTTGATAG 120 

TTTGGTGGTG AAGAGATGGC TGACAGTGTC AAAACCTTTC TCCAGGACCT TGCCAGAQGA 180 

20 

ATCAAAGACT CCATCTGGGG TATTTGTACC ATCTCAAAGC TAGATGCTCG AATCCAGCAA 240 

AAGAGAGAGG AGCAOCGTCG AAGAAGGGCA AGTAGTGTCT TGGCACAGAG AAGAGCCCAG 300 

25 AGTATAGAGC GGAAGCAAGA GAGTGAGCCA CGTATTGrTTA GTAGAATTTT CCAGTGTTGT 360 

GCTTGGAATG GTGGAGTGTT CTGGTTCAGT CTCCTCTTGT TTTATCGAGT ATTTATTCCT 420 

GTGCTTCAGT CGGTAACAGC CCGAATTATC GGTGACCCAT CACTACATGG AGATGTTTGG 480 

30 

TCGTGGCTGG AATTCTTCCT CACGTCAATT TTCAGTGCTC TTTGGGTGCT CCCCTTGTTT 540 

GTGCTTAGCA AAGTGGTGAA TGCCATTTGG TTTCAGGATA TAGCTGACCT GGCATTTGAG 600 

35 GTATCAGGGA GGAAGCCTCA CCCATTCCCT AGTGTCAGCA AAATAATTGC TGACATGCTC 660 

ITCAACCTTT TGCTGCAGGC TCTTTTCCTC ATTCAGGGAA TGTTTGTGAG TCTCTTTCCC 720 

ATCCATCTTG TCGGTCAGCT GGTTAGTCTC CTGCATATGT CCCTTCTCTA CTCACTGTAC 780 

40 

TGCTTTGAAT ATCGTTGGTT CAATAAAGGA ATTGAAATGC ACCAGCGGTT GTCTAACATA 840 

GAAAGGAATT GGCCTTACTA CTTTGGGTTT GGTTTGCCCT TGGCTTTTCT CACAGCAATG 900 

45 CAGTCCTCAT ATATTATCAG TGGCTGCCTT TTCTCTATCC TCTTTCCTTT ATTCATTATC 960 

AGCGCCAATG AAGCAAAGAC CCCTGGCAAA GCRTATCTCT TCCAGTTGCG CCTCTTCTCC 1020 

TTQGTQGTCT TCTTAAGCAA CAGACTCTTC CACAAGACAG TCTACCTGCA GTCGGCCCTG 1080 

50 

AGCAGCTCTA CTTCTGCAGA GAAGTTCCCT TCACCGCATC CGTCGCCTGC CAAACTGAAG 1140 

GCTACTGCAG GTCACTGAGT TGCCTGCCAT CCAAAGGGGA TGGGCGGGAT TGGAAGAAGC 1200 

55 TGTGGCAGCT CTTTTCCCTG TTCACCTCCC GCCTGCCAGG GAAGGCAGGA CCCGCTCTGC 1260 

CAAGGGCCCT CTQCGTATTC CCTTCTCTCT GAGGAATTGA AATTTTTGTC TCTGGTGCAC 1320 

GTAAGGCAGA ATGTTCCCTG ACACCAGTGT GTGGATTTTT AACATCACCG TGAGTCTGAA 1380 

60 
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AGGACCACAG GTTTTTCTGC AGCTATTTTC TAGCATTTGC CAGTCCCTGT GCCTGGACTG 1440 

ATTGGAACAC TTTGTTTTTC TCCCTGTGCC ATTTACCCTT CO^CCTTTCC ATCCTGCCTT 1500 

CTACCACCCT TGGATGAAT6 GATTTTGTAA TrCTAGCTGT TGTATTTTGT GAATTTGTTA 1560 

ATTTTOrrGT TTTTCTCTGA AACACATACA TTGGATATGG GAGGTAAAGG AGTGTCCCAG 1620 

TTCCTCCTOG TCACTCCCTT TATAGCCATT ACTGTCTTGT TTCTTGTAAC TCAGGTTAGG 1680 

TTTTGGTCTC TCTTGCTCCA CTGCAAAAAA AAAAAAA 1717 



10 



15 



25 



35 



45 



55 



(2) INFORMATION FOR SBQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPODOGY: linear 



50 MTNTTTTCGN AAACATAATT TGAATAAAAT AATTTTTAAT GGATTNTGNA AAAAAAAAAA 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 431 base pairs 



120 
180 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

CCACGCGTCC GGTCACTATG TAGTGGAGGG GCAGACACCC TCCCGCAAAT TCTGGAAGGT 60 
TCTTAGTCTC GACTAGGGCA GTAGCCCCAG GACTCCTAGT CGCCGGCTTC AGGTCACTGC 
30 CGGCTCAACG GAGCTGCCGT CGCCATGTTT GGCTGCTTGG TGGCGGGGAG GCTGGTGCAA 

ACAGCTGCAC AGCAAGTGGC AGAGGATAAA TrrGTTTTTG ACTTACCTGA TTATGAAAGT 240 

ATCAACCATG TTGTGGTTTT TATGCTGGGA ACAATCCCAT TTCCTGAGGG AATGGGAGGA 300 

TCrcaXTTACT TTTCTTATCC TGATTCAAAT QGAATGCCAG TATGGCAACT CCTAGGATTT 360 

GTCACGAATG GGAAGCCAAG TQCCATCTTC AAAATTTCAG GTCTTAAATC TGGAGAAQGA 420 

40 AGCCAACATC CTTTTGGAGC CATGAATATT GTCCGAACTC CATCTGTTGC TCAGATTGGA 480 

ATTTCAGTOG AATTATTAGA CAGTATQQCT CAGCAGACTC CTGTAGGTAA TGCTGCTGTA 540 

TCCTCAGTTG ACTCATTCAC TCAGTTCACA CAAAAGATGT TGGACAATTT CTACAATTTT 600 
GCTTCATCAT TTGCTGTCTC TCAGGCCCAG ATGACACCAA GCCCATCTGA AATGTTCATT 



660 



CCGGCAAATG TGGTTCTGAA ATGGTATGAA AACTTTCAAA GACGACTAGC ACAGAACCCT 720 



780 



AAAAAAAAAA AAAAAAAAAA AAAA 



(2) INFORMATION FOR SEQ ID NO: 126: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTIC»«: SEQ ID NO: 126: 

GGCACAGCCC AGGGCCTTGA AGCCAGCTGG CCCTGGAGAG GGGCTCCTCT GCCAGCTTGG 60 

GGAGGGTCTG GGATGGGGCT GCCCCTGATG GCCCTGATGT GGAGTACCTT GCCAGCATCT 120 

10 

GCTCGGGTGA ACTTTATTTT AGCCCTTCCC TTGTTGCTCT TATGGAAGAA CAGAGGAGQG 180 

GTCGGCAGGT CAGTGATGTC AGCAGTGGAG TGATTCCCAG CACAGCGGCT TCTGGGAAGA 240 

15 GGGCATGGAG GCATTTCTTT CAGGGAAATG GTCCATNATT TCAGCCAGAA GGCATTGCAT 300 

TAAGTTAAGT CCNGGACTTT TGfTQGCCCPiG CTCTGTGTTA TTAAGGGCCC TTGGCGAAGA 360 

CTTCAAGGAG GGGGCAAAAN GACCTTTAAG TTTTTAGGTr TAACACAGGG AACCCNCAAA 420 

20 

GGGTTATTTT G 431 



25 



35 



45 



55 



(2) INFOKMATION FOR SEQ ID NO: 127: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3752 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

NGGCACGAGG AGAGTCACCT GGACTCAGAA CTAGAGATAT CCAATGACCC AGACAAAATT 60 

AAACTTCAGC TTTCTAAGCA TAAGQAGTTT CAGAAGACTC TTGGTGGCAA GCAGCCTGTG 120 

40 TATGATACCA CAATTAG7\AC TGGCAGAGCA CTGAAAGAAA AGACTTTGCT TCCCGAAGAT 180 

ASTCAGAAAC TTOACAATTT CCTAGGAGAA GTCAGAGACA AATGGGATAC TGTTTGTGGC 240 

AAGTCTGTCG AGCQGCAGCA CAAGTTGGAG GAAGCCCTGC TCTTTTCGGG TCAGTTCATG 300 

GATGCTTTGC AGGCATTGGT TGACTGGTTA TACAAGGTGG AGCCACAGCT GGCTGAGGAC 360 

CAGCCCGTQC ACGGGGGACC TTGACCTCGT CATGAACCTC ATGGATGCAC ACTU^TTTT 420 

50 CCAGAAGGAA CTGGNGAAAG CGAACAGGAA CCGTTCAGGT CCTGAAGCGG TCAGGCCGAG 480 

AGCTCATTGA GAATAGTCGA GATGACACCA CTTGGGTAAA AGGACAGCTC CAGGAACTGA 540 

GCACTCQCTG GGACACTGTC TGTAAACTCT CTGTTTCCAA ACAAAGCCGG CTTGAGCAGG 600 

CCTTAAAACA AGCGGAAGTG TTTCGAGACA CAGTCCACAT GCTGTTC3GAG TGGCTTTCTG 660 

AAGCAGAGCA AACGCTTCGC TTTCGGGGAG CACTTCCTGG ATGACACAGA GGCCCTGCAG 720 

60 TCTCTCATTG ACACCCATAA GGAATTCATG AAGAAAGTAG AAGAAAAGCG AGTGGACGTT 780 
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60 



AACTCAGCAG TAGCCATGGG AGAAGTCATC CTGGCTGTCT GCCACXXTGA TTGCATCACA 840 

ACCATCAAAC ACTQGATCAC CATCATCCGA GCTCQCTTCG AGGAGGTCCT GACATGGGCT 900 

AAGCAGCACC AGCAGCGTCT TGAAACX3GCC TTGTCAGAAC TQGTGGCTAA TGCTGAGCTC 960 

CTCGAAGAAC TTCTGGCATG GATCCAGTGG GCTGAGACCA CCCTCATTCA GCGQGATCAG 1020 

GAGCCAATCC CGCAGAACAT TGACCGAGTT AAAGCCCTTA TCGCTGAGCA TCAGACATTT 1080 

ATGGAGGAGA TGACTCGCAA ACAGCCTGAC GTGGACCX3GG TCACCAAGAC ATACAAAAGG 1140 

AAAAACATAG AC3CCTACTCA CX3CX3CCTTTC ATAGAGAAAT CCCGCAGCGG AGGCAGGAAA 1200 

TCCCTAAGTC AGCCAACCCC TCCTCCCATG CCAATCCTTT CACAGTCTGA AGCAAAAAAC 1260 

CCACGGATCA ACCAGCTTTC TGCCCGCTQG CAGCAGGTGT QGCTGTTAGC ACTGGAGCGG 1320 

CAAAGGAT^ TGAATGATGC CTTQGATCGG CTGGAGGAGT TGAAAGAATT TGCCAACTTT 1380 

GACTTTGATG TCTOGAlGGAA AAAGTATATG CGTTQGATGA ATCACAAAAA GTCTCGAGTG 1440 

ATGGATTTCT TCCGGCGCAT TGATAAGGAC CAGGATGQGA AGATAACACG TCAGGAGTTT 1500 

ATCGATGQCA OTTTAGCATC CAAGTTCCCC ACCACCAAGT TAGAGATGAC TGCTGTGGCT 1560 

GACATTTTCG ACCGAGATGG GGATGGTtTAC ATTGATTATT ATGAATTTGT GGCTGCTCTT 1620 

CATCCCAACA AGGATCCGTA TCGACCAACA ACCGATGCAG ATAAAATCGA AGATGAGGTT 1680 

ACAAGACAAG TCGCTCAGTG CAAATGTGCA AAAAGGTTTC AGGTGGAGCA GATCGGAGAG 1740 

AATAAATACC GGTTCTTCCT CGGCAATCAG TTTGGGGATT CTCAGCAGTT GCGGCTGGTC 1800 

CGTATTCTCC GCAACCXSTGA TOGTTCGCGT TQGTGGAGGA TGGATGGCCT TGGATGAATT 1860 

TTTAGTGAAA AATGATCCCT GCXX5AGCACG AGGTAGAACT AACATTGAAC TTAGAGAGAA 1920 

ATTCATCCTA CCAGAGGGAG CATCCCAGGG AATGACCCCC TIXXGCTCAC GGGGTCGAAG 1980 

GTCCAAACCA TCTTCCX^GGG CAGCTTCCCC TACTCGTTCC AGCTCCAGTG CTAGTCAGAG 2040 

TAACCACAGC TGTACATCCA TCCCATCTTC TCCAGCCACC CCAGCCAGTG GAACCAAGGT 2100 

TATCCCATCA TCAGGTAGCA AGTTGAAACG ACCAACACCA ACmTCATT CTAGTCX3GAC 2160 

ATCCCTTGCT GGTGATACCA GCAATNAGTT CTTCCCCGGC CTCCACAGGT GCCAAAACTA 2220 

ATCGGGCAGA CCCTAAAAAG TCTGCCAGTC GCCCTGGGAG TCGGGCTGGG AGTCGAGCCG 2280 

GGAGTCGAGC CAGCAGCCGG CGAGGAAGTG ACGCTTCTGA CTTTGACCTC TTAGAGACGC 2340 

ATTGCTTGTT CCGACACTTC AGAAAGCAGC GCTQCAGGGG GCCAAGGCAA CTCCAGGAGA 2400 

GGGCTAAACA AACCTTCCAA AATCCCAACC ATGTCTAAGA AGACCACCAC TGCCTCCCCC 2460 

AGGACTCCAG GTCCCAAGCG ATAACACTGT CTAAGCACCC CCAAGCCACT ATCCACTTTG 2520 

AATCCTOCTC CATACATTGG GTGTATATTT ATTCTGAACG GGAGAAGTTA TATTGTTAAA 2580 
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AGTGTAAAAG AATAATTGTG TTATGAAGCT GCCTTATTTT TTTTCTTTTT GTAAGTTACT 2640 

ATTTTCATGT GAATATTTAT GTAGATAAAA TTTGCCTCCT GGTAACCCTG TAATGGATGG 2700 

5 

GGCCCAGAAA TGAAATATTT GAGAAAAACA AGTGAAAAGG TCAAGATACA AATGTGTATT 2760 

AAAAAAAAAA AAGCCTATTA ATAGGGTTTC TGCQCX3GTGC AGGGTTGTAA ACCTGCTTTA 2820 

10 TCTTTTAGGA TTATTCCTAA ATGCATCTTC TTTATAAACT TGACTTGCTA TCTCAGCAAG 2880 

ATAAATTATA TTAAAAAAAT AAGAATCCTG CAGTGTTTAA GGAACTCTTT TTTTGTAAAT 2940 

CACGGACACC TCAATTAGCA AGAACTGAGG GGAGGGCTTT TTCCATTGrT TAATGTTTTG 3000 

TGATTTTTAG CTAAAGAGAG GGAACCTCAT CTAAGTAACA TTTQCACATG ATACAGCAAA 3060 

AQGAGTTCAT TGCAATACTG TCTTTGGATA TrGTTTCAGT ACTQQGTGTT TAAAGGACAA 3120 

ATAGCTGCTA GAATTCAGGG GTAAATGTAA GTGTTCAGAA AACGTCAGAA CATTrGGGCTr 3180 

TTTAAACTGA TTTGTTGCTC CCTATCCAGC CTAGACACCA GTAACTCTTG TGTTCACCAG 3240 

GACCCAGACC CTTGGCAAGG GATAGGCTCG TTGGTGACAT TGTGAATTTC AGATTTGTTT 3300 

TATCCACTTT TTTTGCTATT TATTTAAATG GTCGATCAAC TTCCCACAAA CTGAGGAATG 3360 

AATTCCACGA GCCTGTTCTG AAAATGTQGA CGTAAGACAA ACACGTGCTC GTCCTTTAAT 3420 

GGAGTTCACC AGCACACTTG TTAACCAGTC CTGTTTGCTT TCX3TCTTTTT TTGTGCGTAA 3480 

TAAAGTCAAC TGACCAAGTG ACCATGAAAA GGGGCTGTCT GQGGCTCCTG TTTTTTAGCT 3540 

GCTGTTCTTC AGCTCCGACC ATGTTGCTGT GTGATTATCT CAATTGGTTT TAATTGAGGC 3600 

AGAAACTGAA GCTCTACCAA TGAACTGTTT AGAAACAAGA CACACTTTTG TATTAAAATT 3660 

GCTTGCAGTA ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAACTCGAQG GGGGCCCGGT 3720 

ACCCAATTCG CCGTATATGA TCXSTAAACAA TC 3752 



(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEEVjTH: 1144 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

TGACCCTCTG CCTGCCGGGC TCAGTGCTGG ACGCTTTCTG TTTTGTCGCA GTCGGTCCTC 60 

GGTAACACCA QCGGCCTGTG GTCCACCACT CCATTCAGCA GCTCCATTTG GTCCAGCAAC 120 

CTTAGCAGCG CCTTCCCTTC ACCACTCCAG CAAACACGCT GGCAAGCATC GGCCTCATGG 180 



wo 98/39448 



PCT/US98/04493 



10 



15 



20 



25 



30 



35 



351 

GCACAGAAAA CTCCCCrCSCT CCTCACGCTC CXTTCCACCTC CAGTCCAGCT GACGACTTGG 
GACAGACCTA CAACXTCGTCSG CGGATATGGA C3CCXX:ACGAT TCGAAGAAGA AGCTCGGACC 
CTTGCTCTAA TTCGCACTTT CCTCACGAGA ATTAAATTAA GCAAAAAACA AACAAACATA 
GTCGGCCCTC GTCTAGATCA TGATGTGCCA CTTTCTGAGA CATCTTTTTA AGGCTCTTAC 
TCCAGCTCCC CTCCrCACCC TCCTCTTCTT TGCAAAACAG ACCCAAGCAG GGCAGGCTCA 
GACCACTCGC TTCTTTCAGA TCTTTCTTC3C AATTATGATA ACATGAGATT TGCTGTTGTG 
CTTTTAGAGA AAAGTCTGGA CTCAGCCACA AACTCTAATA AGACCTGTAC ATCTGAGAAC 
CTTTCXCGTT ACTGCGTTTT CACCACCTGT CTTCCCCATG CTTTATTTAT CTGTATGAAC 
ACAGATTTCA CATTACAGCT AAGQAAATAA TTTGAGTTGA TTCAGAAATC CTGGCATGTG 
ACAATTTTGr TAAATTACCA AGTTrcGTTT TTAATAATTT CTCAATATTA TGCGCCAAGA 
TCTAATTTTA AAACTGTATG AGGACTTTGT GCTGAAAATA GAGTATTTTT 1TAAAGTAAG 
GCTCTCTTCG TTTAAAAC3CA GATTACAGAA ATGTAAGTCA ACTTAAGAAC RGTGAATGAA 
TGTAAAAACA TTCAGTYGAG ACCATATGCA TTTTCTGTGC TGTTTGTACT TGAGGTATGT 
AACATTTCTA TACCTCAACT TATTTTAAAG ATGAACTGAA ATGCACATAG CCAAGTCTTG 
AGATACAAGA TTCAATCTGT ATTTCTTAAA AATACAACTT TGTGTTGTAC TTTGAAATAA 
ATX3ATGCTTT TTTCAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG GCCCGGTACC 
CAAT 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1144 
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(2) INF0RMATIC2N FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
GCATGCAGAG GAGCACCCTG AGCGTGTYCC TGGAGCAGGC GGCCATSTTG GCACGGAGCC 
ACGGGITCCT GCCCAAGTGC ATCATGCAGG CCACGGACAT CATGCGGAAC AGGGCCCAAG 
GGTGGAGATT CTCGCCAAAA ACCTGCGAGT CAAGGACCAG ATGCCCCAGG GTGCTCCGCG 
CCTCTACCGC CTCTCCCAGC CGCCGGTGGA TGGGGACCTC TGAACACCCA AATGCCCCAC 
GCTGGGCCGC GGCCTCIGGA GCTGGGATTT GGGAGGACAC AGCAGGCAQC GCTGGCCTTC 
TCCAGOGATG GCCCAANGCT TCCGCARCCG CCCGTTCCGG GACCTGCCCA GCGTCCTCCC 
1X3CCTCCTTC CGGGACAAGC CTGGCCACCC TCGCTGTGAT GACGAGCTGG CTGATTC3GCC 



60 
120 
180 
240 
300 
360 
420 
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CTCGGCCGGC CCATTCTTCA CACGCCTGCC AGAAGCTGGA GGGGTGCTGG AGACCCATAG 480 

AGCTGATGGG AGCAGCTGGT GCCTQGCCTT CGGCTCCTGC GTCCCCAGAA CCCAAGGGAA 540 

5 

CGTCATCGAG GCCACATGGG GCXACCCS3GC TCCCTCGGGA TGGCTCCGCT GCACTTTTGA 600 

AACeCCGGTT TCCTTCAACG TCCACATTCC AGGTGACCAC ACGTGTCTCC TCCTCCTCAT 660 

10 CTTAGCTTCC AQGTTCACCC TAACCCTGTA CTAACCTCCT TGGTGGACTT GGAAAAGACT 720 

TGGCTCTGTC GQC5AAAGGAG AGACQGGGCC TCCATCACXX: CTGTTACCAG AGGATCCCCX; 780 

AGAGCXACAC CAGCTCTQGA CATCACXX3CC CCTGGAACTG GGGCCACCAG CCCTGOGCAC 840 

15 

GAGATTTGCT CTGACTTTAT TTATATGGCA TGAAATCTCT GGTTTATTTT GGGATimT 900 

GTTGTTCGTX5 TTGTCAAAGT TTGTrTTTTC TAAAGTTGTG TGATTATATA TTTGACATTT 960 

20 TACAOTICAA AGAAAGGTAT GTTGTCTAAC AGGGGACCAA CAGAAGGTAG TATTGACAAC 1020 

TCTTCCTGCT TCTACTAAAA AAAAAAGAGC ACAAAAGAAA AACTAAATTA TTGAAAAATT 1080 

AAAAAATCTC ATTGTTTCCT GTTTGTTAAT ATTAGGGTTG TAAGGTGTCG TTTTGAGGTA 1140 

TCGACTGTGA TTCCTTCCCC CACCCTCCAT TCTCCAGCGG TTGGCCGGTG TTAGAACTCG 1200 

CTCTCTTTGA GTGACTGGCT ACAAGGGCCT GAGAGGTGGC CAGCCAGGGT TGGAGCTGGA 1260 

30 GGGGATCGAG CCCCACCTGA GGTGCXX?rGT CACACGGGTT AGAGGGTCAC TGGGAAACAC 1320 

CGGGCGGTGG CTTCTGTGAT TTATTTTCTT GATGGTAACT TCTCAGAGCA GGGCRATTGG 1380 

GACATCACCA GCCAGAGCAC AGGAAGCCAC CCTGCXTTGCT GGGGAGGAGG GACCCACACA 1440 

AGCCCCCTCG GCAGTTTCTC CCCCCAGCTT CQGTATGCCT TCAGGGAAAG GTCACAGCTG 1500 

GGGAGGAAGC GGGGGGACGC CTGTCACXrCC TGGCAGGTGG TGAGTTCAGG TGGQGGCTCC 1560 

40 CTGCTKCCCC CAGGCCTGGG AGCTTGAAGC CCTCCCGGCA TCTGGCATCC GAGCCTCCCG 1620 

CCCTCCAGGG TGCGCTTCCC TCTCTTGCCG CAGCATACAC GAGGGCAGGC AGTGGCCTTG 1680 
TCACTOTATC TTCCATCAGA GACAAAGGAG GACCCGCTTT AGCCCTGCTG CGGGAAATGG 
GQGATOGCCC AGGGCCAQCG CATTGTGCAC TCGITTACTT TAAAATGTAC AGATTCTTCT 



25 



35 



45 



50 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



1740 
1800 



CGTTAAATTC TTGATAGATT TTTTATTATT ^830 



(2) INFORMATION FOR SEQ ID NO: 130: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

GGCCGCCCGG ATOGCGACCC CAGCCTCGGC CCCAGACACA CGGGCTCTGG TGGCAGACTT 60 

TGTAQGTTAT AAGCTGAGGC AGAAGGGTTA TGTCTGTGGA GCTQGCCCCG GGGAGGGCCC 120 

AGCAGCTGAC CCGCTGCACC AAGCCATGCG GGCAGCKC3GA GATGAGTTCG AGACCCGCTT 180 

CCGGCGCACC TTCTCTGATC TGGCGGCTCA GCTGCATGTG ACCCCAGGCT CAGCCCAACA 240 

ACGCTTCACC CAGGTCTCCG ATGAACTTTT TCAAGGGGGC CCCAACTGGG GCCGCCTTOT 300 

AGCCTTCTTT GTCrTTGGGG CTGCACTGTG TGCTGAGAGT GTCAACAAGG AGATGGAACC 360 

15 ACTGGTOGGA CAAGTGCAGG AGTGGATGGT GGCCTACCTG GAGACGCGGC TGGCTGACTG 420 

GATCCACAGC ACTGGGGGCT GGTTATCCCA GATCACTQAA GCTGAGATGG CTGATGAAGT 480 

AATTTGCAGT GAAATTTTAA GCGACTGTGA CTCTGCTGCA AGTTCCCCAG ATCTTGAGGA 540 

20 

GCTGGAAGCT ATCAAAGCTC GAGTCAGGGA GATGGAGGAA GAAGCTGAGA AGCTAAAGGA 600 

GCTACAGAAC GAGGTAGAGA AGCAGATGAA TATGAGTCCA CCTCCAGGCA ATGCTGGCCC 660 

25 GGTCATCATG TCCATTGAGG AGAAGATGGA GGCTGATGCC CGTTCCATCT ATGTTGGCAA 720 

TGTOGACTAT GGTGCAACAG CAGAAGAGCT GGAAGCTCAC TTTCATGGCT GTGGTTCAGT 780 

CAACCGTGTT ACCATACTGT GTGACAAATT TAGTGGCCAT CCCAAAGGGT ITCCGTATAT 840 

AGAGTTCTCA GACAAAGAGT CAGTGAGGAC TTCCTTGGCC TTAGATGAGT CCCTATTTAG 900 

AGGAAGGCAA ATCAAGGTGA TCCCAAAACG AACCAACAGA CCAGGCATCA GCACAACAGA 960 

35 CCGGGGTTTT CCACGAGCCC GCTACCGCGC CCGGACCACC AACTACAACA GCTCCCGCTC 1020 

TCGATTCTAC AGTCGTrTTA ACAGCAQGCC CCGGGGTCGC GTCTACAGGG GCCGGGCTAG 1080 

AGCGACATCA TGGTATTCCC C^ACTAAAA AAAGTGTGTA TTAGGAGGAG AGAGAGGAAA 1140 

AAAAGAGGAA AGAAGGAAAA AAAAAAGAAT TAAAAAAAAA AAAAAAAAAA ACAGAAGWTG 1200 

MCCTTGATGG AAAAAAAATA TTTTTTAAAA AAAAGATATA CTGTGGAAGG GGGGAGAATC 1260 

45 CCATAACTAA CTGCTCAGGA GGGACCTGCT TTGQGGAGTA QGGGAAGGCC CAGGGARTGG 1320 

GGCAGGGGGC TGCTTATTCA CTCTGQGGAT TCGCCATQGA CACGTCTCAA CTGCGCAACT 1380 

GCTTGCCCAT GTTTCCCTGC CCCACCCCAC CCCTCTTCTC CGGCTCCCTG CCCCTCCAGA 1440 

TTCCCTOGTG ATCTATTTTG TrrCCTTTTG TGTTTCTTTT TCTGTTTTGA GTCTCTTTCT 1500 

TTCCAGCTTT CTGTAGCCGG AAGATCTCCG TTCCGCTCCC AGCGGCTCCA GTGTAAATTC 1560 

55 CCCrrcCCCC TGGGGAAATG CACTACCTTG nrrGGGGGG TTTAGGGGTG TTTTTGrrTT 1620' 

TCAGTTGTTT TGTTTTTTTG rmTl W rr TTTCCTTTGC CTTTTTTCCC TTTTATTTGG 1680 

AQGGAATCGG AGGAAGTGGG AACAGQGAGG TGGGAGGTGG ATTTTGTTTA rTTTTTTAGC 1740 
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TCATTTOCAG GGGTGGGAAT TTTTTTTTAA TATGrTGTCAT GAATAAAGrTT GTTTTTGAAA 1800 
AKAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1860 
5 AAAA 1864 



10 (2) INFORMATION FOR SEQ ID lK>i 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2041 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: doijble 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTIC»a : SEQ ID NO: 131: 

20 GGCACGAGCG CGCGC3CAGGG CCCTGGACCC GCGCGGCTCC CGGGGATGGT GAGCAAGGCG 60 

CTGCTGCGCC TCGTGTCTGC CGTCAACCGC AGGAGGATGA AGCTGCTGCT GGGCATCGCC 120 

TTGCTGGCCT ACGTCGCCTC TGTTTGGGGC AACTTCGTTA ATATCAGGTC TATCCAQGAA 180 

25 

AATGGTGAAC TAAAAATTGA AAGCAAGATT GAAGAGATGG TTGAACCACT AAGAGAGAAA 240 

ATCAGAGATT TAGAAAAAAG CTTTACCCAG AAATACCCAC CAGTAAAGTT TTTATCAGAA 300 

30 AAGGATCGGA AAAGAATTTT GATAACflGGA GGCGCAGGGT TCGTGGGCTC CCATCTAACT 360 

GACAAACTCA TGATGGACGG CCACGAGGTG ACCGTGGTGG ACAATTTCTT CACQGGCAGG 420 

AAGAGAAACG TGGAGCACTG GATOGGACAT GAGAACTTCG AGTTGATTAA CCACGAGGTG 480 

35 

TGGAGCCCCT CTACATCGAG GTTGACCAGA TATACCATCT GGCATCTCCA GCCTCCCCTC 540 

CAAACTACAT GTATAATCCT ATCAAGACAT TAAAGACCAA TACGATTGGG ACATTAAACA 600 

40 TGTTGQQGCT GGCAAAACGA GTCGGTGCCC GTCTGCTCCT GGCCTCCACA TCGGAGGTGT 660 

ATGGAGATCC TGAAGTCCAC CCTCAAAGTG AGGATTACTG GGGCCACGTG AATCCAATAG 720 

GACCTCGGGC CTGCTACGAT GAAGGCAAAC GTGTTGCAGA GACCATCTCC TATGCCTACA 780 

45 

TGAAGCAGGA AGGCGTGGAA GTGCGAGTGG CCAGAATCTT CAACACCTTT GGGCCACGCA 840 

TGCACATGAA CGATGGGCGA GTAGTCAGCA ACTTCATCCT GCAGGCGCTC CAGGGGGAGC 900 

50 CACTCACGGT ATACGGATCC GGGTCTCAGA CAAGGGCGTT CCAGTACGTC AGCGATCTAG 960 

TGAATGGCCT CGTGGCTCTC ATGAACAGCA ACGTCAGCAG CCCGGTCAAC CTGGGGAACC 1020 

. CAGAAGAACA CACAATCCTA GAATTTGCTC AGTTAATTAA AAACCTTGTT GGTAGCGGAA 1080 

GTGAAATrCA GTTTCTCTCC GAAGCCCAGG ATGACCCACA GAAAAGAAAA CCAGACATCA 1140 

AAAAAGCAAA GCTGATGCTG GQGTGGGAGC CCGTGGTCCC GCTGGAGGAA GGTTTAAACA 1200 

60 AAGCAATTCA CTACTTCCGT AAAGAACTCG AGTACCAGGC AAATAATCAG TACATCCCCA 1260 
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AACCAAAGCC TGCCAGAATA AAGAAAGGAC GGACTCGCCA CAGCTGAACT CCTCACTTTT 1320 

AGGACACAAG ACTACCATTG TACACTTGAT GGGATGTATT TTTGGCTTTT TTTTGTTGTC 1380 

5 

GTTTAAAGAA AGACTTTAAC AGGTGTCATG AAGAACAAAC TGGAATTTCA TTCTGAAGCT 1440 

TGCTTTAATG AAATGGATGT GCCTAAAAGC TCCCCTCAAA AAACTGCAGA ITTTGCCTTG 1500 

10 CACTTTTTGA ATCTCTCTTT TTATGTAAAA TAGCGTAGAT GCATCTCTGC GTATTTTCAA 1560 

GTTTTTTTAT CTTGCTGTGA GAGCATATGT TGTGACTGTC GTTGACAGTT TTATTTACTG 1620 

GTTTCTTTGT GAAGCTGAAA AGGAACATTA AGCGGGACAA AAAATGCCGA TTTTATTTAT 1680 

15 

AAAAGTGGGT ACTTAATAAA TGAGTCGTTA TACTATGCAT AAAGAAAAAT CCTAGCAGTA 1740 

TTGTCAGGTG GTGGTGCX3CC GGCATTGATT TTAGGGCAGA TAAAAGAATT CTGTGTGAGA 1800 

20 GCTTrATGTT TCTCTTTTAA TTCAGAGTTT TTCCAAOGTC TACTTTTGAG TTGCAAACTT 1860 

GACTTTGAAA TATTCCTGTT GGTCATGATC AAGGATATTT GAAATCACTA CTGTGTTTTG 1920 

CTGCGTATCT GGGGCGGGGG CAGGTTGGGG GGCACAAAGT TAACATATTC TTGGTTAACC 1980 

ATGGTTAAAT ATGCTATTTT AATAAAATAT TGAAACTCAC CT^AAAAAAAA AAAAAAAAAA 2040 

A 2041 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 132: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 2012 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 132: 
TACCAAQCTG CAAGAATCTA CTATATCATG GCAGAAGAAG TAGAGTGGGA CTATTGCCCT 60 
45 GACCGGAGCT GGGAACGGGA ATQGCACAAC CAGTCTGAGA AGGACAGriTA TGGTTACATT 120 
TTCCTGAGCA ACAAGGATGG GCTCCTGGGT TCCAGATACA AGAAAGCTGT ATTCAGGGAA 180 
TACACTGATG GTACATTCAG GMTCCCTCGG CCAAGGACTG GACCAGAAGA ACACTTGGGA 240 

50 

ATCTTGiGGTC CACTTATCAA AGGTGAAGTT GGTGATATCC TGACTGTGGT ATTCAAGAAT 300 
AATGCCAGCC GCCCCTACTC TGTGCATGCT CATGGAGTGC TAGAATCTAC TACTGTCTGG 360 
55 CCACTGGCTG CTGAGCCTGG TGAGGTGC?rC ACTTATCAGT GGAACATCCC AGAGAGGTCT 420 
GGCCCTGGGC CAATGACTCT GCTTGTGTTT CCTGGATCTA IT*ATTCTGCA GTGGATCCCA 480 
TCAAGGACAT GTATAGTGGC CTGGTGGGGC CCTTGGCTAT CTGCCAAAAG GGCATCCTGG 540 



60 
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NAGCCCCATG GAGGACGGAN TGACATGGAT 
GATGAAAATA AGTCTTGGTA TTTGGAGGAA 
5 GQCACTTATTA ACCTACAGGA TGAAACTTTC 
GGGAAACTCT ATOXAACCT TAGGGGTCTT 
TACATGCTGG CCATGGGCCA AGATGTGGAT 

10 

TTCCTCTATC GGAATGGCGA GAACTACCGG 
nTGAGGTTG TGGAGATGGT GGCCAGCAAC 
15 ACTGACCATG TCCATGCTGG CATGGAGACC 
TTAAGCCCTC TCACCGTCAT CACCAAAGAG 
GAAGAAGGCA ATGTGAAGAT GCTGGGCATG 

20 

GCCTCTGTTT TQGTTGCCAT TAGTGTCACC 
GTGGTTTGGT ACXAACATCG ACAGAGAAAG 
25 GACAGCTTCA AGCTTCTGTC TTTCAAACAG 
AAGCACATCT GTAGTGCACT CCCAGCAGGC 
GGGGCATGGG TQGTGGAGAA GCAGAAGGAG 

30 

TATTTATTTT ACATGGAAAT AATATGATTT 
GGGCACCTGG CACTAAGGGA GTACCTTATT 
35 TATATTTCCT TCTGACACTT GGAAGGTATT 
AGTAGAGACC AAGAGAAAAA CTCATTGATT 
ATTTCACTTT GAACTGAGGC CAAGTGAGCT 

40 

TAAGAATATA OGCTTGATGG GAAATTGAAG 
TTTTGATTCT CCTTGGCAGT GAACTACTTT 
45 TGAGCATGTA CAACXrTCTGG AGCTAGAAGC 
AACCTGTGGC ACTGAAAGGA ATGTTGAGTT 
TCCATTAAAG TACTTGTTAG AACACTGAAA 

50 



CXX3GAATTTG CATTGTTGTT CTTGATTTTT 600 

AATGTGGCAA CCCATGGGTC CCAQGATCCA 660 

TTGGAGAGCA ATAAAATQCA TGCAATCAAT 720 

ACCATGTACC AAQGAGAACG AGTQGCCTGG 780 

CTACACACCA TCCACTTTCA TGCAGAGAGC 840 

GCAGATGTGG TCGATCTOTT CCCAGGGACT 900 

CCTGGGACAT GGCTGATGCA CTGCCATGTG 960 

CTCTTCACTG TTTTTTCTCG AACAGAACAC 1020 

ACTGAAAAAG CAGTGCCTCC CAGAGACATT 1080 

CAGATCC(XA TAAAGAATGT TGAGATGCTG 1140 

CTTCTGCTCG TTGTTCTQGC TCTTGGTGGA 1200 

CTACGACXKZA ATAGGAGGTC CATCCTGGAT 1260 

TAACATCTGG AGCCTGGAGA TATCCTCAGG 1320 

CATGGACTAG TCACTAACCC CACACTCAAA 1380 

CAATCAAGCT TATCTGGATA TTTCTTTCTT 1440 

CACTTTTTCT TTAGTTTCTT TGCTCTACGT 1500 

ATCCTACATC GCAAATTTCA ACAGCTACAT 1560 

GAAATTTCTA GAAATGTATC CTTCTCACAA 1620 

GGGTTTCTAC ITCTTTCAAG GACTCAGGAA 1680 

C?rTAAGATAA CCXy^CACTTA AACTAAAGGC 1740 

GTAQGCTGAG TATTGGGAAT CCAAATTGAA 1800 

GAAGAAGTGG TCAATCGGTT GTTGCTGCCA 1860 

TCCTCAGGAA AGCCAGITCT CCAAGTTCTT 1920 

ACCTCTTCAT GTTTTAGACA GCAAACCCTA 1980 

AA 2012 



55 



(2) INFORMATION FOR SEQ ID NO: 133: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 base, pairs 

(B) TYPE: nucleic acid 
<C) STRANDEI»JESS : doxible 

60 {D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTIC3N : SEQ ID NO: 133: 

GAGCAGTATT TTAACCAACT TGTATTACAG ATGTTACAGT TCATGTTAGG AAGTCAGAAA 60 

AGACTTTGTT TGTCTTTGTT CTGCTGATGT GAGTCATGTT TTGTGGGGTC 1TCCATGGCA 120 

CATTTACCTG TTGCTCCGTC CAGATGTTGA GGGCCAGTCT AGGCTGACAC ATCCTACCCG 180 

AGGACAAGCC TGTTCTCCAT TTCTTCACTC TCCCCTCCCC ATATAGCAAC TCTCCCAGGT 240 

TTAGATTACC GnTTCGACG ACAGATTAAC CAAAAATGCC CCACACAGGT TTTATTACTG 300 

TTATATACTA TACTTTTAAC AGTACAGACC CTAAATTTTA TTATTTGTTG CTCCCCCAAT 360 

CTGATACCAA ATGTTTAAAG TTGTTTGAAA TCCAAACATG GTAGTGTTCA TGGGTAAATA 420 

TTTTCTAGGC TATGTAAGAG TTAGCAGCCC ATAGCATAGA AGTAATCAAG TAGCATCTGA 480 

GACTGrrTGGA GGCACTAGGG CCTCTCTGGG CCTAACAGCC TCACTTCCCC AGCCTCACCT 540 

TGCTGTCCTC TGACACTGCC ATCAGGGCTG TTAGTGGCAC CTGTATGAGG CCAAGTGTGC 600 

GTCCAGGGGA ACAGCACAGG TTAATGCX5TC TCCCTAGAAC TCATGAAGTC AGTTTAATTC 660 

ATGCATGAAC ATGAGTTCAT TTTATGTTTT ATATAGCTTT CTTAGACATA CCAAACCATC 720 

ATTCATAAAT CAGATAAATT ATTCAGnTT TGTGTTTAGA AAGCTAAGTA TGTGTAGCTG 780 

GAAACAAAAA TGAGCGTGTT TTCTCTCCTG TTAATCTAGA GTGTGCAGTT ACACATGTGT 840 

GGATAATTTC ATGTTCCAGG GGOGCTTGGC ATCTCCCATG GACTGATTCC CAGGAAGAAA 900 

AGCCCAAAGG 6AAACCCACG ATTCCTTTCG AGTAGATGTG GGAAAGAGCC CATTGGAGGA 960 

TATGAGGTCC TGTGAAA1TC AGTTGTGTGT GTGGCTCCTT GTTAGCAGTC ATGTTGACAT 1020 

GGTGTTAGGA GGCTCCXXIAT CCACCCTTTA CATGATGTAG QGACXZAGTGT CTTGTGAGAT 1080 

TAACCTTGGG ACACACTTGCG TTAGCCTGGA GAAAATGAGA GGCCCTGCCT GGACCCAGGG 1140 

AGAGGAQCCA GTGACACAGG CAGAGCGGTG CAGCCCTCCT TCCCTTCCAT TTGGAGGAGG 1200 

TGGTGCCAGG AGCCTGCCCG CTTACCTCTG CTGAAGCATA AGTGGACTTT GCTTTTGGGG 1260 

CTTATCTCTG ATACATGCTG GAGCCCTGCC TCTCCACTGC TAGATGGAAC CTGGAATCTC 1320 

TCATCTACCT CTTAGTCTGT CAGTTTCTAC GTGTGAGAAG CAAGCTTGTG GGCCAGTGTC 1380 

CTTGTACATG CTGTAGCACT TAAAAAATAA TTCCAGGGTT CCCTGGAAAA CCAGTCCCAG 1440 

GGTTOCTATG ATCTGTAGTT TCTACCTGGA TTATAACTGG TTTTGGGTAC CTGAATTTTG 1500 

ATTGGTTAGC CTTAATTATA GTCTGGCGTG ATCATGTAGA ATCTTTTCTG GTGAACAGAT 1560 

CATAAAGTTC TATCAAGGAG TTCTATCAAG GCATCCATGT CAGTGGTGCT ATGCTGGTTA 1620 

CAACTTGAGA TTTTTGAAAT AAAAAATTTG TCATAAAAAA AAAAAAAAA 1669 
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(2) INFOFMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOI/DGY: linear 



60 
120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 
CACTTTTGCT ATATAACCTA AGTGATAACC CTCTTTTAGT TACCTGCCAA ACTCTGGNCT 
15 TQGTTTATAT TCCAGTTAAC ACAGTTACAA AGCTGTAATG GTGTCTTTTT TTCCTTTGTA 

ACGGAATOK5 TAAATCAAAG TATATACATT GTGTGGTGTT CCTGTTTCTG GAGTTTCATG 180 

AGGATTTACA CATX3GCATTC AGTGTTCTGT ATAGATCTGC CTACCTTTGT GAATTCATCT 240 

GTTAACCCCT CTTCCTTTGA GAGAGCACCG GCGATQGTGG TTAACTCCTT GTGTTTTCTC 300 

TCTCTCCTAC TOGnTATTCT TGAATTAAGC ACAGACTCGT CAGCTCGGTT GCTTTATCAT 360 

25 GAATAATGTC TOTSACCITC CAOTTCTTCC ACAGTTCAGC AAACAAGTGC TAGCTTCACT 420 

GACCAAAAAT TAAGGAAGGA AAACACAGTT TTTAAAACGA TCCATCTTTT AACAGCOGAA 480 

ACCGATCTGT CTATCGTCCT GCACCTTGCT GTTGTACTTC TGAAATCAGA OGTGTGTGAA 540 

CGATCATTTC TGACTTAACC GTGAGATGCT CACGAGTACC CTTCCTGTTG TTTTGTTAGC 600 

ATTGAAATCG AGACTATTTA TTTGGAATAT ATACAACAGT GTTTTTCCAC TGTATTTCAT 660 

35 TTCCAAAAGT TGAGAACTGC TTTCTCTACC TTTTGCAAAA TAATTGATAT TCCATATTGG 720 
ATTCTCAAAG ACTTCGATAT GGTGAACCTA TTAAACCTAG AAATTGTATT CATCCTTTCA 



780 



TGACTOK3GC CTCAGTTCCC CAGCCCCTCT CCTCCTTTTT TTTAGATGAG ATTTAGCACA 840 



900 



CTCTCACyiTA TTTAAACATG CAACATTTCT TGAGTATGTA TGTTGAGGCC ATCTGAGCTC 

ATAGCTGATT CAGTAACCAG TTTCATGCTG TGTCATTCAC ACTCACTACT TAATACTGCC 960 

45 ATOGTOAAAA TGTGGAGGAA AAATGTATCC ATGTGTGTCT GGGAAGCATA TACACTTGTA 1020 

CATTTTTTAA TACTCTGATT CTGTAACATT TCTGAGTTTT GTTTTGTTTT ACAGNAAAAA 1080 

AAAAAAAAGT GATAAAGCAA TCAGAAGACC AAGAGGTTTA CTATTGATGC TTAGGGTCGT 1140 

CTGACCTTGG CTGGCCAATA GACCTACACG GCCAAATTAA TTTACGAGAG TAATAATTTT 1200 

TCAAAAGCCA ATTTTTTTTC TCTATTTTCT GTATGAAACT GCCAATATCA TGAATAGAAA 1260 

55 GGGAGAACCA TAAAGGAGAA AGAACGTGAT GTTCTGTTAT GTTCATGTAA ACCTAAAGAA 1320 
ACAGTCTCGA GGCAGGCGCG ATCAGCCGAA CTCTAGGGAC TTGGTGTTGC TTGGAAGGCA 



1380 



TCCATACCTC CArTTTGCAT TCTTCGTATG TAATCATATT GCCAAAGACA AACTATTTCA 1440 
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TCATTTATTG TAAATAACAC TTTTCCCCAG ACCTACCATA AAGTTTCTGT GATCTATrGT 



1500 



CTTCCAGTTG CAATAAAAAT TACTGAGTTG CATCAATTGA AGAAAAAAAA AAAAAAAAAA 



1560 
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(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) 2007 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOIjOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

TCTAAAAGCC CCCTTATACC CCACTTTGTG CAGCAAAGAT CCCCGTGCAG GTCACAGOCT 60 

GATTTGTGGC CAGGCTGGAC AAATTCCTGA GGCACAACTT GGCTTCAGTT CAGATTTCAA 120 

GCTGTGTTGG TGTTGGGACC AGCAGAAGGC AAACGTCCAG CCAACACACA GGACTGTAAG 180 

AGGACTCTGA GCTACGTGCC CTGTGAAGAC CCCCAGGCTT TGTCATAGGA GGTCGTTCAG 240 

CTTCCCCAAA GTCAGAGGTG ATTTGATTTG GGGAAGACTG AATATTCACA CCTAAGTCGT 300 

GAGCATATCC TGAGmTAC TTCCTTATGG-CTTGCCCTCC AAGTTCTCTC TCTCATACAC 360 

ACACACACCC TTGCTCCAGA ATCACCAGAC ACCTCCATGG CTCCAGCTAT GGGAACAGCT 420 

GCATTGGGGC TGCCTTTCTG TTTGGCTTAG GAACTTCTGT GCTTCTTGTG GCTCCACTCG 480 

CGAQGCAGCT CGGAGGTGTG GACTCCGATT GGGCTGCAGG CAGCTCTGGG ACGQCACAGG 540 

GCGQGCGCTC TGATCAGCTC GTGTAAAACA CACCGTCTTC TTGGCCTCCT GGCAGTTCTT 600 

TCTGCGAATA GTCCTCTCCC TGGCCAGTTG AATQGGGGAA GCTGCTGGCA CAGGAAGGAG 660 

AGGCGATCCC GGCTGAGGCT TAGGAAATTG CTQGAGCCGG CTCCAAQCAG ATAATTCACT 720 

GGGGAQGTTT TCAGAGTCAA ACATCATTCT GCCTGTKTTG GGGGCCAGGT GTGTCACACA 780 

AGCATCTCAA AGTCAAAAGC CATCTGQGGC TGCTGCTTCT CTTTCTCAGG CTCTGGGGAA 840 

AGGAATCTCC CTCTCCTCTC ACTTGATTCC AAGTGTGGTT GAATTGTCTG GAGCACTGGG 900 

ACTTTTTTTC TCTTTTCCTT GATGGACCAA CAGTGCAAAT GCAATCTCGC CATTTAACTT 960 

TCAGGTCGAT TTCCTTTCCT GATCAGACAT CTTTGTGCCC CCTTTAGGAA GGAAAAGAAT 1020 

ACACCTACGA TGTGCCAGGC ACTGTGTTAG GCGCTTTTAT ATAGATCCTC GTTAGGATGA 1080 

GACTAAGGGA TGAGGACATC TCTTTATAAA AGGCCCCTAA GTAATGGATA AACAGAAACA 1140 

CTTAGAGGTG AGAAGGTCTG TCTTCAAGAT CCAAQGTAAG ATTGCCTTCA GTCTGATGTT 1200 

TGTTCTCAAG GACITATCCC CTACAATATT CTCCCACTCC ATACTTCTCC TTCTACCCCA 1260 
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15 



20 



25 



CCATGTX5CTC CCGTGCACTC CTCAGATGGT CAGAGGGGTA ACCCAAGTCC TTAGAGAATT 1320 

TCGGGACCAA TAGAATATGT GATGTGTGAA TTTTCTTTAA AAAACTTAAG GAGTCTTTGC 1380 

TACCTTCTGC TTGTTGAGTT GTTTTGGCAT TCATATTAAA AGCCAGCATC TCACTATTTA 1440 

TTCACAGGTT GGGCTGTGTG TGTGCX3CATG TGTCTATRCA TTTCCAGGCG TGCTTGTGTC 1500 

CTCTAGCTTT TTAAAAGC3AA ACCCAGTCAT CTCACTATGA ATCTGGCATC TTCTTATGCT 1560 

TCTAGTGTTT TGGCCATACA TCAACCAftGG GGTTTAATTT ATCCAATGCT TGACGACATG 1620 

TTCAGGAGGG GCTGGATCAA ATTTTGAGAG GGTTATGGGA AAGGGAGGGG GAGAAGAAAT 1680 

TCACATTTAT TTTATTATTT ATTTTAAATG TTTACATCTT CTTTATGTTG TATCAAGCCT 1740 

GAATAGAAAC TGATAGCATT AAAATACTCC GTTCCTCTCT CTCTTCTCGC TTCCTTTTTT 1800 

TTTTTTTTTA AATTTAGGAT AACACATTTT TGTTTCTAAA GTGATTTGTG ATTTGTQCTG 1860 

TATAAACTGT ATAAAAGGTT CTGTTTTTAA AGGTGGATTT TCATTCCTCT GGGGACAGTC 1920 

GTCGCCAAGA CATCTACATT GTAAGAGAAC ACAGTGGAAG ATCCTGTCCT GATTCTCAAA 1980 

AATTATTTTC TCTGTATGAT TAAAAGT 2007 



30 



35 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1291 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

CTTTTAACCC TCCCCCTTCA CACACATACA TATCAGGTTG TTTTCTAGTT AAAAACCCAA 60 

GTAGCTCAGA TTCTACTTTA ATGTCAGTGC AGATTTGCAT TGAATCATGC CATTATGTTT 120 

TTTCTCATTT TTATGCTGTT GGGTCTTAGT TTTTAAATTG ATATAAAGAA CTCAGCAATG 180 

GTTTTATTTT CTACTCATAC TTAGGGTTTA GGAAACACTA CCACTAGTTA TCATTTAATC 240 

AACTTCAATG GTCTACTGAA ACAAAAATGG TAACTTTTCA TTAGTGGATT ATTTAGAGTT 300 

ATAGTAGTTG TTTCCAGAAA ACACTTCCTC ACAATTGTAC TTCCCAATCA AATCATGTGA 360 

TCATACAGTT ATTCCCATGA AAGGCAGAAT GTTTGTTTCA AAATTAATCT AGTTTTCTGT 420 

ACATTTAAAT TTCAGAAGGT GACAACTGGC TCTTTTCCAG TCTTCCTTCA TGTCAGTTTT 480 

CTGATAGACC ACTATTGGCA AACAGTATCT GTCAACTACC AAATGTGTAA AATTTTCTGT 540 

ATTTCACTTT GTCTTATTTG TAAATAGTGA ACTAAAACTT TTGGCAGATC AGCAACATTT 600 
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GCTGAGCCTG TTTTTTAAGC TAATGTGTAT TCTTACTAAT GTTCCTATCA AGAATGGATT 660 

TGTAATATAT GCTGTCTATT TCTAATGTTC ACATTCATAT TTTGAGGTTC TATCTTATTT 720 

TAATAGAGAA CAGACTTCTC AAAAAATCTT CAGAAGCAGC TTATTATTGA AATATCX5AAA 780 

TATTGAAATA AACCXX3GTGG GTTAGATTAC TCATCTGTCC ACCAAGTGGG ACATTTGCAT 840 

GGACTGGOGG CTTAAAOGAC TTAGAAGAGA CCTGTAAGTA AATCCTGAAA ATGAGCCAAT 900 

CCXrCACrTGA ATGGTTACTG GAGTAAACCC ACCTTTACCA CCCCAATTAC AGCACCCGAG 960 

GCXTGATAAAC CAACTTGGCT CTGGTTCATT TTTCTTTTCT TCATTTCTGA TGCTCAGATT 1020 

15 CAAAATGTGT GTTCTACACT GTTACAGGCT TCTCmTGT TTGATTAAAG ATTTTAGTCC 1080 

TACTTTTGTA TGGACACATT AGAATATTCA GAGACCAAAA TAGAAGAATT TGCTGTTAGA 1140 

TATTTTTCAG AAGTCAGCAG ATTTGTGGCA AATCATTTAT TTGCCTTTTT AAAAATTCAT 1200 

TTAAGCAGTT CAGAGAGTAG ACTACTCAGA AAATTATTTC ACGTAATTGT CTAAGAGGTC 1260 

AATAirnTT AATGCATATT GAATCAAATA A 1291 



20 



25 



(2) INFORMATION FOR SEQ ID NO; 137: 

3p (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1906 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
GGCACGAGGA CCTACTTTTG TAACAGACCA TGGTTGTGTC CAAGGTAAAA CCACAGTGAT 60 
40 ATTTTTGGAT GCTTTGTCTG CAATCTTGAC TTGTTTTTGC AGTATCATTA TTCAGACTTC 120 
AAATTGTGAA TCTTTTAAAC ATCTTGATAA TTTGTTGTTG AGAGCTGTTC ATTCTAAAAT 180 
GTAATGAAAT TCAGTCTAGT TCTGCTGATA AAGATCATCA GTTTTGAAAG GTTACTGATT 240 

45 

TTCCTCTTCC CTCTTAGTTT TTTACCCAAT ATATQGAGAA GAGTAATGGT CAATCTTAAC 300 
ATTTTGTTTT AATTGTTTAA TAAAGCTGCT GGGCAGTGGT GCAGCATTCC TACCTAGTGT 360 
50 CATAAAAGCA AAATACTTAC ATAGCTTTCT TAAAATATAG GAATGACATT ACATTTTTAG 420 
GAGAAAGTAA GTTGCTTTGC ACCGCCTACT TAATTCCTTT CCATATATTG TGATACAAAC 480 
TTTTGAATAT GGAATCTTAC TATTTGAATA GAAATGTGTA TGTATAATAT ACATACATAC 540 

55 

ATAAGCATAT ATGTGTGTGT GTGTGTGTAT ATATATATAT ATGCATGCTG TGAAACTTGA 600 
CTACACAACA TAAATCACTT TTTAAATTCC AGGAACGGGT AGTCTGACAC GGTGATTATC 660 
60 CTTTTGAGGC TGAATCCGTT ATTAACTTGT TATTTAGGTT TTACTCCCAG TAGCAAGGGA 720 
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TTCrAAGTTA GTTGCACTTA CATGATTATT GTGATTTAAA ACTAAGAATA AAGGCTGCAT 7 BO 

TTTCAAAGAT AAATTGGAAT TGCTGTTGGT GAAATAACAA CCAAAATACT GAATCTGATG 840 

5 

TACATACAGG TTTCTACAGG AAGAGATGGT ATAATTTACA ATTTGGAGAT TTAATAACCA 900 

GGGCTACCCA GAAAAAGTGA CTTGATAACA TGGTACCAAT AAGTAAGGGA TGCTCTCTCG 960 

10 GTTTGCTTTT GCCACTTTCA AGATTTTAAC TTCTCAQGTT ATTAATCAAA ATTATTGTAT 1020 

AAGTTAGCCA ATAGAATTTT TAC3GTTAAAA CAACAGATGG QGGGTTTGTG GAGTGTTTAA 1080 

TOPCATGGGC ATTTTTAGTA GCATAGACCC TTTGTTCTGC ATTTGAATCT TTCGTATATT 1140 

15 

TTTCTTTCAC AGTTAA1CTT CCXTPCCCCAA GTTTGCTATT CAAATCAACT GCCTGAATGA 1200 

CATTTCTAGT AGTCTGATGT ATTTTTCTGA GGAATAGTTT GTGATTCCAA TGCAGGTGTC 1260 

20 TTCATTACCA TTACCTCTAC ACTGCAGAAG AAGCAAAACT CXnTTATTAG AATTACTGCA 1320 

CATGTGTATG GGGAAAATAG TTCTGAAAGG CTAGAATGAT ACAAGTGAGC AAAAGTTGGT 1380 

CAGCTTCGCT ATGGAGTGGT GGCAATAATC TCTAAACATT CCAAAAGACC ATGAGCTGAA 1440 

25 

CCTAAACTCC CTTGGGAATC TGGAACAAAG GAATATGAAA ATTGCCATTT GAAAACTGAC 1500 

CAGCTAATCT GGACCTCAGA GATAGATCAG CCAGTGGCCC AAAGCCATTT CAAGTACAGA 1560 

30 AATTATAGAG ACTACAGCTA AATAAATTTG AACATTAAAT ATAATTTTAC CACTTTTTGT 1620 

CTTTATAAGC ATATTTGTAA ACTCAGAACT 6AGCAGAAGT GACTTTACTT TCTCAAGTTT 1680 

GATACTGAGT TCACTGTTCC CTTATCCTTC ACCCTTCCCC TTCCCTTTCC TAAGQCAATA 1740 

35 

GTGCACAACT TAGOTTATTT TTGCTTCCGA ATTTGAATGA AAAACTTAAT GCCATGGATT 1800 

rn Ti X.''l'm' GCAAGACACC TGTTTATCAT CTTGTTTAAA TGTAAATGTC CCCTTATGCT i860 

40 TTTGAAATAA ATTTCCTTTT GTAAAAAAAA AAAAAAAAAA AAAAAA 1906 



45 (2) INFX)PMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEDNESS: doxable 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

55 TCIGAACTAA TGCTAACAGA TCCCCCTGAG GGATTCTTGA TGGGCTGAGC AGCTGGCTGG 60 

AGCTAGTACT GACTGACATT CATTGTGATG AGGGCAGCTT TCTGGTACAG GATTCTAAGC 120 

TCTATXSTTTT ATATACATTT TCATCTGTAC TTGCACCTCA CTTTACACAA GAGGAAACTA 180 
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TGCAAAGTTA GCTGGATCGC TCAAGGTCAC 
CTCAGCTCCT CAGGTCAGCA AGTCTACTTC 
5 GTGCCTAGCT TTGGAAAGTC TAGAATGGGT 
GTTTCTGCCT CTTTTTGGAA AAGAAAACAA 
CAATAGCATG AGGTGAACAG GACGTAGTTN 

10 

AAACACTATC TTCCXyVTCTG TTTCTCAATG 
GGAGAGCAGC AGTTAAACCC GTGGATTTTG 
15 CACTAATTGG CTATGTCTCT GGACAAGTTT 
AACTITCACT TTCTATGTCT ACCTCAAAGA 
TAAAGGGTCT GCCAGATAGG AAGATGCTAG 

20 

AGAGTCTAAA ACCTACAGTG AATCACAATG 
AAAACTAGCC AGAAGTCTCT TTTTCAAATT 
25 TAATQGATAA TCTTATTTAT CTAAACTAAA 
TGGGATAAGA TAAATGAtXA CAGTACCTTA 
TGTAGGTAAG GACATTTTCT YTTTTTCAGC 

30 

GAGQGGGGTA GCATGCACCC AGCAGGGGAC 
GTGGGGTAGT GAGCTGCCTT TCTGTGATCG 
35 TTGCTATTCG CAGCTACATA CAACX3TGQCC 
AGTGCTGGTG CTGACTCTCA ATAGCCCCAC 
AATTAGGCAA CCTAAAATAT TGATGCTGGT 

40 

TGAAACTTAG AGTTATAATT CATGTATrAG 
ATATGTATAT ATGAAAGGGA GGTTATTAGG 
45 GTCGCACAAT AGGCCGTCTG CAAGCTGGGT 
GTTCAAAAAC CTCAAAACTG GGGAAGCTGA 
AGGCCAAGAG COCCTGGCAA CCAACCCACT 

50 

AACCTGGAGT CTGATGTCCA AGAGCAGGAA 
CAAGGTAGAC AGTGTCTACC ACCAYAGTGG 
55 GCTACCTGGA TCCCTGAAGT TGCCCTGGTC 
CTTTCCATTA CATGAGCTGT CTCAAAGCCC 
AAAAAAAAAA AAAAA 

60 



TTAGGTAAGT TGGCAAGTCC ATGCTTCCCA 240 

TCTGCCTATT TTGTATACTC TCTTTAATAT 300 

CCCTGQTGCY TTTTTACTTT GAAGAAATCA 360 

AGTGCAATTG TTTTTTACTG GAAAGTTACC 420 

AGGCCTTCCT GTAAACAGAA AATCATATCA 480 

CCTGCTACTT CTTGTAGATA TTTCATTTCA 540 

TAGTTAQGAA CCTGGGKTCA AACCCTCTTC 600 

TTTTTTTTTT TTTTTTTTAA ACCCTTTCTG 660 

ATTGTTGTGA GGCTTGAGAT AATGCATTTG 720 

TTATGGATTT ACAAGGTTGT TAAGGCTGTA 780 

CATTTACCCC CACTGACTTG GACATAAGTG 840 

ACTTACAGGT TATTCAATAT AAAATTTTTG 900 

GCTTCCTGTT TATACACACT CCTGTTATTC 960 

ATTTCTAGGT GQGTGCXnXST GATGGTTCAT 1020 

AGCTGTGTAG GTCCAGAGCC TCTGGGAGAG 1080 

TGAACTGGGA AACTCAAGGT TCTTTTTACT 1140 

GTTTCCCTAG GGATGTTGCT GTTCCTCTCC 1200 

AACCCCAGTA GGCTGATCCT ATATATGATC 1260 

CXZAAGCTGGC TATAGGTTTA CAGATACATT 1320 

GTTGGTGTGA CATAATGCTA TGGCCAGAAC 1380 

GGTTCrrcCAG AGGGACAGAA ITAGTAGGAT 1440 

GAGAACTGGC TCCCACAGTT AGAAGGCGAA 1500 

TAGAGAGAAG CCAGTAGTQG CTCAGCCTGA 1560 

CAGTGCAQCC AGCCTTCAGT CTGTGGCCAA 1620 

GGTGCAAGTC CTAGATTCCA AAGGCTGAAG 1680 

GAGTGGAAGA AAGCXTAGAAG ACTCAGCAAA 1740 

CCATACCAAA GAGGCTACCX3 ATTCCTTCCT 1800 

TCTGCACCTT CTAAACCTAG TTCTTAAGAG 1860 

TCCAATWAAT TCTCAGTGTA AGYTTCAAAA 1920 

1935 
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(2) INFORMATION FOR SEQ ID NO: 139: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICM^: SEQ ID NO: 139: 

NGCCCCCTTG GCACAAGTCA GATGAAGCAC GTTCTGCCGG GGAGQCCCTC AMCTTCCAGA 60 

15 

GAGGACAGAC ACAGATTTCC TGCTGGGGGA GGGAGGAGTC CACGCATCCT GATGCTGCCT 120 

GGAAGCTTAT TTTCCCGTGG CCAGGATGCA TTTCTCTGAG TGGAAACAGG TTCTTGCATG 180 

20 TQGATGTGTG TTTCCCCAGG CAGACGGCCC CTCTYTTCCC AGCACTTCCC TGCCTCCCCC 240 

AGGCCTCAGG CCAGCACCCA GTTCCTCCTC ACATGGCAGG TGAGCACAGA CTTCTAGTTG 300 

GCAGGAGCTG AGGAGGGTGA ACAAACCCCG AGGGAGGCCC GGCCCTTGCT CCCGAGTTGG 360 

25 

GGGGAGGGGG TGTGGCAACG TGCCCCCCGC AGAGGCCACG CATGTTTGAC CAAAGCCCTC 420 

ATTGTGGTCC GAGGACAGCC TTTTCCCCAG GCCTCARAGC ATTGCTCATC CGTGCCAAAC 480 

30 TGGGTAGGTG GATTTGAGCG GAAAGACTCC CAAAATGTGC CAAGAATTTC CCRGTCCCAG 540 

GCAGGGCAGG GGAAACTAAG GGCAAGCAGG ATACAGGGCG AGGGATGTGG CAGGTGAGGG 600 

GGCTCCCGCC TGTGCCCCTT CTCCTCACCA TGTCTCCCCC ACCCTGCCTC AGTTCTCCGT 650 

35 

TCCCCTTCAT CTCCGTCCCC CTCTTTGAAG CTGTCCCCAT CTCAGTGTCA GACCAGCCTT 720 

CTCCTCAKCT GACCACCCTC CTCTGACCSA CGCCCCCTCC TTGTCTGAAA AAAQGAGCCT 780 

40 TGAATGGTGG AGGGAGGCAG TGGGGAGAAA GGTCTCACCG GACAGGTTGG GAGAATGAGG 840 

TCAGCGGTGC TGGGGAACAG ATGGAGGGGG CAGTGGGGAC AGGGCTTGGG CAGACACCAG 900 

CAGGAATAAT TTGAAATGTG TGAGGTGACT CCCCGGAGGC CTTGGGCTTG GGCATTTGGG 960 

45 

AAAAGAATGA TGTCTGGAAG GGCTTAAGGG ACACAGTGGA CGAGGGGAGA GTCCTCATCT 1020 

GCTGGCATTT TGTGGGGTGT TAGTGCCAAA CTTGAATAGG QGCTGGGGTG CTGTCTTCCA 1080 

50 CTQACACCCA AATCCAGAAT CCCTGGTCTT GAGTCCCCAG AACTTTGCCT CTTGACTGTC 1140 

CCTTCTCTTC CTACCTCCAT CCATGGAAAA TTAGTTATTT TCTGATCCTT TCCCCTGCCT 1200 

GGTCTAGCTC CTCTCCAAAC AGCCATGCCC TCCAAATGCT AGAQACCTGG GCCCTGAACC 1260 

55 

CTGTAGACAG ATGCCCTCAG AATTQGGGCA TGGGAGGQGG GSTGGGGGAC CCCATGATTC 1320 

AGCCACGGAC TCCAATGCCC AQCTCCTCTC CCCAAAACTVA TCCCGACAAT CCCTTATCCC 1380 

60 TACCCCAACC CTTTGCGGCT CTGTACACAT TTTTAAACCT GGCAAAAGAT GAAGAGAATA 1440 
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TTGTAA 1446 



(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1109 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEtJNESS: doiible 

(D) TOPOLOGY: linear 

15 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

TTTTTTTTTT TTTGATATGA AATTGTCTTT CTCCATTGCA GAAATAAGCT AGGGAAACAC 60 

TAACCCAAAA ACTTTCTGTA GAGCTGTTCC TTTGGAC3GCA GCATCACTTA TTGGCAGTAA 120 

20 

AGACTCAGTA TAAAAGCACC AGCATCCCTA CTTGGGTGAT GGGGATTAAT TTTATAGCAT 180 

TCCATTTTCC TAGTGCCACA TGTGAAATTG GATTTTGATG ATCTTAATCT ATATTCTACC 240 

25 CTTATAATAA AAGATCAAAA GATATATCTC CTATC3AACAG ATTGGAGATA GGAGATGAAA 300 

AGTTGGGAGG ATGTCTTTAT TCTAATGTGA GGGTAGGGAA AATGTQGATA ACATTACTGG 360 

GGTGARGGAG GCATTGTTCT TTAGTTGGAG TTCTCATTTT TATTCTCCAG TACTGACTTG 420 

30 

TGGGGAAAGC ATACTTTTTC ACTGCCAGGT ACTGAATGCA GAGGCTCAGT GAAGTATATA 480 

TGTGGGAAGT GCATGCATTT CGTTTATTAG CAAACATAGC TGGATTAAGA CAAAGTTGTT 540 

35 GGTTTGGAAA GGGGTTAAAG CCTTAAGTGA ACAAATCTAG CTAACAGTGA ATGAACTAGG 600 

TAATATAACT TGCATATTTT TAATTTCCTT TGGTTAAAGG TCCCCCATAC TTCTCTGTTC 660 

GGAGACATGA GAAGTATGAT TACTTCAGTG TTACHTTTCT TAATTTTTTT TTTCCCCTAT 720 

40 

TTGTCCCTTG TCACTTTGTT GCAAGCTAGA AATCTGTGGG TTATACATAG GGCAGCTCTT 780 

TGTGAAAGTG GTTTATTCCA CTGGAGAAAG GGGATTGAAA ATCAGTTAGA ACCAATGTAT 840 

45 TTCTTGCCCC ACGGAACACT ATTCCTATAA GATAGCTGAA AGAAGCTGCT GTGAGGAGCT 900 

CAGCTCCAAA CACAGGATCA GCACCTTGTA TAGGAATTCC CATGAATTAT GACTTCTCAT 960 

TCTGTTTTAT CAGAGTGCAT ATATGTCCTA CTTCAGGAAA AGTAAAACAG TCATTTACGA 1020 

AAGAAAGTCA ATCTGTATCC TAAQCATTTT AATAAAAAGT TAAAACAAAA AATTAAAAGG 1080 

GACACTCGAG GGGGGGCCCG AAACCCAAT 1109 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 141: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) T0P0LCX3Y: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 
TAGGACTAAC TTAAATTCTT TTATTCATCT TTTATTTATT AAAAAATTTT ATTTCTTTGA 60 
10 ATTTTCCTGT AATTTCCTTA RGCTCTTCTA TAAAATGTTA TATTCATGTG AACCATACCT 120 
CATTATCCTT AACATTTACT CTCAAAAAGC 'TrTTTATTTT TATTTTTTTG AAGGTAGTTT 180 
TTCTGTC3TGT ACTCTGTAAC ATGATTTTGC TTTCAAATCA TTGTTGTGCC CCCATACAAA 240 

15 

ATGCCTTTTA TTTTTGAGGA TCGTC3GACTT TTTAGTATQG CATGAGTGTG CTAAAAGCCA 300 
GATATCTTTC CACATTCACT QGTGGCnTG ACACCTAGTT TTTAATCTCC CATCCTTACT 360 
20 TTAAACCCTG ACAGTGCAGT CCTCAGTCAG GGCCAGGACC GGGCTGAGGC CCTTTGTGGA 420 
GATGCTGCAC CACCAGCAGA AGGCTGAGAC CTGGTTACCT GTACCTGTTC ACTTGTAATA 480 
AAAAGAATTA TCTAAAA 497 

25 



30 



(2) INFORMATION FOR SEQ ID NO: 142: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEraJESS : double 
35 (D) TOPOLOGY: linear 



40 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

ATGAGGCAGA GGCAAGCTGC CTGCCAACCC CCTCCCTCAA GGAATGGCCT TGCCCAGGAA 60 

TGCCCACCAC ACATACCCTC TTdTITTTT CTAGTCAAAC TCTTGTTTAT TCCTTGQCTT 120 

GCCTCCCTCC TTTCCTCCCC TCTCAACCTT TTACTTCTGG TTTCTATTTC ATGGGATTTG 180 

45 GGGTTGAAGT TAAACTTACA ACAGTGCCGC CAACACCAAG TCTTGCAGGA AAAAAATACA 240 

AAGAAATTTA ACAAAAAAAA AAAAAAAAA 269 



50 



{2) INFORMATION FOR SEQ ID NO: 143: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 1269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 
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TTGATTGACT ATGGTCTCTC CGGCTACCAG GAAGAGTCTG (XGAAGTGAA GGCCATGGAC 60 

TTCATCACCT ccacagcx:at CCTGCCCCTG CTGTTCGGCT GCCTGGGCGT CTTCGGCCTC 120 

5 

nCCGGCTGC TGCAGTGGGT GCX3CGGGAAG QCCTACCTGC GGAATGCTGT GGTQGTGATC 180 

ACAGGCX3CCA CCTCAGGGCT GQGCAAAGAA TGTGCAAAAG TCTTCTATGC TGCGGGTGCT 240 

10 AAACTGGTGC TCTGTGGCCG GAATGGTGGG GCCCTAGAAG AGCTCATCAG AGAACTCACC 300 

GCTTCTCATG CCACCAAGGT GCAGACACAC AAGCCTTACT TGGTGACCTT CXJACCTCACA 360 

GACTCTGGGG CCATAGTTGC AGCAGCAGCT GAGATCCTGC AGTGCTTTGG CTATGTCGAC 420 

15 

ATACTTGTCA ACAATGCTGG GATCAGCTAC CGTGGTACCA TCATGGACAC CACAGTQGAT 480 

GTGGACAAGA GGGTCATGGA GACAAACTAC TTTGGCCCAG TTGCTCTAAC GAAAGCACTC 540 

20 CTGCCCTCCA TGAICAAGAG GAGGCAAGGC CACATTGTCG CCATCAGCAG CATCCAGGGC 600 

AAGATGAGCA TTCCTTTTCG ATCAGCATAT GCAGCCTCCA AGCACGCAAC CCAGGCTTTC 660 

TTTGACTGTC TGCGTGCCGA GATGGAACAG TATGAAATTG AQGTGACCGT CATCAGCCCC 720 

25 

QGCTACATCC ACACCAACCT CTCTGTAAAT GCCATCACCG CGGATGGATC TAGGTATQGA 780 

GTTATGGACA CCACCACAGC CCAGQGCCGA AGCCCTGrPGG AGGTrOGCCCA QGATGrTTCTT 840 

30 GCTGCTGTGG GGAAGAAGAA GAAAGA1CTG ATCCTGGCTG ACTTACTGCC TTCCTTGGCT 900 

GTTTATCTTC GAACTCTGGC TCCTGGGCTC TTCTTCAGCC TCATGCCTCC AGGGCCAGAA 960 

AAGAGCGGAA ATCCAAGAAC TCCTAGTACT CTGACCAGCC AGGGCCAGGG CAGAGAAGCA 1020 

35 

GCACTCTTAG GCTTGCTTAC TCTACAAGGG ACAGTTGCAT TTGTTGAGAC TTTAATGGAG 1080 

ATTTGTCTCA CAAGTGGGAA AGACTGAAGA AACACATCTC GTGCAGATCT GCTGQCAGAG 1140 

40 GACAATCAAA AACGACAACA AGCTTCTTCC CAGGGTGAGG GGAAACACTT AAGGAATAAA 1200 

TATGGAGCTG GGGTTTAACA CTAAAAACTA GAAATAAACA TCTCAAACAG TAAAAAAAAA 1260 

AAAAAAAAC 1269 

45 



50 



(2) INFORMATION FOR SEQ ID NO: 144: 



(i) SE3QUENCE CHARACTERISTICS: 

(A) LENGTH: 1944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 144: 



60 



AAAAGGCAAA CTATAGGATA ACACAGAGCC CTTTTTGAAA ATAAATTGGC ATTGGAGTGT 



60 
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TrrACXTTCT AGCTGrrrTA CTTAGAATGT AACATATGCT GCCTACCCAC CTCAAAATCT 
CICTACTCCA AGAGGGCCCT QGCKCTCTGC TTTCCATATT CACGTTTC3GC CACyCTTGTA 
GTCCCAAAGA AGAGCATOGG TQGCAGATGG TAGGGAATTG AACTGGCCTG TGCAATGGGC 
ATCGAGCACA AGGGGTCACA GCATGCCTCC TGCCTTACCG TGGCAGTACX3 GAGACAGTCC 
AGAACAIXSGT CTTCTTGCCA CGGGGTGTTG TOGTCTCTGG TGGTCCTGCA TGTCTGTGGC 
TCACCTTTAT TCTK3AAACT GAGGTTTACC TGGATCTGGC TACT6AGGCT AGAGCCCACA 
GCAGAATCGG GITCGGCCTG TCGCCCCCAA ACTAGGGGGT GTGGGTTCAT CACAGTGTTG 
CCTTTTOTCT CCTAAAGATA GGGATCTACT TTTGAAGGGA ATTGTTCCTC CCAAATAAAT 
TTCCTTTACC TTOGrCCTTT CTTTTGTGCC AGTATTCAAG TGGTATAGCT CTGAGCAGGG 
TCACATPrGG CCAAACCIGA CACTCTCTTG CTGCATTCTC CTTrOGCAAA CATCAGGGTC 
AGAATTCAGG ATAGCCCTTC CTAQGGCACT GGACTTTCTG GCATGGGGGC TGTGTTTGCA 
GAAGTTATrr TCATCTTACC TOGAGAGrTGT CTAGAGGCTG CTCTGAGGCT GAGGTGTGTT 
CCCCCTTCCC TGC?rrCCAGC TOTCAGAGGG ATACCATCCT AGGGTCTGGG AATCCAAGGC 
CACGAGACTC CrrGGTrTGT GC3TCCGAGAT CCTGTACTAA GGAGGGTCTG GCCAGAGGAA 
CAGACCAGCT TTTOCACAAT GAAGCQCAAG GGAACAAGTG GTTTGCCTGG TGTCCTACCT 
GTCCTCAACC TGGTCCTOTG GGCCATTQAA AAGTTAGATC TOTGATCTCT GGGGTTlTrG 
TGGCTTTGTT CAA-rcCTTCC ACTCTAGGGC AGGCAGAGCA GTCTATACTC TCCCAAGCCT 
GCTTGACCTC CAAGTAGAGC TGATACAGAG ATCTGTGAAT ATTGTGATAG AAATTCTTTG 
GTATTCATAC ATTTCAGCTG CAAGTCAGCA ATTTCCCAGG TACCATGTAA GCTATAAAAC 
AGTCATTCrr AAAGACAGAG GATAGCTGTG ACTCATGGGA TCATGAGGTC CATGGCTGGT 
TCCAGCTTCC CTTTTTCCTT CCTCAGGTTT TGTCTCTTCC TGTGTTGTCC CCAGCAAGGG 
AGAGACTGTG GGGTGGAITC GGAGAACAGA TTAGGAGTAT AGCAAATGAA CCCAGAATGG 
AACA(?K3GGG AGCTAACTGT GAATGAGGAG AGTACCTGCT GCAGGACCTG GRGGTCAGGT 
C?IX3AATGCTG TAITCGCACA GGGAATAAAT ATCCTGGCGT CTGGAGCCTT CACCTCTCCX3 
TCAAC?rCCTT CCTOTGATAC TQCCATGGCA CAGGATCTGA GTTGCAGCTC TGCACCCTAA 
ATCACACCCT GGGCATTGTC TGGGCTQCAG GGCTGCCAGG rrCTGTACTT GTGTCCAGCT 
GTGGCCCTGG ATGCTGGAGC TGGAGGGTTT TCTGTGCTCA GACTGTAGCC TGTAGCTCTT 
GGCCTGTCTA GAGCCCCCTC CTCTGCCCTC AGTGGCTGTC GTTTGTTAAC ATCATCAGGA 
AGATGGGAAA GGTCAGGCAG AATTTTTCTG CCCTACAAAG GGTGGAAGAG AAAGGACACA 
GTATTTTCAT GAATTTACCA TATATCTTTG TTTTTCTTCA ACGAAAAAGT TAATTGAGGC 
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AATGTCATCT GCTCAAAGTT GAGTGGTTTA TTCACAATAA ACTGTAAGTT TCTGATTATA 1920 
AAAAAAAAAA AAAAAAAAAA AAAG 1944 

5 

(2) INFORMATION FOR SEQ ID NO: 145: 

10 {i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
TCGACCCACG CGTCCGGQGT GCGCAACGGG GAGTTCCGGC TQGAGACCCG TGCTCTQGGC 60 
20 CGGCGCCTTC ACCATGGCCT CGGCAGAGCT GGACTACACC ATCGAGATCC CQGATCAGCC 120 
CTGCTQGAGC CAGAAGAACA GCCCCAGCCC AGGTGGGAAG GAGGCAGAAA CTCGGCAGCC 180 
TGTGGTGATT CTYTTQGGCT GGGGTGGCTG CAAQGACAAG AACCTTGCCA AGTACAGTGC 240 

25 

CATCTACCAC AAAAGQGGCT GCATCGTAAT CCGATACACA GCCCCGTGGC ACATGGTCTT 300 
CTTCTCCGAG TCACTGGGTA TCCCTTCACT TCGTGTTTTG GCCCAGAAGC TGCTCGAGCT 360 
30 GCTdTTGAT TATGAGATTG AGAAGGAGCC CCTGCTCTTC CATGTCTTCA GCAACGGTGG 420 

CGTCATGCTG TACCGCTACG TGCTGGAGCT CCTGCAGACC CGTCGCTTCT GCCGCCTGCG 480 
TGTGGTGGGC ACCATCTTTG ACAGCGCTCC TGGTGACAGC AACCTGGTAG GGGCTCTGCG 540 

35 

GGCCCTGGCA GCCATCCTGG AGCGCCGGGC CGCCATGCTG CGCCTGTTGC TGCTGGTGGC 600 
CTTTGCCCTG GTGGTCGTCC TGTTCCACGT CCTGCTTGCT CCCATCACAG CCNTCTTCCA 660 
40 CACCCACTTC TATGACAGGC TACAGGACGC GQGCTCTCGC TGGCCCGAQC TCTACCTCTA 720 
CTCGAGGGCT GACGAAGTAG TCCTGGCCAG AGACATAGAA CGCATQGTGG AGGCACGCCT 780 
GGCACGCCGG GTCCTGGCGC GTTCTGTGGA TTTCGTGTCA TCTGCACACG TCAGCCACCT 840 

45 

CCGTGACTAC CCTACTTACT ACACAAGCCT CTGTGTCGAC TTCATGCGCA ACTGCGTCCG 900 
CTGCTGAGGC CATTGCTCCA TCTCACCTCT GCTCCAGAAA TAAATGCCTG ACACCTCCCC 960 
50 ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG GGGCCCQGTA CCCAATTCGC CCTATAAAGG 1020 
T 1021 



55 



(2) INFORJIATION FOR SEQ ID NO: 146: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1285 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 

GGCACGAGGA GGGCCACGGC AGCCATCGCG CTTTGCAGTT CGGTCTCCTG GTGTACGGCC 60 

AACGCCAAGT AGGGGATTGC GTTCCCTCCA GTCGCAGACC CTATCAGATT TGGATATGTC 120 

CTTCATATTT GATK3GATTT ACAGTGGTTT CAGCAGTGTG CTACAGTTTT TAGGATTATA 180 

TAAGAAAACT GGTAAACTGG TATTTCTTGG ATTQGATAAT GCAGGAAAAA CAACATTGCT 240 

15 ACACATGCTA AAAGATGACA GACTTGGACA ACATGTCCCA ACATTACATC CCACTTCCGA 300 

AGAACTCACC ATTGCTGGCA TGACGTTTAC AACTTTTGAT CTGGGTGGAC ATGTTCAAQC ^ 360 

TCGAAGAGTG TCGAAAAACT ACCTTCCTGC TATCAATGGC ATTGTATTTC TGGTGGATTG . 420 

TGCAGACCAC GAAAGGCTGT TAGAGTCAAA AGAAGAACTT GATTCACTAA TGACAGATGA 480 

AACCATTOCT AATOTGCCTA TACTGATTCT TGGGAATAAG ATCGACAGAC CTGAAGCCAT 540 

^5 CAGTGAAGAG AGGTTGCGAG AGATGTTTQG TTTATATGGT CAGACAACAG GAAAGGGGAG 600 

TATATCTCTG AAAGAACTGA ATGCCCGACC CTTAGAAGTT TTCATGTGTA GTGTGCTCAA 660 

AAGACAAGGT TACGGAGAAG GCTTCCGCTG GATGGCACAG TACATTGATT AACACAAACT 720 

CACATTGGTT CCAGGTCTCA ACGTTCAGGC TTACTCAGAG ATTTGATTGC TCAACATGCA 780 

TAACTTGAAT TCAATAGACT TTTGCTGGTT ATAAAACAGA TGTTTTTTAG ATTATTAATA 840 

35 TTAAATCAAC TTAATTTCAA TGAGAATTGA AAACTGATTC AAGTAAGTTT GAGTATCACA 900 

ATCTTAGCTT TCTAATTCCA TAAAAGTACT TGGTTTTTAC AGTTTATAAT CTGACATCAC 960 

CCCAGCGCCA TTTGTAAAGA GCAACTTTCC AGCAGTACAT TTGAAGCACT TTTTAACAAC 1020 

ATCAAACTAT AAACCATATT TAAAAGCTCA TCATGTTAAA TTTTTTATGT ACTTTTCTGG 1080 

AACTAGTTTT TAAATTTTAG ATTATATGTC CACCTATCKT AAGTGTACAG TTAATAATTA 1140 

45 GCTTATTCAA TGATTGCATG ATGCCTTACA GTTTTCAATA ACTTTTTTTC TTATGCAAAC 1200 

GTCATCCAAT AAAACAAACT CTAATGTTTG GCAAAAAAAA AAAAAAAAAA NTCGAGGQQG 1260 

GGCCCGTACC CAATTCGCCC TAAAG ^^^^ 

50 



30 



40 



(2) INFORMATION FOR SEQ ID NO: 147: 

55 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1386 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
60 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GGCACGAGGT GGCGCAGGGG TCAGTGGTTC TCTCGGGTCT CGGGACAGGT GAGCACCCTG 60 

5 

ATGAAGGCCA CGGTCCTGAT GCGGCACCTG GGCGGGTGCA GGAGAITCGTG GGCGCCCTCC 120 

GCAAGGGCGS CGGAGACCGG TTACAGGTGA TTTCTGATTT TRACATGACC TTGAGCAGGT 180 

10 TTGCATATAA TGGAAAGCGA TGCCCTTCTT CTTACAATAT TCTGGATAAT AGCAAGATCA 240 

TCAGTGAGGA GTGTCGGAAA GAGCTCACAG CGCTCCTTCA CCACTATTAC CCAATTGAGA 300 

TCGACCCACA CCGGACCGTC AAGGAGAAGC TACCTCATAT GGTGGAATGG TGGACCAAAG 360 

15 

CGCACAATCT CCTATGTCAG CAGAAGATTC AGAAGTTTCA GATAGCCCAG GTGGTTAGAG 420 

AGTCCAATGC AATGCTCAGG GAGGGATATA AGACCTTCTT CAACACACTC TACCATAACA 480 

20 ACATTCCCCT TTTCATCTTT TCTGCGGGCA TTGGTGATAT CCTGGAAGAA ATTATCCGAC 540 

AGATGAAAGT GTTCCACCCC AACATCCACA TCGTGTCTAA CTACATGGAT TTTAATGAAG 600 

ATGGTTTTCT CCAGGGATTT AAGQGCCAGC TGATACACAC ATACAACAAG AACAGCTCTG 660 

P 

TGTGTGAGAA CTSTGGTTAC TTCCAGCAAC TTGAQGGCAA AACCAATGTC ATCCTGCTGG 720 

GAGACTCTAT CGGGGACCTC ACCATGGCCG ATGGGGTTCC TGGTGTGCAG AACATTCTCA 780 

30 AAATTGGCTT CCTGAATGAC AAGGTGGAGG AGCGGCGGGA NCGCTACATG GACTCCTATG 840 

ACATCGTGCT GGAGAAGGAC GAGACTCTGG ATGTGGTCAA CGGGCTACTG CAGCACATCC 900 

TGTGCCAGGG GGTCCAGCTG GAGATGCAAG GCCCCTGAAG GCGCAGGCTN CCAGNCCGCC 960 

35 

TGCAGGCCGT GGTGAGGAGG GGCGCCTCCC CAGAGTCTGC TCCCCCGTGA ACACAGAGCA 1020 

GANGCCAGGG TGGCCAGCAG TGGCTGGGTC CTTCCGCGCC CCTCCGTCCT CCTTTCCCTG 1080 

40 AGCACCTTCA TCACCAGAGG CTTGAAGGAA CCCCGCCATG TGGCAGGGCA CAGGCACTGT 1140 

TCCTGGTGAA CCTTGGACCA CAGCATGTCA GTGCTCTAGG GATTGTCTAC TCCAGGGATT 1200 

TTCTTCAAAA TTTTTAAACA TGGGAAGTTC AAACAAATAT AATGTGTGAA ACAGATCAAA 1260 

45 

ATTTTTAAAA TGAAAAAAAA GCTGCTCTGA TTCAGGGGAT GTGGGTCGGG GTAGAACCTG 1320 

GACCTCTTGG CCTGGGGGCA CATQGGATGC TTCTAGGAAC ACAGTTTGAG AACCACCAAA 1380 

50 AAAAAA 1386 



55 (2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2098 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO; 148: 

5 AGCCCTTCTC CCCGCGCTTC GGACTCTGAC ATCTTAAGGC TCCACGC?rCG TGTCCTTGTC 
TGGGTXSAGGC CAOCTCTCTG ATCCAAGGrTT CCTC3GAACTG ACACAGGAAG GGGCTGTGAA 
CCCTAACTGG GTCTMATCTC CTCCRACCGA GGCTTCIMAC CCTGGAGATG GCAfSTTACTC 
CTCGCCA'I\3G TTGCTGAGCA TC3GGCAGACC AGTC3GAGGCC ACCCTACTGT GTTATCTGCG 
CCTKZRATCA AGTCAGACCC TTTCGGGAGAA CGGGCTGTGG ATGAAGGAGT GGACTQCAGC 

15 CTTGGCCTAG CCACTC5GGCT GGGATCTTCT GGGTCATGTG ACTGTGTATC CAGGAGCAGA 
AACTTOTATT CTCAGGATTC AGGATCTACC CAGCACCAAA GATGTATTTT CAGGAGAACA 
GACCTAGAAA TOGGCCTCTC TGGCATTTCA GAGTCAQGCA AAGCAGGCAG GGCCAGGGAG 

20 

CrrCTCTOGG TCTACACAAG AAGGTTCCTG TGAGGGCTAT CAGTTGTTGC CTTCTAGCTT 



60 
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180 
240 
300 
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GCTGGTAACT TTQGCGCCTC CGCCAAGCCC TGCCAGACTC CCCTGGCTGT GATGGCATTC 
25 TCTGCCATCC TCCCTTCTCC CCAQCCTCTG CAGGATGCCC • TCCCTACCCA VJCTYTfCCTG 

GGCCTTCCCT GTCCACTCGG CTGGATTCAT GTTCAAACCA CTQGACTGGC AGGGCAACGA 720 

CTTCITCCCA CCTCAAGATC AGGTCCTCGC CCCCTTGTCT TGGCATAAAA ACACCTTTAA 780 

30 

AGCATCAGCC ATOreCTTCT TIGCCCrrcT CTGTCCTGTr CCAATCTTCT GCCTCCCAGT 840 

CACTCCCTGG GGACTATX3GG ATCACTOTCC CCCCACCTGT GTGGCCACAC CATGTGTCCT 900 

35 GTCAATCCAG AACTCCCTCT GAGCTCCAGG CTGACCACAG ATCAGCCACA GCCTGATGCC 960 

TGCAGCCCCA CTnGCTCAC CCTICCCCTC CCCTCCTCCT TCCTTCCACA CAGCAAGCCT 1020 

Accrmrcc atccatgctc accatagccc ccttccttgt gacctggacc ctccattgta loso 

40 

cctggctoag actgtcagcc tcctcgagga gtggggtcca ccitcttctt gccctatgca 



1140 
1200 



G-roCAAGClT CACTTCTCAC CCAGCAAGGT TGACTCATCT GCCTCCATGT CTCTGGGGCT 
45 TTGCTOTTGC CCTCAAACCT AGCTGGGCTG CTTCrTGCTCC CAGCTTGCTT CCCCCTCCTC 1260 
GGATGTCCCT TTCCAGGCCC CTCTCGTTCC TCCGGCACCA GTGTCCTTGG CTCCCATGGC 1320 

AAGCTCATCA GGGGCITCTA CCCTCGTCAC CAAGCATGGT AGCAGCTGCC TGCAITGTAT 1380 

50 ,^^n 
CTCCATCTCG TCACTGCAGG TQCCAACCCT TCATCCCCCA TGTTTTCCTG GGCCATQGAG 1440 

GGCTCACCTC CCTTTCTCGG GAATCTCGCT GAGCTGTGGT AACCAGCTAC ACCCCAGGTG 1500 

55 CTCTTTCCAT GGTGOTGCCT GCTCATCTTG CTGATGCAAA CTAGGAAGTT AGGCTGCATC 1560 

TCGGAGT3GC TTrCGCTGGA GAGGTGCTTT GCTGTCTCTC AGACTCAGTC ACTGTGTTCC 

CTCCCCGCCT CTCTTATCTC CATGGCTGTT TGCAGCTCTC CCAGGTACTT TGGGGTCTGA 

60 
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GCTQGAATTC CTTTGTGGTT TGCTCTTCTG CTTCTCACTC 
CAAAGGGAGA GTGGCATCCC TGCTGCTGCT GTGCCAGACC 
GACCCTAACC CTCCAGCTCA GCTCTGTACA CXTTGACCCTG 
ACTGTAATCC CTGACACCAG TAAAACCAAA AGGACTCTTG 
GGGTTACCTA CTCTGCCAAG TGAGGACAAA CTGCTAQGCT 
GAGAAACATT AACAATAAAA ATTTGTAGTA AACATAACCT 
AAAAACTYGG GGGGQOGCCC GTAACCCATT GGGCCCTTNG 



TTGTATTAAG AAGGATTCCA 
AGAGTTTCCT GAGGGGCCCT 
TAAATGAGTG GGGTTTGCTG 
GGGGCTCAGT GTGAGAGCCA 
GTATCCCATA ATTTCAGGAT 
CATGANGACT AAAAAAAAAA 
GGGGGGNGTT TTAAAATT 



<2) INFORMATION FOR SEQ ID NO: 149: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 
TCGACCCACG CGTCCGAACT GAGGCGGCGG CGGGAGCCGG TTGC^GTCTG GTCTTCGCGT 
CGGCCCCGCG GACCAGACGC TGCCCCCGGC GCGGGGAGAA GATGGTGCCK AGCGGCCTCG 
GGCCCGCCAC GCGCCGCCAC GAGTGAGCCC AGCGCGACCG CGGGCGTCCG CCGAGCAGCT 
GQCCCGGCTG GGCCCGGGGC GCGCANTGCC CGCCGGGGCG GGGTGGAGCT GATCAGAATA 
ATGTTCAGCA TCAACCCCCT GGAGAACCTG AAGGTGTACA TCAGCAGTCG GCCTCCCCTG 
GTGGTCTTCA TGATCAGCGT AANGCCCATG GCCATAGCTT TCCTGACCCT GGGCTACTTC 
TTCAAAATCA AGGAGATTAA ATCCCCAGAA ATGGCAGAGG ATTGGAATAC TTTTCTGCTA 
CGGTTCAATG ATTTGGACTT GTGTGTATCA GAGAATGAAA CCCTCAAGCA TCTCACAAAC 
GACACCACAA CTCCGGAAAG TACAATGACC AGCGGGCAGG CCCGAGCTTC CACCCAGTCC 
CCCCAGGCCC TGGAGGACTC GGGCCCGGTG AATATCTCAG TCTCAATCAC CCTAACCCTG 
GACCCACTGA AACCCTTCGG AQGGTATTCC CGCAACGTCA CCCATCTGTA CTCAACCATC 
TTAGGGCATC AGATTGGACT TTCAGGCAGG GAAGCCCACG AGGAGATAAA CATCACCTTC 
ACCCTGCCTA CAGCGTGGAG CTCAGATGAC TGCGCCCTCC ACGGTCACTG TGAGCAGGTG 
GTATTCT^AG CCTGCATGAC CCTCACGGCC AGCCCTGGGG TGTTCCCCGT CACTGTACAG 
CCACCGCACT GTGTTCCTGA CACGTACAGC AACGCCACGC TCTGGTACAA GATCTTCACA 
ACTCCCAGAG ATGCCAACAC AAAATACGCC CAAGATTACA ATCCTTTCTG GTGTTATAAG 
GGGGCCATTC GAAAAGTCTA TCATGCTTTA AATCCCAAGC TTACAGTGAT TGTTCCAGAT 
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GATGACCGTT CATTAATAAA TTTGCATCTC ATGCACACCA C3TTACTTCCT CirTGTGATG 
GTGATAACAA TGTTTTCCTA TGCTCTTATC AAGGGCAGAC CTAGCAAATT C3CGTCAGAGC 
AATCCTCAAT TTTC?rCCCGA GAAQGTQGCT TTGGCTGAAG CCTAATTCCA CAGCTCCTTG 
TTTTITCAGA GAGACTCAGA GAACCATAAT CCTTGCCTGC TGAACCCAGC CTGGGCCTGG 
ATCCTCTGTC AATACATTAT CTTGCGATGT TGGGTTATTC CAGCCAAAGA CATTTCAAGT 
GCXriCTAACT GATITOTACA TATTTATAAA AATCTATTCA GAAATTGGTC CAATAATGCA 
CGreCTTTCC CCTOQGTACA GCCAfiAGCCr TTCAACCCCA CCTTGGACTT GAGGACCTAC 
C7IGATCQGAC GTTTCCACGT GTCTCTAGAG AAGGATTCCT GGATCTAGCT GGTCACGACG 
ATOTPTTCAC CAAGOTCACA GGAGCATTGC GTCOCTGATG GGGTTGAAGT TTGGTTTGGT 
TCTTGTTTCA GCCCAATATC TAGAGAACAT TTGAAACAGT CTGCACCTTT GATACX3GTAT 
TGCATTTCCA AAGCCACCAA TCCATTTTGT GGATTTTATG TGTCTGTGGC TTAATAATCA 
TAGTAACAAC AATAATACCT TTTTCTCCAT TTTGCTTGCA GGAAACATAC CTTAAGTnT 
rrrTGTTTTG TrnTCTTTT TTTGTTTTTT GTTTTCCTTT ATGAAGAAAA AATAAAATAG 
TCACATTTTA ATACTACCAA AAAATGGACA AAAAAAGTCG AGGGGGG 
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(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GACGCTGACG AGAGAAGGCC TCTTCCTTGA GGGrTGGTGC 
GGATTACGCC AACTCGGATC CGQCGGTCGT GAGGTCTQGA 
CAACGCTGTT CAGCAGGAAG TAAAATCTCT TTGTQGCTTG 
AGAGGAAGCT CTTICTGGGG CTQGTGAGCC CTGTGACATC 
GGATGCCCAG GAGGAAAGCA TCCATGAGAG AACTGTCTCC 
ACACAAAGAA GAACTGGACG GGGCTGGAGG AGAAGAGTAT 
GCTGGCCTCC TATATCCGTC CTGAGGACAT TGTGAATTTT 
CTCGACTGTC ACTTCCACTG CTGCCTTTTG GACCAGGTTG 
GATGCTTCCC TGCCTTTGCG TCTGCGACCA GAGTCAATGG 



; 150: 

TCTGTTQCAG TGACCGTGGC 60 

CGAGTCAAGA AAGCCGTAGC 120 

GAAGCCTCTC AGGTTCCTGC 180 

ATCGACAGCA GTGATGAGAT 240 

AGAAAAAAGA AAAGCAAGAG 300 

CCCATGGATA TTTGGCTATT 360 

TCCCTGATTT GTAAGAATGC 420 

TACCGAAGCA CTACACGCTG 480 
AGAAGCrcCG CTGTCTCCQG 540 
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GCTTGTGTCA TCCGATCTCT GTACCATATG TATGAGCCAT TTGCTGCTCG AATCTCCAAG 600 

AATCCAGCCA TTCCAGAAAG CACCCCCAGC ACATTAAAGA ATTCCAAATG CTTACTTTTC 660 

TGGTGCAGAA AGATTGTTGG GAACAGACAG GAACCAATGT GGGAATTCAA CTTCAAGTTC 720 

AAAAAACAGT CCCCTAGGTT AAAGAGCAAG TGTACAGGAG GATTGCAGCC TCCCXyiTCAG 780 

TACGAAGATG TTCATACCAA TCCAGACCAG GACTGCTGCX: TACTGCAGGT CACCACCCTC 840 

AATTTCATCT TTATTCCGAT TGTCATGGGA ATGATATTTA CTCTGTTTAC TATCAATGTG 900 

AGCACGGACA TGCQGCATCA TCGAGTGAGA CTGGTGTTCC AAGATTCCCC TGTCCATGGT 960 

15 GGTCGGAAAC TOCGCAGTGA ACAGGGTGTG CAAGTCATCC TGGACCCAGT GCACAGCGTT ' 1020 

CGGCTCTTTG ACTQGTGGCA TCCTCAGTAC CCATTCTCCC TGAGAGCX3TA GTTACTGCTT 1080 

CCCATCCCTT GGGGGCAGCC TCGAGTGTAG TCCATTAGTA ATCAGATTCC AGTTTGGACA 1140 

20 

GGGTGGCTGG ATTGTATATC TCGTTAGTAA TGTACATGCT CTTCAGGTTC TAGGGCTCCT 1200 

GTTAGGGGAG GGAGAAATGT TGAATCAAGA GGGAAAACAA CTACTATGAT TTATAAACAT 1260 

25 ATITTAATGT AAAAATTTGC ATTTAAAAGG AGTQQCCCTG TTTTCTGTGT TAAAACCCCA 1320 

TTTGGTQCTA TTGAGTTTGT TCITTATTCT TTTATCCCAG TGAAAATTGT TGATCTTGCT 1380 

GTAGGGAAAA ATTAAACTCT TTGAATCTCC AAACAAGGAA GTTTCAGCAT TCCCTTATGG 1440 

30 

ATCAGAQGAA CCTTAGAGGC CTGAAATTGT TGCTTCCAGT TTAGCTGCCC CTCAAATTCA 1500 

AGTGAATATT TTCCCTTCTC CXTTTTACCCT TCTCCAGAAA TAAAQCAGGT GACAGGGTTT 1560 

35 CAGAATCTT 1569 

40 (2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1540 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

50 CCCACGCGTC CGGAAGGATT GACCAGTTAA CCAACATCTT AGCCCCCATG GCTGTTGGCC 60 

AGATTATGAC ATTTGGCTCC CCAGTCATCG GCTGTGGCTT TATTTCGGGA TGGAACITGG 120 

TATCCATGTG CGTGGAGTAC GTCCTGCTCT GGAAGGTTTA CCAGAAAACC CCAGCTCTAG 180 

55 

CTGTGAAAGC TGGTCTTAAA GAAGAGGAAA CTGAATTGAA ACAGCTGAAT TTACACAAAG 240 

ATACTGAGCC AAAACCCCTG GAGGGAACTC ATCTAATGGG TGTGAAAGAC TCTAACATCC 300 

60 ATGAGCTTGA ACATGAGCAA GAGCCTACTT GTGCCTCCCA GATGGCTGAG CCCTTCCGTA 360 
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CCTTCCGAGA TCGATOGCTC TCCTACTACA ACCAGCCTC3T GnTCTGGCT GGCATGGGTC 

iTCcrrrccT ttatatcact otccigogct ttgactgcat caccacaggg tacgcctaca 

^ CTCAGGGACT GAeTOGGTrC CATCCTCAGT ATTITGRTGG GAGCATCAGC TATAACTOGA 
ATAATCQGAA CIGTAGCnT TACITGGCTA OGTCGAAAAT GTOOmGGT TCGGCAGGTC 
10 TCATCTCAGG ATIGGCACAG CnTCCTGTT TCSATCrTGTG TGTGATCTCT GTATTCATCC 
CTOGAAGCCC CCreGACTTO TCCGnTCTC CTTrTGAASA TATCCGATCA AGGTTCATTC 
AAGGAGAOrc AATTACACCT ACCAAGATAC CTGAAATTAC AACTGAAATA TACATGTCTA 
AT^BGCHCTAA nCTCCTAAT AITOTCCCGG AGACAAGTCC TGAATCTGTG CCCATAATCT 840 

900 

960 
1020 
1080 



420 
480 
540 
600 
660 
720 
780 



CTCTCAGICT GCrorrrcCA GGCCnCATTO CTGCTACSAAT OGCrrClTrGG TCCTTTGATr 
20 TAACTOIGAC ACACHTOCTC CAAGAAAATG TAATTGAATC TGAAAGAGGC ATTATAAATC 
GTOTACAGAA CK:CA1<3AAC TATCnCTTG ATCTTCTGCA TTTCATCATG CTCATCCTOG 
CTCCAAATCC TCAAGCrrrr GGCrrcCTCG TATTGATTTC AGTCTCCnT GTCGCAATGG 
GCCACATTAT CTATTTCOGA TITCCCCAAA ATACTCTGGG AAACAAGCTC TTTGCTTGCG 1140 

1200 

1260 
1320 
1380 



CyrcCKaOGC AAAAGAflOTT AGGAAGGAAA ATCAAGCAAA TACATCTCTT GTTTGAGACA 
30 CnTTAACICT TQCTATCCro TTACTAGATT ATATAGAGCA CATGrTGCrTA TmGTACTC 
CAGAATTCCA ATAAA-TOGCT GQOICTnTC CTCIOTmT ACCACAGCTG TGCCITGAGA 
ACTAAAAGCT (HTTAGGAAA CCTAAGTCAG CAGAAATTAA CTOGATTAAT TrCCCTTATG 
TICflGGGCCA TOGRAAAAAA ArTGOGAAAA GGAAAAACTC AGTTTrAAAT ACGGGAGACT 1440 
ATAA-TOGATA ACACTORATT CCCCTATTTC TCATGAGTAG ATACAATCTT ACGTAAAAGA 1500 

1540 

40 GTOGTTAGTC ACC^TCAATTC AGTTATCATT TGACAGATTC 



45 (2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1719 base pairs 
(B) TYPE: nucleic acid 
5Q (c) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

55 TACTTATCAG GTCAArTCGA AATAAGAACA CCATTrTACT GGGTCTAGGA TTTCAAATAT 

TACAGTWGC A'TCGTAIGGC TITCGnCAG AACCTTGGAT GATGTGGGCT GCTGGGGCAG 

TAGCAGCCAT GTXTTAGCATC ACCTTrCCTC CTGTCAGTGC ACTTGTTrCA CGAACTGCTG 

60 



60 
120 
180 
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ATGCTGATCA ACAQQGTGTC GTTCAAGGAA TGATAACAGG AATTCGAGGA TTATGCAATG 240 

GTCTGGGACC GGCCCTCTAT GGATTCATTT TCTACATATT CCATGTGGAA CTTAAAGAAC 300 

5 TGCCAATAAC AGGAACAGAC TTGGGAACAA ACACAAGCCC TCAGCACCAC TTTGAACAGA 360 

ATTCCATCAT CCCrTGGCCCT CCCTTCCTAT TTGGAGCCTG TTCAGTACTG CTGGCTCTGC 420 

TTCTTGCTTT GTTTATTCCG GAACATACCA ATTTAAGCTT AAGGTCCAGC AGTTQGAGAA 480 

10 

AGCACTGTGG CAGTCACAGC CATCCTCATA ATACACAAGC GCCAGGAGAG GCCAAAGAAC 540 

CTTTACTCCA GGACACAAAT GTGTCACGAC TGAAATCAQG AAGATTTITC TATCAGCACC 600 

15 CAGGTCTTAG TTTTCACCTC TAGTTCTGGA TGTACATTCX: ATTTCCATCC ACAGTGTACT • 660 

TTAAGATTGT CTTAAGAAAT GTATCTGCAT GAACTCCGTG GGAACTAAAG GAAGTGQGAA 720 

CTTAGAACCA GACAGTTTTC CAAAGATGTT ACAATTTCTT TTGAAAAACC TTTTGTTTAT 780 

20 

TAGCACCAAT TTCTYGCCAC TAAGCTATTT GTTTTATTAT ACATCCTTTA ATTAAAAACT 840 

ATATATGTAA CTTCTTAGAT ATTAGCAAAT GTCTCTGCTA CCATTTCCTT AAGGTGTTGA 900 

25 GCTTTAACTC TATGCTGACT CAGTGAGACA CAGTAGGTAG TATGGTTGTG GACCTAITTG 960 

TTTTAACATT GTAAAATTTT GAGTCAGATT TTAATATTGT AAAATCTTGG GTCAAATAAT 1020 

TCAAAGCCTT AATGCAGATG CACTAAAACA AAGAAATGGT AAATGAATTG TTTGCATTTA 1080 

30 

AAAAAAAAAA CTCTTAAGAA AACTGTACTA AATCTGAATC ATGTTTTGAG CTTGTTTGCA 1140 

GTACTTTTAA ACATTATTCA CTACTGTTTT TGAAGTGAGA AAGTATCAGC CATTTAGCAT 1200 

35 . TTAAGTTGGG GTAITTAGAG CCTGTAATCT AAATGCTGQC TCAAATTTAT TCCCCAQCTA 1260 

CTTCTTATAC CACTATTCTT TTAATGTTTG CATAATCATA AGCACCTCAA CACTTGAATA 1320 

CATAATCTAA AAATTATATA GTAAAGCTGG TAGCCTTGAA AATGTCAGTG TGATATCTAT 1380 

40 

TATGTAGATA AATATATATA GTGGCCTTTC AGGACTGTCA CAGTAACACT TTATTTACAG 1440 

AGCTAATGTT TGTCCTAAAT TTTCAGGACC CTAGAGGAGA GCTTTATACA ATTACCGATG 1500 

45 TGAA1TTCTC TAAAGTGTAT ATTTTTGTGT CCAGTTATAT TATTTAAAAA AGTGTTACTT 1560 

TGTAAAAATT GTATATAAAG AACTGTATAG TTTACACTGT TTTCATCTTG TGTGTGGTTA 1620 

TTGCTTAATG CTTTTTAAAC TTGGAACACT CACTATGGTT AAATAAGGTC TTAAAAGAAA 1680 

50 

TGTAAATATT YTGTTAATAA AGTTAAATAT 1TTAATGAT 1719 

55 

(2) INFORMATION FOR SEQ ID NO: 153: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 863 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEKIESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 153: 

5 

GGCACGAGGG AAGCCGGGAC GATGTCCGCA TGACAACCGA CGTTGGAGTT TGGAGGTGCT 60 

TGCCTTAGAG CAAGGGAAAC AGCTCTCATT CAAAGGAACT AGAAGCCTCT CCCTCAGTGG 120 

10 TAGGGAGACA GCCAGGftGCG GTTTTCTGGG AACTGTGGGA TGTGCCCTTG GGGGCCCGAG 180 

AAAACAGAAG GAAGATGCTC CAGACCAGTA ACTACAGCCT GGTQCTCTCT CTGCAGTTCC 240 

TGCTGCTGTC CTATQAOCTC TTTGTCAATT CCTTCTCAGA ACTGCTCCAA AAGACTCCTG 300 

15 

TCATCCAGCT TGTGCTCTTC ATCATCCAGG ATATTGCAGT CCTCTTCAAC ATCATCATCA 360 

TTTTCCTCAT GTTCTTCAAC ACCTTCGTCT TCCAGGCTGG CCTGGTCAAC CTCCTATTCC 420 

20 ATAAGTTCAA AGGGACCATC ATCCTGACAG CTGTGTACTT TGCCCTCAGC ATCTCCCTTC 480 

ATGTCTGQGT CATGAACTTA CGCTGGAAAA ACTCCAACAG CTTCATATQG ACAGATGGAC 540 

TTCAAATGCT GTTTGTATTC CAGAGACTAG CAGCAGTGTT GTACTGCTAC TTCTATAAAC 600 

25 

GGACAGCCGT AAGACTAGGC GATCCTCACT TCTACCAGGA CTCTTTGTGG CTGCGCAAGG 660 

AGTTCATGCA AGTTCGAAGG TGACCTCTTG TCACACTGAT GGATACTTTT CCTTCCTGGA 720 

30 TAGRAGGCCA CATTTGCTGC TTTCCAGGGG AGAGTTGGGC CCTATGCATG GGGCAAAACA 780 

GGTGGGATTT TCCAAGGGAA GGGTTCAGAA TTAGGCNTGT TGTTTCAGCC ATTTCCAAGG 840 

AAGGGGAAGG GTTTCCCTNC CCT 863 

35 



(2) INFORMATION FOR SEQ ID NO: 154: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

AACAGCAAAA AAGAATGATT TCTTCTGAAA TTGTGGAACA TGAGGATTCA AGTTTTTATT 60 

50 

TTGTTACTAG GTGCTGGAGG AACATCCCAG TTCACAAAGC CCCCATCTCT TCCTCTGGAG 120 

CCAGAGCCTG CGGTGGAATC AAGTCCAACT GAAACATCAG AACAAATAAG AGAGAAATAA 180 

55 GAATAGAATG AATGACCCCA AAATARQGTT TTCTTGQGCG AGGATGTGCT GGATTAGGAA 240 

AGGTGACATG ACACAGGCAG AGCAGAGTGG CACCCACCAC AGAATACAGT GTGTGTTATT 300 

ACGAGGAGCC AGCAGTTGAG CCTAAGGTCC TTCTACCTAC CTGGTATTGG CATTTGAGGT 360 
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CX3GAAACCCT CTACTGCCCC ATAAGCCAGG 
GAACTGGCAG CAAGQCTTGA GGCCTTATGT 
5 TGTCTGCTTT CAAAAGGACT TTTCTCTCCT 
ACAGCTGAGA CAGACCTCTG CTGAGTAGCT 
GTGATCCTCA GGTTGTGAGG TTTATTAGTC 

10 

ATCCAACTTT AATAGTATAC ATTTAAAAGA 
CAAGCXTTGCT QAGTATTGCA GCTQCATTTG 
15 TGTATTTTGT TCTTGAGAGG GGTCAGTCTA 
CAGCCCAGGG CTTGTCTGGA TCAGCACCAA 
AGATGAGTAA TTGTTATTGA AGATAGTCAG 

20 

ACTATGTGTC CTTTTTAGAA TAAAGATTAC 
TTCAGGTATA AAAACAAGAG ATTATAATAA 
25 CTCGTGCCGA ATTCCCTGCA G 



AAAAGTGAAA AGAGAACACA GTTCCTTTAA 420 

ATGTAGCTGA GTCAGCAAGG TACATGATGC 480 

AGCTGACTGA CTCCTTCCTT AGTTCAAGGA 540 

CTGTGATGAC AAAGCCTTGG TTTAACTGAG 600 

CCCAAGGCAA ACACAAATAT TAGATTAATA 660 

AAAAAAACAA AAGCTCTGGA AGNTTGAGGC 720 

CCCAAAGGGA ATCCAGAACA AGTCCCTCCC 780 

GAAGCTAGAT CCTATCAGGA TGAGGAGCAG 840 

CGATTTTAAA GAAAAAAGGA AGAGTTrCTT 900 

TGATAACCAC TGACCAGATG CTATCAATAC 960 

ATATCATCAT TCCTTTGGGG AAAATTGTTA 1020 

AAAANTAAAA GAACCCTAAA AAAAAAAAAC 1080 

1101 



30 (2) INFORMATION FOR SEQ ID NO: 155; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2031 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SBQ ID NO: 155: 

40 CAATTAACCC GTTTGAGGCC TAGGTTGTTT GGCAAGCCCC NGGCCTAAAG TTTTAATTCG 60 

GCAGAGCCAA GGGCCTGAAA GGAAGGGAAA QGQGAGGGTA GCGGGAGGGT AGCAGGTGAG 120 

nCCTAGGGC TGGAAGG?rTT AGCAGCAGCC TGGTGCAGTG CCCTGTCATC AAGACAAACC 180 

45 

CACGGTCCTC CTGGGTGCCT ACCAAGCTTG GTTTGTACAA AAGCAAQGTG GGAGTCTATT 240 

TTTGTACATG AGATACATCA CACTTACCTG TGGGCCAGTA TTGTGAAGTG AGTCTGAGTT 300 

50 GTTTACACTG ATGCCTTCCC TGCCCACCAC AAATTGTGTA CATAGTCTTC AGAATGATAC 360 

CACCCCTTTC CCCAGCTCCC AACCAAGAGC TGGTTCTAGG CCTGTGTTAT ATGTCATATT 420 

TAGCGTTTTT ATATATGACC TTTGATTTCT GTTGTTTGTA TTTTAGCACA GTGTATGCAC 480 

55 

CTTCATTTAA ATACATCTGT GTGCATACAG ATACGCATAT ATGTGTGTGC GTATGCATAT 540 

ATCTCTCATC TGTAGTTTCC AAGAGTTCAG CTGAAGCAGA TGGAGTCCTG CAGCCCAGGA 600 

60 GACACCCTCC ATCCCTGCTA ATAGTGTTTG CCACAAGTAT TAGTGAGTCT TCCTTATTAA 660 
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10 



TiOTTTCATT TCAGAAGACT GAAGCAAAGC ^^AGTGTT IGCTCTTTCT rTGGCAGCTA 
;,,,K5AGG^ n^GGATGAC TTCCTCTOTT CCOXIAAGCTK^ CACTI^ CCATCTCKX: 
AGTATTAAGC CCCCTi™ CTIX^C^A dXTTSTCI^T GCCH^ OIK^I^IK^TA 
CnCAdCTTG CAIXXSCrrCC A.<.lKrrGC^ TOKSGCATTT GGGGATAAGT GCIXSAACCAG 

;«x:A'mx3CA CTiroriTCA GGCcrcom; ccaaix;atag atcactccig ttcacci^ 
Gcrr^xnccr mccrr^ ncTcrrGGA agaggaaagg Acrcroa^ 

GGCOIAGGCT GA<^GATC AGCTCCAGCT a3CTCAT<5GC CrTCTTAGAG CAGAGAGAGG 



720 
780 
840 
900 
960 
1020 
X080 



AGTAOXmZAT TTTACTAAGT TCCTAAACAA ACATTTATGC AGGCAACACT CXTTIGCAGAT 1140 
CCAGAAAC^ AGGCACAATA GGGrTAlGAC nOCTCAAGA ATATGTAGCT GCTAGGGGGT 
20 AAATCAAGGC AT^CAATTT CIGnCAGCG GGCAGGAATA GGClGnGAAT IX^CTAGCACr 

rnrmTAA GCAArrAcrr rm^criGT iK:caxnGAAA gigcaagagg cgtacacctt 

TCCCAAA1K5T AGACTAGAAT CIGCAGGATX. CCACCCACl^ TATAGTTCTG CTTTCCCAGA 
GAGGAAGAAC nTTAGAAAC CAAAOX^OCT TAAl^AT ^CCACCCC .X^GCTTI^C 1440 
GGGTAGAAAA nCACAGTAG GAAOGATI^ TAAGAGAGAG TGCTIGGAAC CAIK^GGTTAA 
30 CAGGAAAGGC TACCTAACTT CACATA.^ CAACCAGAGC AGCCACCAAG CArTACTTAG 
CAGCAGGAAA AOGATI^AT TO^AGTICCr CHX^OCCAA AACHGAGGCA CCAl^rTCTT 
rro^^TO CCACCrCAAG CCTOaaCGCG GTIGGCIKIACA CCICTTAATC CCAGCACTTT 
GGGAGGCCGA GGCGGGCGGA OXrACCGGAGT CGGGGAGITr GAGACCAGCC IGGACCAACA 1740 
TGGGAGAAAC CCCAI^A CCTAAAAATA CAAAArTAGC CGGGCOT GGCATGCGCC 
40 TATAA1.MCA GCTAC™ AGGGYTGAGG CAGGRGAATT GCTTGAACCC RGGAKGGCGG 
AGOmGCGG riGAGTIGAG GATCGTCCCA TTGCACTTCC GGGCCTIGGG GCAACAACAG 
CAAAAAVICC GT^TTCAAMW MRlGCCGAAT TCGATATCAA GCTTATCGAT ACCGrTCGACX: 
TCGAGGGGGG GCCCGGTACC CAAT^CGCCC TATAGNGATC GT^^^ ^031 



1200 
1260 
1320 
1380 



1500 
1560 
1620 
1680 



1800 
1860 
1920 
1980 



50 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1981 base pairs 
55 (B) TYPE; nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

60 
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381 



CCroCACCCT CSAGCCCTTCA CCTCTCCGAG TTCCCCCCAG GTrGGCTTCC TTCGATTCCT 
XrrCITOGTA TCAACCTTTC ATTOGAAGAA CAACCCCCTC TrTCTCAACC TCAATAATGA 
GCTCACrore GAGGAGCAGC TCGGGCACAG CTCMCCGTyA TCGTCATTOT TACCCC(XAA 
GftCCQCAAAA ACTCTOTOTC GACACAGGAT (MACXTTCAG OCCAGATCCT CSCACSCAGCTT 
CyiGC?rcCTCG CAGCTCAAGC CXTTOCCCATG TTAGAGAAGC AGCTCATGGA TCCCCC3GQGA 



CTCTGTCTCC 



TTCCTGCTGC CAGGACCTIGT GCCTAGACTT AGCATGCCCT TCACTGCAC3T 



60 
120 
180 
240 
300 



CCroOGGACA TCAGGACACT OTKCOGCCG CCCTTOGACA TTTACGflCGT GCTGATTCGC 360 
cromCCTC C3CCATATCCC C30GC3CACCGC AGGCTTGTC3G ACTOSCCAGY TGCCTCCTTC 
15 TCCCGGQGCC TOCTCAGCCA GCCGC3C3GCCC TCATCCCTCa TGCCCCyrGCT GGCjmATGAT 
CCTOCTCAGC TCTATCTCAC GCAGCTCflGG GfiGGCXTtTTO GOGATCTGGC CCTTrrCTTC 
TATCACCAGC ATCCTTCGAGA GCJTCATTGGT GTCCTCTGC3A AGCCCACCAG CTTCCAGCCG 
CAGCCCTTCA AC3QCXTCCAG CACAAAGGGG CGCATGGTGA TGTCTCGAGG TC3G0GAGCTA 660 
CTAATGGIGC CCAATOITCA AGCAATCCTG GAGGACTTTG CTGTGCTGGG TGAAC3GCCTG 
25 GIGCAGACTC TCC3AGGCCCG AAGTOAGAGG TCGACTGTOr GATCCCAGCT CTGGAGCAAG 
CTOTAGACGG ACAGCAGC5AC ATTOGACCTC TAGflGCAftGA TGTCftGTAGG ATCACCTCCA 
CCCTCCTTOG ACATOAATCC TOCATCCSaGG GCCTGCTOQC T8AACATGCT GAATCATCTC 
^ CAACAAAACC CteCCCCAAC TTICTCTCTC ATGCTCCAQC ATTC5GGGCAG GG0CATC3GTG 960 

GCCCATOPAG TCICCTOGGC CTCACCATCC CAGAACSAGGA GTGGCSAfiCCA GCTCAGAGAA 
35 GGAACTOAAC CCAGCSAGATC CATCCACCTA TTAGCCCTCG GCCTGGACCT CCCTCCGATT 
TCCCACTCCT rTCTTAGrcT TCTTCCAGAA ACAGAGAAGG GGATCTGTGC CTGGGAGAGG 



420 
480 
540 
600 



720 
780 
840 
900 



1020 
1080 
1140 
1200 



GICAGGCCTT TAGA-TCGCAC CCAGCGAAAA -TCTCGCCCTT CTGAGTCACA TCACCGACAC 1260 
TCAGCACHGG AAAGGGGCTA TATOTCTATC AATAGACCAC ATTGAAGGAG CACAATGCCC 

45 TCcrarerro atcccacttc ccagggtgga gacagtqgaa aagaaccgag gacaggaaag 

GATTOOCTAG GTOAAGGGCT CAGGGGACTG GTAGTCACCC AATCTK3GAG AGGTGCAAAA 

agcactoggg gctacccctt agctocatct gccctggctg tttgcccgtt catctcacaa 

ACreCXaCTA CTATOTACCT GCAC?roGGGT TGCAGAGATG GGGGAGACTC AAGTCTTACT 1560 
CCXX»GGAGC TCCXaGGGCC CAAGGAGGAG AATGCTGCCT CCTTTCAGTC TQGTCTACAC 

55 ccAcnrcro ctagccictc TCCTrccTCT aattctggct GrmrccAG actcagctca 

AATAOreCCC CTCCTTAAGC CCATCCCTCG CCCCCAGCCT GAGGTGATCT TTCCCTCCTC 
TCAACTATTA GAGCAGTTAC TOKHOTTCA CnTCGTTrGG CAGGCACACA CAGTGGCATA 

60 



1320 
1380 
1440 
1500 



1620 
1680 
1740 
1800 



wo 98/39448 PCT/US98/04493 

382 

AATTCTATK3 TTTTGAACTC TCSATTTAAAA TTAAATTGCA GCTQGGCGTG GTGGCTCATG 1860 
CITCTAATCC CAACACTTAG GGAGTMAGGR GAATCACTTG ASCYCAGGAG TYCTAGACCA 1920 
5 ATCTCGGCAA MAGAGAGACC CCA-TCrTCTTT TAAATAAAAA GTTAAATTGC TTAAAAAAAA 1980 

1981 



10 



25 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 157: 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
,(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GAATTCGGCA CGAGCGCGGC CATGGCGCTC CTGCITrCGG TGCTGCGTGT ACTGCTQGGC 
GGCTTCTTCG CGCTCGTCGG GTTGGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTCG 
GAGCGGATGA ATCCCCrcrr CGIGCAGTrT GCTGAGGTGT TCCCGCTGAA GGTATTTGGC 180 
TACCAGCCAG ATCCCCTCAA CTACCAAATA GCTGTGGGCT TTCTGGAACT GCTGGCTGGG 
30 TTQCTGCrGG TCATCGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTT CTTGATTCTG 

CTCATGATGG GGGCTATCTT CACCTTOGCA GCTCTGAAAG AGTCACTAAG CACCTGTATC 360 
CCAGCCATTC TCTCCCTCGG GTTCCTGCTG CTGCTGAATG TCGGCCAGCT CTTAGCCCAG 
ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTGG 
AAGTAGAGCA TCTCICTCTC TTTATCCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540 
40 ACAGACTCTA •TCATCTTOrT ACCAGTATAA TATCCAGQGT CAGCCAGTGT TGAAAGAGAC 600 
ATTTTGTCTA CCTCGCACTG CTTrCTCTTT TrAGCTTTAC TACTCnTTG TGAGGAGTAC 660 
ATCTTA-rcCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAQC AGAAAAGAAA 720 
TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 
TACCTCIGAA CimTTCTG TGCCTTTAAA CAGATATATA TTTTTTTTWA ATGAAAATAA 
50 AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 
AAAAAAAAAA CTCGA 



55 



(2) INFORMATION FOR SEQ ID NO: 158: 



60 
120 



240 
300 



420 
480 



780 
840 
900 
915 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 2117 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 158: 

AGAGCGAAGC GAGGGTGGCG CQGGTCCGGG CATC3AAGCTG GGCOSGGCCG TGCTGGGCCT 60 

GCTGCTGCTG GCGCCGTCCG TQGTGCAGGC GGTQGAGCCC ATCAGCCTGG GACTQGCCCT 120 

10 

GGCCGGCGTC CTCACCGGCT ACATCTACCC GCGTCTCTAC TGCCTCTTCG CCGAGTGCTG 180 

CGGGCAGAAG CGGAQCCTTA GCCGGGAGGC ACTQCAGAAG GATCTGGACG ACAACCTCTT 240 

15 TGGACAGCAT CTTGCAAAGA AAATCATCTT AAATGCCGTG TTTGGTTTCA TAAACAACCC 300 

AAAGCCCAAG AAACCTCTCA CGCTCTCCCT GCACGGGTGG ACAGGCACCG GCAAAAATTT 360 

CGTCAGCAAG ATCATCGCAG AGAATATTTA CGAGGGTGGT CTGAACAGTG ACTATGTCCA 420 

20 

CCTGnTGTG GCCACA1TGC ACTTTCCACA TGCTTCAAAC ATCACCTTGT ACAAGGATCA 480 

GTTACAGTTG TGGATTCGAG GCAACGTGAG TGCCTGTGCG AGGTCCATCT TCATATTTGA 540 

25 TGAAATGGAT AAGATGCATG CAGGCCTCAT AGATGCCATC AAGCCTTTCC TCGACTATTA 600 

TGACCTGGTG GATQGGGTCT CCTACCAGAA AGCCATGTTC ATATTTCTCA GCAATGCTGG 660 

AGCAGAAAGG ATCACAGATG TGGCTTTGGA TTTCTGGAGG AGTGGAAAQC AGAGGGAAGA 720 

30 

CATCAAGCTC AAAGACATTG AACACGCGTT GTCTGTGTCG GTTTTCAATA ACAAGAACAG 780 

TGGCTTCTGG CACAGCAGCT TAATTGACCG GAACCTCATT GATTATTTTG TTCCCTTCCT 840 

35 CCCCCTGGAA TACAAACACC TAAAAATGTG TATCCGAGTG GAAATGCAGT CCCGAGGCTA 900 

TGAAATTGAT GAAGACATTG TAAGCAGAGT GGCT6AGGAG ATGACATTTT TCCCCAAAGA 960 

GGAGAGAGTT TTCTCAGATA AAGGCTGCAA AACGGTGTTC ACCAAGTTAG ATTATTACTA 1020 

40 

CGATGATTGA CAGTCATGAT TGGCAGCCGG AGTCACTGCC TGGAGTTGGA AAAGAAACAA 1080 

CACTCAGTCC TTCCACACTT CCACCCCCAG CTCCTTTCCC TGGAAGAGGA ATCCAGTGAA 1140 

45 TGTTCCTGTT TGATGTGACA GGAATTCTCC CTGGCATTGT TTCCACCCCC TGGTGCCTGC 1200 

AGGCCACCCA GGGACCACGG GCGAGGACGT GAAGCCTCCC GAACACGCAC AGAAGGAAGG 1260 

AGCCAGCTCC CAGCCCACTC ATCGCAGGGC TCATGATTTT TTACAAATTA TGTTTTAATT 1320 

50 

CCAAGTGTTT CTGTTTCAAG GAAQGATGAA TAAGTTTTAT TGAAAATGTG GTAACTTTAT 1380 

TTAAAATGAT TTTTAACATT ATGAGAGACT GCTCAGATTC TAAGTTGTTG GCCTTGTGTG 1440 

55 TGTGTTTTTT TTTAAGTTCT CATCATTATT ACATAGACTG TGATGTATCT TTACTGGAAA 1500 

TGAGCCCAAG CACACATGCA TGGCATTTGT TCCACAGGAG GGCATCCCTG GGGATGTGGC 1560 

TGGAGCATGA GCCAGCTCTG TCCCAGGATG GTCCCAGCGG ATGCTGCCAG GGGCAKTGAA 1620 
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GTGTTTAGGT GAAGGACAAG TAGGTAAGAG GACGCCTTCA GGCACCACAG ATAAGCCTGA 1680 

AACAGCCTCT CGAAGGGTTT TCACCTTAGC AACAATQGGA GCTGTGGGAG TCATTTTGGC 1740 

CACACTGTCA ACATTTGrTA GAACCAGTCT TTTGAAAGAA AAGTATTTCC AACTTGTCAC 1800 

TTGCCAGTCA CTCCGTTTTG CAAAAGGTGG CCCTTCACTG TCCATTCCAA ATAGCCCACA 1860 

CGTGCTCTCT GCTGGATTCT AAATTATCTG AATTTTGCCA TATTAAATCT TCCTCATTTA 1920 

TACTATTATT TGTTACX3TTC AATCAGAATC CCCGAAACCT CCTATAAAGC TTAGCTGCCC 1980 

CTTCTGAGGA TGCTGAGAAC GGTGTCTTTC TTTATAAATG CAAATGGCTA CCGTTTTACA 2040 

15 ATAAAA1TTT GCATGTGCAA AAAAAAAAAA ANAAAAAAAA AAAATCCCGG GGGGGGGCCG 2100 

GTAACCAATT TGNCCCC 2117 



10 



20 



(2) INFORMATION FOR SEQ ID NO: 159: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

TGTTCCTTAA TCCCTTTTCT AAAAAGGGGG GAAAATCCGG ATGGATTTTA GQGATTQGTC 60 

TGGTGTCAGC TGTGTTTTAT TGCACACCTA AATCCTGATT ATAGQCmT CATTTCTCCG 120 

35 

CAAAGCCTTT ATTTTGGCAG TTAAGCCAAA TGTGTTTTCC AGAAAGTTAG TTATTTTCTC 180 

CTCTTTCTTT CCTTTCTITC CTCCCTTTTT CCCGTCTGAC CCCAAACGTT ATTGTCCAAA 240 

40 CATGACTGGA CAGCAOCTTT TGTTTCTTGA CCCTGTAATA TGACAGTCTG CTAATATTGA 300 

CAGAAGGTGC AGTTTTTGGG TTATAGTCGT GATTTTCGCT AATCAATCAT ATTAGCAGGA 360 

AAAAAAAKGA CTTGTTTCTG TTGTACTTGA GTCTTAAGAA AAAGTGGCCC ATAGTTTAGT 420 

45 

GGACAATTTC CAAAGGCTTT AGTACCACCT GTATTTCAAA ATGGGGGACC CAAACTCCCG 480 

GAAGAAACAA GCTCTGAACA GACTACGTGC TCAGCTTAGA AAGAAAAAAG AATCTCTAGC 540 

50 TGACCAGTTT GACTTCAAGA TGTATATTGC CTTTGTATTC AAGGAGAAGA AGAAAAAGTC 600 

AGCACTTTTT GAAGTGTCTG AGGTTATACC AGTCATGACA AATAATTATG AAGAAAATAT 660 

CCTGAAAOGT GTGCGAGATT CCAGCTATTC CTTGGAAAGT TCCCTAGAGC TTTTACAGAA 720 

55 

GGATGTGGTA CAGCTCCATG CTCCTCGATA TCAGTCTATG AGAAGGGATG TAATTGGCTG 780 

TACTCAGGAG ATGGATTTCA TTCTTTGGCC TCGGAATGAT ATTGAAAAAA TCGTCTGTCT 840 

60 CCTGTTTTCT AGGTGGAAAG AATCTGATGA GCCTTTTAGG CCTGTTCAGG CAAATTTGAG 900 
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TTTCATCATG GTGACTATGA AAAACAGTTT CTGCATGTAC TGAGCCGCAA GGACAAGACT 960 

GGAATCGTTG TCAACAATCC TAACCAGTCA GTGTTTCTCT TCATTCACAG ACAGCACTTG 1020 

5 

CAGACrrCCAA AAAACAAAGC TACAATCTTC AAGTTATGCA GCATCTGCCT CTACCTGCCA 1080 

CAGGAACAGC TCACCCACTG GGGCAGTTGG CACCATAGAG GRTCACCTCC GTCCTTATAT 1140 

10 GCCAGAGTAG AGTACTGACC AGCAAAATGG AGAAGATCAG AGAATGCAGC AGCAGmTT 1200 

' mi.Tii;m ' tcttaccact ttattctttc agagtttaaa gaaaatggac tcatgcacag 1260 

AACACTATGC ATTTTGAAAC TTGTTCATCC TGGATTTTTT TAAATCATTT TTATCTCAGA 1320 

15 

ACTTAAACAA AAATTAGATG TCGTGCACGG ACTGTGTGAA AGAAGATGCT TTGCATATTT 1380 

GCTGCACTGC ATCAGTATCT TACTAAAAAT GTGAAATGAA AGGACTATTG TACACTGAAA 1440 

20 TGCTTAAATG TATCTGAAAG CACAAGGTGA TACTCATTTT TATGGTCTTC CX!ATTTGTGC 1500 

TOGTTTTTGC CTCTTTGACA TCTGTCATCA GTATTTAGAG GGTGAGAAGT GAATGTAACA 1560 

GGTATAAATA ACATTTTTAA AAACAATAAC TTTGCTATAA TCACAGTTGT TCCAGAGCAC 1620 

25 

TGTCAGATAC ATTCTAATGA CCAGAACTGG TTTAAAAAAA GAAAATACAA CCATGGGAAA 1680 

GAAATCTTAA ATGAAAAACG CATCTCATTG TAGGCATTTT TGCCTCATAT TTTACTGGGC 1740 

30 CATGirrCTT TCCTGGTACT CATGTATTTT irmTCCAG ATCTCTTTCC CCAAGTTGCT 1800 

ATTGTAAGAG TATTCTGCTG CGTGTGGATG CAGTTATACA CATTAAAGCA GATCTGGAGT 1860 

CTGAAGTAGC TATAAAGCAG CTATAAAACA GAAATACATG CATAGCTGCA GAAACCATGA 1920 

35 

TAGGTAGAGG ACTTTTCTTT TGGTTTTGTT TTGrrTTGTT TTGTTTTGTT TTTGGTTTTA 1980 

CAGAGAAGAG ATTTTTATTA CAAAGAAAAA AATTCCAGTG AATTGTGCAG AAATGCTGGT 2040 

40 TTTTACACCA TCCTAAAGAA AAACTTTACA AGGGTGTTTT GGAGTAGAAA AAAGGTTATA 2100 

AAGTTGGAAT CTTAAATTGT AAAATTAACC ATTGAGTGTC AAAGTTCTAA AAGCAGAACT 2160 

CATTTTGrPGC AATGAACATA AGGAAAGACT ACTGTATAGG TTTTrTTTTT TTCTCCTTTT 2220 

45 

AAATGAAGAA AAGCTTTGCT TAAGGGTTGC ATACTTTTAT TGGAGTAAAT CTGAATGATC 2280 

CTACTCCTTT GGAGTAAAAC TAGTGCTTAC CAGTTTCCAA TTGTATTTAG CTTCTGGTTG 2340 

50 GAATTTGAAA AAAAAAGAAA AAAAGAAAAA GAAAACCTAA ATAAAATAGG TGAAA 2395 



55 (2) INFOFMATICW FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2120 base pairs 

(B) TYPE: nucleic acid 
60 <C) STRANDEENESS: double 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 
CCCCGGATAC CQCCTGACC3T AGTGCCAATC ACACCTCTCG CGTCTCGGCG CCTCGGAGGC 
TAATCAGGAC GCCTOG0C5AA ACGCAGTAAC GGATTTCCGG GTGGACCTTC GCTTTACGGC 
TCGTCAGTTC TTCCGCCCAA CCCAGAGC5AA GCGGGAGAGC AGTTTACGAC AGCGCCGGTC 
OTGrrTACGG CQGCGCCCGC TGCGCC3CGCA TGTTTCCTCT TTTCCTGGTT TCTCAAGAGT 
GCTGCTGCTA ACGCGGTCCC CGGCACGCAC CATCTGTTGC CATCCCGGCC GGCCGAGGCA 
TTGCAGATTT TGGAAGATX3G CAAAGTTCAT GACACCCGTG ATCCAGGACA ACCCCTCAGG 
CTCGGOTCCC TGTCCGGTTC CCGAGCAGTT TCGGGATATG CCCTACCAGC CGTTCAGCAA 
AGGAGATCGG CTAGGAAAGG TTGCAGACTG GACAGGAGCC ACATACCAAG ATAAGAGGTA 
CACAAATAAG TACTCCTCTC AGTTTGGTGG TGGAAGTCAA TATGCTTATT TCCATGAGGA 
GGATGAAAGT AGCTTCCAGC TGGTGGATAC AGCGCGCACA CAGAAGACGG CCTACCAGCG 
GAATCGAATC AGATTTGCCC AGAQGAACCT CCGCAGAGAC AAAGATCGTC GGAACATGTT 
GCAGTTCAAC CTCCAGATCC TGCCTAAGAG TGCCAAACAG AAAGAGAGAG AACGCATTCG 
ACTCCAGAAA AAGTTCCAGA AACAATTTGG GGTTAGGCAG AAATGGGATC AGAAATCACA 
GAAACCCCGA GACTCTTCAG TTGAAGTTCG TAGTGATTGG GAAGTGAAAG AGGAAATGGA 
TTTTCCTCAG TTCA1GAAGA TGCGCTACTT GGAAGTATCA GAGCCACAGG ACATTGAGTG 
TTGTGGGGCC CTAGAATACT ACGACAAAGC CTTTGACCGC ATCACCACGA GGAGTGAGAA 
GCCACreCGG ASATNCAAGC GCATCTTCCA CACTGTCACC ACCACAGACG ACCCTGTCAT 
CCGCAAGCTG GCAAAAACTC AGGGGAATGT GTTTGCCACT GATGCCATCC TGGCCACGCT 
GATGAGCTCT ACCCGCTCAG TOTATTCCTG GGATATTGTC GTCCAGAGAG TTGGGTCCAA 
ACTCTTCTTT GACAAGAGAG ACAACTCTGA CTTTGACCTC CTGACAGTGA GTGAGACTGC 
CAATGAGCCC CCTCAAGATG AAGGTAATTC CTTCAATTCA CCCCGCAACC TGGCCATGGA 
GGCAACCTAC ATCAACCACA ATTTCTCCCA GCAGTGCTTG AGAATGGQGA AGGAAAGATA 
CAACTTCCCC AACCCAAACC CGTTTGTGGA GGACGACATG GATAAGAATG AAATCGCCTC 
TGTTGOyrAC CGTTACCGCA GTGGNAAGCT TGGAGATGAT ATTGACCTTA TTGTCCGTTG 
TGAGCACGAT GGCGTCATGA CTGGAGCCAA CGGGGAAGTG TCCTTCATCA ACATCAAGAC 
ACTCAATGAG TGGGATTCCA GGCACTGTAA TGGCGTTGAC TGGCGTCAGA AGCTGGACTC 
TCAGCGAGGG GCTGTCATTG CCACGGAGCT GAAGAACAAC AGCTACAAGT TGGCCCGGTG 
GACCrcCTCT GCTTTGCrGG CTCGATCTGA GTACCTCAAG CTTGGTTATG TGTCTCGGTA 
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CCACGTGAAA GACTCCTCAC GCCACGTCAT 
GTTTGCCAGC CAGATCAACC TGAGCGTGGA 
5 TGACATCTGC ATGAAGCTGG AGGAGGGCAA 
GGTCATCCGT GTCTACAGCC TCCCTGATGG 
AGAOGAGGAG GAAGAAGAGG AAGAAGAAGA 

10 

GGAGTTTGTC CTTCCACCGA GACTACGAGG 
ACTTGCTCTC TGACATTTAG CAGATGAAAT 
15 AAAAAAAAAA AAAAAAAAAN 



387 

CCTAGGCACC CAGCAGTTCA AGCCTAATGA 1740 

GAATGCCTGG GGCATTTTAC GCTGCGTCAT 1800 

ATACCTCATC CTCAAGGACC CCAACAAGCA 1860 

CACCTTCAGC TCTGATGAAG ATGAGGAGGA 1920 

GGAAGAAACT TAAACCAGTG ATGTGGAGCT 1980 

GCCTTTGATG CTTAGTGGAA TGTGTGTCTA 2040 

AAAATATATA TCTGTTTAGT CTTAAAAAAA 2100 

2120 



20 (2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 
^5 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO: 161: 

30 GGAAGCTGAA GTCCTTCCAG ACCAGGGACA ACCAGGGCAT TCTCTATGAA GCTGCACCCA 60 

CCTCCACCCT CACCTGTRAC TCAGGACCAC AGAAGCAAAA GTTCTCACTC AAACTGGATG 120 

CCAAGGATGG GCGCTTGTTC AATGAGCAGA ACTTCTTCCA GCGQGCCGCC AAGCCTCTGC 180 

35 

AAGTCAACAA GTGGAAGAAG CTGTACTCGA CCCCACTGCT GGCCATCCCT ACCTGCATGG 240 

GTTTCGGTGT TCACCAGGAC AAATACAGGT TCTTGGTGTT ACCCAGCCTG GGGAGGAGCC 300 

40 •ITCAGTCGGC CCTGGATGTC AGCCCAAAGC ATGTGCTGTG CAGAGAC3GTC TGTGCTGCAG 360 

GTGGCCTGCC GGCTGCTGGA TGCCCTGGAG TTCCTCCATG AGAATGAGTA TGTTCATGGA 420 

AATGTGACAG CTGAAAATAT CTTTGTGGAT CCAGAGGACC AGAGTCAGGT GACTTTGGCA 480 

45 

GGCTATGGCT TCGCNTTCCG CTATTGCCCA AGTGGCAAAC ACGTGGCCTA CGTGGAAGGC 540 

AGCAGGAGCC CTCACGAGGG QGACCTTGAG TTCATTAGCA TGGACCTGCA CAAGGGATGC 600 

50 GQGCCCTCCC GCCGCRGCGA CCTCCAGAGC CTGGGCTACT GCATGCTGAA GTGGCTCTAC 660 

GGGTTTCTGC CATGGACAAA TTGCCTTCCC AAMAWTGAGG ACATCATGAA GCAAAAACAG 720 

AAGTTTGTTG ATAAGCCGGG GCCCTTCGTG GGACCCTGCG GTCACTGGAT CAGGCCCTCA 780 

GAGACCCTGC AGAAGTACCT GAAGGTGGTG ATGGCCCTCA CGTATGAGGA GAAGCCGCCC 840 

TACGCCATGC TGAGGAACAA CCTAGAAGCT TTGCTGCAGG ATCTGCGTGT GTCTCCATAT 900 
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(2) INFOHMATICaJ FOR SEQ ID NO: 162: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



60 



180 
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300 



(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 162: 
GGCACGAGAT GAGGGC3CACC CAGTCCTTCT AGGGCAGGCT GGGTGGTGGT CCCCTAGGTA 
15 TCAGCCTCTC TTACTOTACT CTCCQGGAAT GTTAACCTTT CTATTTTCAG CCTGTGCCAC 120 
CTCTCTAGGC AAGCTCGCTT CCCCATTGGC CCCTGTQGGT CCACAGCAGC GTGGCTGCCC 
CCCAGGGCCA CCGCTTCTTT CTTCATCCTC TTTCCTTAAC AGTGACITGG GCTTGAGTCT 
GGCAAGGAAC CITCCTTTrA GCTTCACCAC CAAGGAGAGA GGTTGACATG ACCTCCCCGC 
CCCdCACCA AGGCTOGGAA CAGAQQQGAT GTGGTGAGAG CCAGGTTCCT CTGGCCCTCT 360 
25 CCAGQGTCTT TICCACTAGT CACTACTGTC TTCTCCTrGT AGCTAATCAA TCAATATTCT 420 
TCCCTTGCCT GTCGGCAGTG GAGAGGCTGC TGGGTGTACG CTGCACCTGC CCACTGAGTT 480 
GQGGAAAGAG GATAATCAGT GAGCACTGTT CTGCTCAGAG CTCCTGATCT ACCCCACCCC 540 
CTAGGATCCA GGACTGGGTC AAAGCTGCAT GAAACCAGGC CCTGGCAGCA AACCTGGGAA 600 
TOGCTOGAGG TGGGAGAGAA CCTGAACTTC TCTTTCCCTC TCCCTCCTCC AACATTACTG 660 
35 GAACTCTATC CTSTTAGGAT CTTCTGAGCT TGTTTCCCTG CTGGGTGGGA CAGAGGACAA 720 
AGGAGAAQGG AGGCTCTAGA AGAGGCAGCC CTTCTTTGTC CTCTGGGGTA AATGAGCTTG 780 
ACCTAGAGTA AATQGAGAGA CCAAAAGCCT CTGATTTTTA ATTTCCATAA AATGTTAGAA 840 
GTATATATAT ACATATATAT ATTTCTTTAA ATTTTTGAGT CTTTGATATG TCTAAAAATC 
CATTCCCTCT GOCCTGAAGC CTGAGTGAGA CACATGAAGA AAACTGTGTT TCATTTAAAG 
45 ATGTTAATTA AATGATTGAA ACTTGAAAAA AAAAAAAAAA AAA 1003 



900 
960 



50 (2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



60 AAGAAGCGGC ACACGGATGT GCAGTTCTAC ACAGAAGTGG GAGAGATAAC CACGGACTTG 
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C3GGAAACATC AGCATATGCA TCACCGAGAT GACCTCTATG CTGAGCAGAT GGAACCytfSAA 
A-TCAGGCACA AACIGAAAAC AGCCTTTAAA AATTTCATTG AGAAAGTAGA GGCTCTAACT 
AAGGAGGAAC TGGAATTTGA AGTGCCTTTT AGGGACTTGG GATTTAACGG AGCTCCCTAT 
AGGAGTACCT GCCTCCTTCA GCCCACTAGT AGTGCGCTGG TAAATGCTAC GGAATGGCCA 

ccmrarGG tgacattgga tgaggtagag ctgatccact ttragcgggt ccagtttcac 

CTCAAGAACT TTXSATATGGT AATOSTCTAC AAGGACTACA GCAAGAAAGT GACCATGATC 
AACGCTATTC CTGTAGCCTC TCTTGACCCX: ATXrAAGGAAT GGrTTGAATTC CTGCX5ACCTG . 
AAATACACAG AAGGAGTACA GTCCCTCAAC TGGACTAAAA TCATGAAGAC CATTGTTGAT 
QACCCTGAOG GCTTCTTCGA ACAAGGTGGC TGGTCTTTCC TGGAGCCTGA GGGTGAGGQG 
AOTGATCCTC AAGAAQGGGA TTCAGAGTCT GAAATTGAAG ATGAGACTTT TAATCCTTCA 
GAAGATGACT ATGAAGAGGA AGAGGAGGAC AGTGATGAAG ATTATTCATC AGAAGCAGAA 
GAGTCAGACT ATTCTAAGGA GTCATTGGGT AGTGAAQAAG AGAGTGGAAA GGATTGGGAT 
GAACTCGAGG AAGAAGCCCG AAAAGCGGAC CGAGAAAGTC GTTACGAGGA AGAAGAAGAA 
CAAAC?rCGAA GTATXSAGCCG GAAGAGGAAG GCATCTGTGC ACAGTTCGGG CCGTGGCTCT 
AACCCTOGTT CCAGACACAG CTCTGCACCC CCCAAGAAAA AGAGGAAGTA ACTTCTGAAC 
•mmXCTG AGCTCCATTC TTCCTCCAGC CAACCCCTGA AAATTTTACA TGACATAGAA 
ACTCTATTTT TCCTITCGTT TTCATTTGAA GTTTIGCCAT TrGTCTTTAT GGGTTTAGGG 
GGCCATITCT C?IX3GACCAAT CTACTCGGGG AATTCCAGGC CCACCAGGAC ACGTGCCAAT 
GGCCCCATTC AGATGGCAAG GGAGGAGGTG TTCTTGAAGA CAGGAGGAGG CTCCCGCTGT 
TAATAAATAT TGTTTCATTC TTCTCTCTrC CrGTCACXTTT CTGCCAAGAC ATTGATGGCT 
TCTCACATCT TATTTGCTCT CTCAAAGCTG TATTTCCAAG ACAGTGGTAC AAGGTGACCC 
TTAATTACX:C GTATCATCGT TCTTGACCAG CACATTCAAT CCTCCAACCT ACCCTACTGC 
CATGACCTTC CGCACATCTC TAAGTTTTAT CTTTGCAATA CTCAAGGTTC TCGGAAATTT 
GCTAATGGrr GTGATAAACC ATACAGCTTG AGCCAGTGAG GCAGATTGGG CTQGTGCCTT 
CGTCTCAOTT TTCCTGCTTT CCTGCCTCGT GCAGATTCTG AGGTATATCT GCTGCCTTGG 
AAGACATAAG AAGCAGTOAT ACTCCCTGGC TCGGTTATTT TCTCCATACA ATGCACACAT 
GOTACAATCA TAGAAGGCAA AATTGCCACT GTCTTCTTTT TTTTCTCATA TATCTAAGGA 
AGATATATCA GGTTCTGCCT CATGTACCGC TTCTAGrTGAA ATGTAGAGGA AGGCTCAAAG 
GAGTCAACAT TTAGATCTGG AAGGGACAAG TCATGCCTTG GGCCTAGAAT ACCCTGATGA 
GAAAAGAGAA GAGGAAGGGA GGCCATATCT ACAACANCAN CCTCTCGGCA CTGCTGCTCC 
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TTATTTTAAC TTTGTCTTGC ATTGTCCTGT ATTTATCACA GTTTCTGTTG AACAGCTTTT 1920 

CAAGTATTTG GGGAGTTTAT CTTGCCATCC TCCCCTTCTG GTTCTCTGCA CCCACCTGTC 1980 

5 

CCACTGCAGT TCCTTCCGTG CTCTGTGACT TTAAGAGAAG AAGGGGQGi«3 GGGTCCCGGA 2040 

TTTTATGTTT CJl'l ' ltyiTlTr TCTCCTTAGC AGTAGGACTT GATATTTTCA ATTTTGGAAG 2100 

10 AACTAAAAGA TGAATAAACT GGGTTTTTTT TGTTGTTTGT TTTTGTA/VAA AAAAAAAAAA 2160 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAA 2196 



15 



(2) INFORMATION FOR SEQ ID NO: 164: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1945 base pairs 

^ (B) TYPE: nucleic acid 

(C) STRANDEia^ESS : double 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTIC3N : SEQ ID NO: 164: 

GCACAGAGTC GQGCGGACGG ACAGGGAGAG GAGGAGAGGG GGTCTGCGCG CQGCCGCTAC 60 

CCAGAAGCCA GCGGACGGCA GCACGGAGTG GGCTGTCCCC GAGCCCAGCC CCGAGCGAGC 120 

30 

CCCCCCCCCG CCCCCGMAGG ACGCGCCTYC CAGCCAGCCC GACTYCTAGG AGGAGGGGAG 180 

GCGGGAAAGC AGCTCAAGCC TCACCCACCG CCCTGCCCCC AGCCCCGCCA CTCCCAGGCT 240 

35 CCTCGGGACT CGGCGGGTCC TCCTGGGAGT CTCGGAGGGG ACCGGCTGTG CAGACGCCAT 300 

GGAGTTQGTG CTGGTCTTCC TCTGCAGCCT GCTGGCCCCC ATGGTCCTGG CCAGTGCAGC 360 

TGAAA7VGGAG AAGGAAATGG ACCCTTTTCA TTATGATTAC CAGACCCTGA GGATTGGGGG 420 

40 

ACTGGTGTTC GCTGrTGGTCC TCTTCTCGGT TGGGATCCTC CTTATCCTAA GTCGCAGGTG 480 

CAAGTGCAGT TTCAATCAGA AGCCCCGGGC CCCAGGAGAT GAQGAAGCCC AGGTGGAGAA 540 

45 CCTCATCACC GCCAATGCAA CAGAGCCCCA GAAAGCAGAG AACTGAAGTG CAGCCATCAG 600 

GTGGAAQCCT CTGGAACCTG AGGCGGCTGC TTGAACCTTT GGATQCAAAT GTCGATGCTT 660 

AAGAAAACCG GCCACTTCAG CAACAGCCCT TTCCCCAQGA GAAGCCAAGA ACTTGTGTGT 720 

50 

CCCCCACCCT ATCCCCTCTA ACACCATTCC TCCACCTGAT GATGCAACTA ACACTTGCCT 780 

CCCCACTGCA GCCTGCGGTC CTGCCCACCT CCCGTGATGT GTGTGTGTGT GTGTGTGTGT 840 

55 GTGACTGTGT GTGTTTGCTA ACTGTGGTCT TTGTGGCTAC TTGTTTGTGG ATGGTATTGT 900 
GTTTGTTAGT GAACTGTQGA CTCGCTTTCC CAGGCAGGGG CTGAGCCACA TGGCCATCTG 960 

CTCCTCCCTG CCCCCGTGGC CCTCCATCAC CTTCTGCTCC TAGGAGGCTG CTTGTTGCCC 1020 
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GAGACCAGCC CCCTCCCCTG ATTTAGGGAT GCGTAGGGTA AGAGCACGGG CAGTGGTCTT 
CAGTCGTCTT GGGACCTGGG AAGGTTTGCA GCACITTGTC ATCATTCTTC ATGGACTCCT 
TTCACTCCTT TAACAAAAAC CTTGCTTCCT TATCCXaCCT GATCCCAGTC TC3AAGGTCTC 
TTAGCAACTG GAGATACAAA GCAAGGAGCT GGTGAGCCCA GCX3TTGACGT CAGGCAGGCT 
ATOCCCTTCC GTGGTTAATT TCTTCCCAGG GGCTrCCACG AGGAGTCCCC ATCTGCCCCG 
CCCCTTCACA GAGCGCCCGG GGATTCCAGG CCCAGGGCTT CTACTCTGCC CCTGGGGAAT 
GTCTCCCCTC CATATCTTCT CAGCAATAAC TCCATQGGCT CTGQGACCCT ACCTCTTCCA 

ACxrrrcccTC cttctcagac ttcaatctac agcccagctc atccagatgc agactacagt 

CCCTGCAATT GGGTCTCTGG CAGGCAATAG TTGAAGGACT CCTGTTCCGT TGGGGCCAGC 
ACACCGGGAT QGATX3GAGGG AGAGCAGAGG CCTTTGCTTC TCTGCCTACG TCCCCTTAGA 
TGGGCAGCAG AGGCAACTCC CGCATCCTTT GCTCTGCCTG TCRGTGGTCA GAGCGGTGAG 
CGAGGTCQGT TOGAGACTCA GCAGGCTCCG TGCAGCCCTT GGGAACAGTG AGAGGTTGAA 
GOTCATAACG AGAGTCGGAA CTCAACCCAG ATCCCGCCCC TCCTGTCCTC TGTGTTCCCG 
CGGAAACCAA CCAAACCGTG CGCTGTGACC CATTGCTGTT CTCTGTATCG TGATCTATCC 
TCAACAACAA CAGAAAAAAG GAATAAAATA TCCTTTGrTTT CCTAGTGAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA CTCGA 
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(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 
GGGTCGACCC ACGCGTCCGG CAGCCGTCGT TTGAGTCGTT GCTGCCGCTG 
GATCAGGAGC CAGTGTATAC CGCCCGCCCA CCGCCTTGGT GCCGCTAGAG 
GGAGGCCGCC TGCGGTTTGT CGCCGCAGCT CGCCCMCYGY CYGGRAGAGC 
CCCAGTCGGT CGCYTGCCAC CSCTCGTAGC CGTTACCCGC GGGCCGCCAC 
CGGGAGAGGC GCGCGCCATG GCYTCTGGAG CCGATTCAAA AGGTGATGAC 
CCATTCTCAA ACAGAAGAAC CGTCCCAATC GGTTAATTGT TGATGAAGCC 
ACAACAGTGT GGTGTCCTTG TCCCAGCCCA AGATGGATGA ATTGCAGTTG 
ACACAGTCTT GCTGAAAGGA AAGAAGAGAC GAGAAGCTGT TTGCATCGTC 



CCCCCTCCCG 
GAAACGAGAA 

CGAGCCCCGG 
AGCCGCCGGC 
CTATCAACAG 
ATCAATGAGG 
TTCCGAGGTG 
CTTTCTGATG 
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ATACTTGTTC TGATCAGAAG ATTCX3GATGA ATAGAGTTGT TCGGAATAAC CTTCGTGTAC 540 

GCCTAGGGGA TGTCATCAGC ATCCAGCCAT GCCCTGATGT GAAGTACGGC AAACGTATCC 600 

ATOIGCTGCC CATTGATGAC ACAGTGGAAG GCATTACTGG TAATCTCTTC GAGGTATACC 660 

TTAAOCCGTA CTTCCTQGAA GCGTATCGAC CCATCCGGAA AGGAGACATT TrTCTTGTCC 720 

OIGGTCGGAT GCGTGCTGTG GAGTTCAAAG TQGTQGAAAC AGATCCTAGC CCTTATTGCA 780 

TTCTTCCTCC AGACACAGTG ATCCACTGCG AAGGGGAGCC TATCAAACGA GAGGATQAGG 840 

AAGAGTCCTT GAATCyVAGTA GGGTATGATG T^TTGGTGG CTGCAGGAAG CAGCTAGCTC 900 

AGATAAAGGA GATQGTGGAA CTGCCCXTTGA GACATCCTGC CCTCTTTAAG GCAATTQGTG 960 

TGAAGCCTCC TAGAGGAATC CTGCTTTACG GACCICCTGG AACAGGAAAG ACCCTGATTG 1020 

CTCGAGCTGT AGCAAATGAG ACTGGAGCCT TCTTCTTCTT GATCAATGGT CCTGAGATCA 1080 

TGAGCAAATT GGCTGGTGAG TCTGAGAGCA ACCTTCGTAA AGCCTTTGAG GAGGCTGAGA 1140 

AGAATGCTCC TGCCATCATC TTCATTGATG AGCTAGATGC CATCGCTCCC AAAAGAGAGA 1200 

AAACTCATO3 CGAGGTOGAG CGGCGCATTG TATCACAGTT GTTGACCCTC ATGGATGGCC 1260 

TAAAGCAGAG GGCACATGTG ATTGTTATGG CAGCAACCAA CAGACCCAAC AGCATTGACC 1320 

CAGCTCTACG GCGATTTGGT CGCTTTGACA GGGAQGTAGA TATTQGAATT CCTGATGCTA 1380 

CAGGACQCTT AGAGATTCTT CAGATCCATA CCAAGAACAT GAAGCTGGCA GATGATGTGG 1440 

ACXTTGGAACA GTAGCCAATG AGACTCACGG GCATGTGGGT GCTGACTTAG CAGCCCTGTG 1500 

CTCAGAGGCT GCTCTCCAAG CCATCCGGAA GAAGATGGAT CTCATTGACC TAGAGGATGA 1560 

GACCATTCAT GCCGAGGTCA TGAACTCTCT AGCAGTTACT ATQGATGACT TCCGGTGGGC 1620 

CTTGAGCCAG AGTAACCCAT CAGCACTGCG GGAAACCGTG GTAGAGGTGC CACAGGTAAC 1680 

CTGGGAAGAC ATCGGGGGCC TAGAGGATGT CAAACGTGAG CTACAGGAGC TGGTCCAGTA 1740 

TCCTGTGGAG CACCCAGACA AATTCCTGAA GTTTGGCATG ACACCTTCCA AGGGAGTTCT 1800 

GTTCTA.TGGA CCTCCTGGCT GTGGGAAAAC TTTGTTGGCC AAAGCCATTG CTAATGAATG 1860 

CCAGGCCAAC TTCATCTCCA TCAAGGGrTCC TGAGCTGCTC ACCATGTGGT TTGGGGAGTC 1920 

TGAGQCCAAT GTCAGAGAAA TCTTTGACAA GGCCCGCCAA GCTGCCCCCT GTGTGCTATT 1980 

CTTTGATGAG CTX3GATTCGA TTGCCAAGGC TCGTGGAGGT AACATTGGAG ATGGTQGTGG 2040 
GGCTGCTGAC CGAGTCATCA ACCAGATCCT GACAGAAATG GATGCCATCT CCACAAAAAA 2100 
AAATCTGTTC ATCATTGGCG CTACCAACCG GCCTGACATC ATTGATCCTG CCATCCTCAG 2160 
ACCTGGCCGT CTTGATCAGC TCATCTACAT CCCACTTCCT GATGAGAAGT CCCGTGTTGC 2220 
CATCCTCAAG GCTAACCTGC GCAAGTCCCC AGTTGCCAAG GATGTGGACT TCGAGTTCCT 2280 
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GGCTAAAATG ACTAATGGCT TCTCTGGAGC TGACCTGACA GAGATTTGCC AGCGTGCTTG 
CAAGCTQGCC ATCCGTOAAT CCATCGAGAG TGAGATTAGG CGAGAACGAG AGAGGCAGAC 
AAACCCATCA GCCATGGAGG TAGAAGAGGA TGATCCAGTG CCTGAGATCC C3TCGAGATCA 
CTTTGAAGAA GCCATGCGCT TTGCXX:GCCX5 TTCTGTCAGT GACAAIGACA TTCGGAAGTA 
TGAGATGTTT GCCCAGACCC TTCAGCAGAG TCGGGGCTTT GGCAGCTTCA GATTCCCTTC 
AGGGAACCAG GOTGGAGCTG GCCCCAC3TCA GGGCAGTGGA GGCGGCACAG C3TGGCAGTGT 
ATACACAGAA GACAATGATC ATC5ACCTGTA TGGCTAAGTG GTGGTGGCCA GCGTGCAGTG 
AGCTGGCCTG CCTGGACCTT GTTCCCTGOG GGTGGGGGCG CTTGCCCAGG AGAGGGACCA 
GGGC?1QCX3CC CACAGCCTGC TCCATTCTCC AGTCTGAACA GTTCAGCTAC AGTCTGACTC 
TtSGACAGGGG GTTTCTGTTG CAAAAATACA AAACAAAAGC GATAAAATAA AAGCGATTTT 
CATTTCGTAA AAAAAAAAAA AAAAAAAAAT CXX^GGGGGGG GCCCGAACCA TTT 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 
TCGGAGAGCC GGCGGGCGNG CGCCTCTCGG CCAGGAAGCG CCTCTTGGAC GCGTGTNACC 
GATCCCCAGA AGTGGCCTTG GGCTGGGGAT CACCATAGCT TTTCTAGCTA CGCTGATCAC 
GCAGTTTCTC GTGTATAATG GTGTCTATCA GTATACATCC CCAGATTTCC TCTATATTCG 
TTCTTOGCTC CCTTCTATAT TTTTCTCAGG AGGCGTCACG GTGGGGAACA TAGGACGACA 
GTTAGCTATG GGTGTTCCTG AAAAGCCCCA TAGTGATTGA GTCTTCAAAA CCACCGATTC 
TGAGAGCAAG GAAGATTTTG GAAGAAAATC TGACTGTGGA TTATGACAAA GATTATCTTT 
TTTCTTAAGT AATCTATTTA GATCGGGCTG ACTGTACAAA TGACTCCTQG AAAAAACTCT 
TCACCTAC?rc TAGAATAGGG AGGTGGAGAA TGATGACTTA CCCTGAAGTC TTCCCTTGAC 
TGCCCOCACT GGCGCCIGTC TGTGCCCTGG AGCATTCTGC CCAGGCTACG TGGGTTCAGG 
CAGGTOGCAG CTTCCCAAGT ATTCGATTTC ATTCATGTGA TTAAAACAAG TTGCCATATT 
TCAAAGCCTT GAACTAAGAC TCAATTACCA ACCCGCAGTT TTGTGTCAGT GCCCAAAGGA 
GGTAGGTIX3A TGGTGCTTAA CAAACATGAA GTATGGTGTA ATAGGAATAA TATTTATCCA 
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AAAGATTTTT AAAAATAGG6 CTOK5TITAA AAAAAAAAAC AAAACARGAA AAGCAGCAGT 
GATTATAGAG AGGTCACACT CTAAGTCGGG TCC3CGGCGTG GCCACGCTTC ACGGTCACGC 

TCGTCCxyrcc tccagtggcg totttacatg gtcacacgtg tgtgtatcac cagtgggtca 

ACTGCITGTC ATTCCTCCCG TX3GCAGTTTG TCTAGACAAT CTTACTGAGC AAAAQC5CAAT 
GAAAAGTCTT GGTTCCCACA CKSCGATATA TrrOGAATTTT CACCTCAGTT TATGAAGTTT 
ATTTCGAAAT CCATAGTCAT CTAAGAATGA ATACCTGTCT GCCATGTATT TCAATCTTAG 
TCAGCCAAAA TIX3TTTCTTT GTTACTACAG AATAGAGATG ACTGTTTTTT GCCACAGCCC 
TATGGRATTT GCAATCTGTC ATrGCCTTGT AAAAAGGAGA GTGCATATGG CACTGCATTA 
AACOratGGT GTTTCTAGTC AATGATATTG GTGAGCACAA TGTATTCATT TAATGGCATA 
GACCATACCA GACCTAATTT GCAAGTATTG GGTCTTAAAC TTCAAGTGCA ATGTATATGA 
AAACCAATCT GAGCCTTOTA TCTCTTAAAT ATTTATTTTT TTTAACGTGT GAGATGTTCG 
AGRGAAGGTr CTCCATTCAT TTCAGTGCTG CXTTGCAGCSAA ACTCGGCAAT GATTTCTTTC 
AGTTCIGAAG TTCCTTTCGT GTTACACCCT CCACTGAACC CTCAACCTTC GAAATACTCC 
AGmTCTGG GTTTOGTCAT TrTTACITAT AAATTTACCT TTTTGTATTT TGCAATTTAC 
ATCTGTrrGG TTrGTTTTAA ATTCTGTGAA AGTGGCTTGA TTAAAAGACT CCTTTTAAAT 
GGAAGCCACC AGTCAGCAGA ATGGAAGCTT AGAGGAACTT GCCTGTGAGC GCTGGTCnT 
GTCTTIGGTT TIGTCATGTA ACGATCTTTG CTGGGGTrrT TrccrrTGrr TTGAGGGAAA 
TCTCrrOGAG TAAATTTTAA GTTCCTQGAG TrAATTTGTT TTACAGGAAT TTTGrmTTT 
AAAAAAATAG GATCATTCTC AACTTTQGAA TGACCCCCTT ATATATTTTC TGAAAATGAA 
AACACTPTACA TGAAAAAAAT TTCCAATGAA GATGTCAGCA TTTTATGAAA AACCAGAAGT 
TATTAGATCA AAGCAQCX5AG TGAATCTTTA AAACAGACTT GATCACGCAC ACACAATAAG 
TCTTTCTCTC CGAAACCGGA AGTAAATCTA TATCTGTTAG AAATAATGTA GCCAAAAGAA 
TGTAAArrTC AGGATTTTTT TGCCAATAGT TTATAGAAAA TATATGAACC AAAGTGATTT 
GAOTTTCTAA AAATCTAAAA TAGTATGAAC AAAATTTGCA CTCTACCAGA TTTGAACATC 
TAGTGAGGrr CACATTCATA CTAAGTTTTC AACATTGTGT TCTTTTTGCA TTCATTTTTr 
ACTTTTATTA AACGTTCAAA ACC 
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(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1816 base pairs 
60 tB) TYPE: nucleic acid 
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60 



(C) STRANDEDNESS: double 

(D) TOPOUIGY: linear 

(xi) SEQUENCE DESCRIPTIOJ: SBQ ID NO: 167: 
GGTOGGNAGC TTrNAA-mC CCCTTACWGG GGCGCTOTAA GC3QGAMCCT TCCCGGAATT 
TTCGGCnCGA CCCflCGCGTC CGGCCAGCCT AGGAGAAGAA GTTCGTAGTC CCAGAGGTCA 
GGCAQCaVGGC QGCAOTTICT GC3CGGGTGAG GGCC3GAGCTG AAGTGACAGC GGAGGCGGAA 
GCAACGOICG CTGQCSGOCSGA C3AAGQGGC3CT GGCCCCAGGA GGAGGAGGAA ACCCTTCCGA 
CaAAACACSCA ACAAGCTCAG CTCCTOTGAC AGAGGGGAAC AAGATGGCGG CGCa»AQQG 
CaGCCICTOG GIGAGCSACCC AACTOQQGCT CCCGCCGCTG CTCCTGCTGA CCATCGOCTT 
GGCCGGAGGT TCOGGGACO; CmXJGCTCA AGCATTTGAC TCGCnCTTOG GTGATAOGQC 

cyrcrroccAc coogcctotc agttgaccta ccccttqcac acctacccta aqcsaagaaga 

GITOTACGCA TOTCAGAGAG GTTOCAQGCT GnTITCAATr TCTCaGrTTG TGGATGATGG 
AA-rreACITA AATCGAACTA AATTGGAATG TGAATCTGCA TGTACAGAAG CATATTCCCA 
ATCTCATCAG CAATATCCTT GCCATCTTOG RTOCCAGAAT CAGCTGCCAT TCGCTGAACT 
GAGACAAGAA CAACrTATCT CCCTCATGCC AAAAATGCAC CTACTCrTTC CTCTAACTCT 
GCSTOAGCTCA TTCTCGACyre ACATCATGGA CTCCGCACAG AGCTTCATAA CCTCTTCATG 
GACmTTAT CTTCAAGCCG ATCACGGAAA AATAGTTATA rTCCRGTCTA AGCCCAGRAA 
TCCCSGOTAC GCACCACATT TGGAGCCAQG AGCCCTACCA AATTTGRGRG RAWOTrCTCT 
AAGCAAAATC ■TCCNTCAKMr CGSMAATGAG AAATTCACAA GCGCACAGGA ATTrrCTrGA 
AGATQGAGAA ASK3ATOGCT TTTTAAGAT6 CCTCTCTCTT AACTCTGGGT GGATTTTAAC 
TACAACTCTT GTCCTCTCGG TCATCGTATT GCTTTGGATT TGTOGTGCAA CITGTTGCTA 
CACGCTOTTO GACGCAGTAT AGnTTCCCTC TGAGAAGCTG AGTATCTATG GTGACTrGGA 
OTTTATCAAT GAACAAAAGC TAAACAGATA TCCAGCTTCT TCTCTTGrGG TTGTrAGATC 
TAAAACTCAA GATCATCAAG AAGCAGGQCC TCTACCTACA AAAGTCAATC TTGCTCATTC 
TGAAATTTAA GCAnTTTCT TTTAAAAGAC AAGTGTAATA GACATCTAAA ATTCCACTCC 
TCATAGAGCT TITAAAATGG TTTCATTCGA TATAGGCCTT AAGAAATCAC TATAAAATGC 
AAATAAACTT ACTCAAATCT ffTOAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGGGGGG 
GCCCOTTACC AAKTCX3CCCT ATOGTCADTB GTATIMrTAT TTTACTAATA TCTGTAGCTA 

rm ur m -r KGcrrKGOTr atogttttty tcccttytct wagctatrag ctgatcatkg 

CySCrrCTCA CCTCCTCCCA TCATACTGTC AG-rrACCTTA gttaacaagc tgaatattta 
C?rAGAAATCA TCCnCTCCT CAGGAATGGC CCACAAATCT GTAATTTGAA ATTTAGCAGG 
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AAATGACCrr TAATCACACT ACATTITCAG GAACTGAAAT CATTAAAATT TTATITCAAT 
AATTATOTGC TGAAAAAAAA AAAAAAAAAA AMWMRARASK RRWWACTCGA GGGGGGGCCC 
GGTACCCNAT TCGCCG 

(2) INFORMATION FOR SEQ ID NO: 168: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENC3TH: 945 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 
AGAAACCCTT GATOQGACTG AGAAACCAGA GfTTAAAACCT CnTOGAGCT TCTGAGGACT 
CAGCTOGAAC CAACQGGCAC AGTTCGCAAC ACCATCAACT TCTCCCAAGC AGAGAAACOC 
25 GAACCCACCA ACCAGGGGCA GGATAGCCTG AAGAAACATC TACACGCAGA AATCAAAGTT 
ATTGGGACTA TCCAGATCTT GTOTGGCATG ATGCTATTGA GCTTQGGGAT CATTTrGGCA 
TCTGCrrcCT TCTCTCCAAA TTTTACCCAA GTGACTTCTA CACTGTTGAA CTCTGCTTAC 
CCATTCATAG GACCCmTT ITITATCATC TCTGGCTCTC TATCAATCGC CACAGAGAAA 
AGGTTRACCA AGCTTTIGGT GCATAGCAGC CTGGrTGGAA GCATTCTGAG TGCTCTGTCT 
35 GCCCTOGTCG GTTrCATTAT CCTGTCTGTC AAACAGGCCA CCTTAAATCC TGCCTCACTG 
CAOTOMAGT TOGACAAAAA TAATATACCA ACAAGAAGTT ATGITTCrTA CTTTTATCAT 
GATTCACTTT ATACCACGGA CTCCTATACA GCCAAAGCCA GTCTGGCTGG AWCTCTCTCT 
CTCATGCTGA TTTOCACrcT GCK3GAATTC TGCCTAGCTG TGCTCACTGC TGTGCTGCGG 
TOGAAACAGG CrTACTCTCA CTTCCCTCGG AGTCSTACITr TCCrGCCTCA CAGTTACATr 
45 GGTAATTCTG GCATCTCCTC AAAAATGACT CATGACTGTG GATATGAAGA ACTATTGACT 
TCTTAAGAAA AAAGGGAGAA ATATTAATCA GAAAGTTGAT TCPTATGATA ATATGGAAAA 
CTTAACCATT ATAGAAAAGC AAAGCTIGAG TTTCCTAAAT GTAAGCTTTr AAAGTAATCA 
ACATTAAAAA AAACCATTAT TTCACTGTCA TTTAAAGATA ATGTG 
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(2) INFORMATION FOR SEQ ID NO: 169: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 
60 (B) TYPE: nucleic acid 
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(2) INFORMATION FOR SEQ ID NO: 170: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1883 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 
^ GGCAGAGCCA CAGGAAGGAT GAGGAAGACC AGGCTCTGGG QGCTGCTGTG GATGCTCTTT 60 
GTCTCAGAAC TCCGAGCTGC AACTAAATTA ACTGAGGAAA AGTATGAACT GAAAGAGGGG 120 
10 CAGACCCTGG AlHSOGAAATG TCACTACACG CTAGAGAAGT TTGCCAGCAG CCAGAAAGCT 
TOGCAGATAA TAAGGGACGG AGAGATQCCC AAGACCCTGG CATGCACAGA GAGGCCTTCA 
AAGAATTCCC ATCCAGTCCA AGTCGGGAGG ATCATACTAG AAGACTACCA TGATCATGGT 
TTACTCCGCG TCCGAATCGT CAACCTTCAA GTGGAAGATT CTQGACTGTA TCAGTGTGTG 
ATCTACCAGC CTCCCAAGGA GCCICACATG CTGTTCGATC GCATCCGCTT GGTGGTGACC 420 

20 AAGGGrrrrr cagggacccc tggctccaat gagaattcta cccagaatgt gtataagatt 480 

CCTCCTACCA CCACTAAGGC CTTCTCCCCA CTCTATACCA GCCCCAGAAC TGTGACCCAA 540 

GCTCCACCCA ACTCAACrcC CGATCTCTCC ACTCCTGACT CTGAAATCAA CCTTACAAAT 600 

gtcacagata TCATCAGGGT TCCGGTGTTC AACATTGTCA TTCTCCTGGC TGGTQGATTC 660 

CTCAGTAAGA GCCIGGTCTT CTCTCTCCTG TTTGCrGTCA CGCTGAGGTC ATTTGTACCC 720 

TAGGCCCACG AACCCACGAG AAICTCCTCT GACTTCCAGC CACATCCATC TGGCAGTTGT 780 

QCCAAGGGAG GAGQGAGGAG GTAAAAGGCA GGGAGTTAAT AACATGAATT AAATCTGTAA 840 

TCACCRGCTA AAAAAAAAAA AAAAAAAACN CGANCCTNGG TTTrCAGCTC CATCAGCTCC 900 
TT 



902 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 
AGAAAACAAC TGAAAAACCA CATTTTTCTA CATACAQCTG QGGAGGTAGC TGAGAACTTG 
GCACrcCGCA CACATACTAG GTTCAAAGAG AGTTGAGGAA ACCAGAAGGC CAAGTGGATC 
55 TGCTGGCAAA CCCTGAACCT G1CTCCTGCG CTTGCTCTAC AGTTCTGAAG TTGAAAATCC 180 
TTTTCATGCC TAGCATCK5C TIX3AGTTATA AACCCCAAGG CAGCCATGTC ATAGACTAGT 
GITTACTCTT GTrTTGACTT TCTTTTAATG CTTCCTAAGA CCCAAGTGCC TCCTGCTGTT 

60 
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TCXrrcCTTTG •TCGTAGCCTC TC3GCCATCTG QGACCTCAAT CCCCAGCTTT CCCACTTTCA 
GCAGTCCTTT GCTCTCTITC CTTCTACCTC AAATAC3CCCC AGGAGTGGC3C TTTAGTCTCC 
AATATCGAGC ATVTCAAGCT TCTCCTGQC3G GATQC3GGATT GQGATGGGCA GAATCTGTTT 

TCCvmrTCCG GOTTArrrcc agtcggtgta aaagcagagc tggcscctttc ccrcTcrrAT 

CCCTCAGGGT GGGTAAGAAG GACTCTATCT ACACCTGTTC TTCCCTACCT TCTCTTITCT 
TAGGGAGGCC TCATTCTAAG TTCCTCAAGA GAGTCCTTGG CTTAAAGCTG TAGCAAGGGT 
GTGCTAGGTC GGGGATTTCG AGCAAAACCG TCGAGTAGGC ATGATACTGG TATGGAGTGG 
15 GCCTGCAAAA TCAGACAGAA ATGGCTTGAG AAGCCGCAGG GGAGCATGCC TGTCTCTCAG 
TGATAGAGTA K3GGAGGGAC CTCCCTAGCT TGGAAAATGA GAATTGAAGG GGTTATGAAC 
AAATAGGATC CCTAOrTOAG GATOPTCCCA AAGTITrGTC CAATCTTATC ATTAGTAGAT 
nTATAAGCC ACAGAGACAA ACCAGAAACXS GAATAATGTT ACTTTGGATG CTTTATTITr 
TTGTTCTAGG TOTOGCTTTG TACATCCAGA AGAATGCTAT ATGCTGCACA TmGCCITr 
AAAGTCTTAC GACTTTCCCC ATTTrAGTCT AATQGGAAGA TACAGATGTG CAAGTCTGCT 
TmrcTTTT TTOTTATTAT riTl T mT i' TTGCTCTGTG TTATGGACAT TTTCAGACAT 
GCACAGAAGT GGAGAGGATG GTCCTTCGAC CCCATGTGTC CATCACCTAG CTGCATCACT 
TATCAGCTAT GOTCAACCTO GTITCA'TCTC TATCTCTCTC TITrCACCTG TATTGirTAT 
TCAAAATCCA AGACACTATC CCAA'TCCAAC CGTGACTACT rrCGGAGATT GGTAGTCTCT 
35 TTTCATCGrc ATAGTOATGG GGTCCACTAT CATAATCACA TCAGGTCTGC TTTrVGCTW 
TAATOTTAAC TAATCAAGTT CCAGAGATGG GCCTTAGAAA TCTGTITrAA GAATTAACAA 
GGAOTirrcAA AAAGAAATCA GAGGGATCCT TCCTTTCCCC TTGCATCTAC AAAACAAGAG 
AGAGACTCTT CTOITCrAAA ACTCTTTCAA AAATTCTGAT ATGGTAAGGT ACITCAGACC 
CITCACCAGA ATOKTAATCT TmrTCTCT GTAACATGGA AACrTGTCTG ACCATTAGCA 
45 TTCTTATCAG CTTOTACTCG TCTCATAACT CTGGnTTGG AAGAATAATT TGGAAATTGT 
TGCTOIOTTC TCTCAAAATA ACCTCCCCAA AATAATTAGT AACTGGTTGT TCTACTTGGT 
AATTTGACAC CCTCTTAATA ACGCAATTAT TTCTGTGTTC TTAAACAGTA TAAATAGTTG 

TAAOrrrcCA 'TCCATGATGG aaaaataaaa acctgtatct ctgttaaaaa aaaaaaaaaa 

AAAAAAAAAA AAAAAAAAAA AAA 
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(2) INFORMATION FOR SEQ ID NO: 171: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

TACTTTTAGA TTTACTGCCT TCAAAAAGTG CCTATTCTGA GCAACATAAA CGTTATTCCT 60 

10 TACATATGTA TGTACACACG GTACCCAGAG TOGTACTGTG GCAGCCTTCA AAAACATACC 120 

ATCAGAAAGA GTAGGTOCTG AGATAAGC^ ACTTTGCCAA ATGNAAGAAA GTCACTCACT 180 

TCCAATATCC CCTCTTCAAG CGGCTACCGT GRAASGQGCT GCAAACACAT TCCCTGAGCA 240 

15 

TCCCTTGCTG ATACAGCTTC TTTATATTTA TATOCTACTG GATQGTAGCA TATTGCTAAG 300 

GTTTCCTGTA CTCTGCTTCA AGGGAATGTA AGYTTTATGG CATTGAAACA TTTAGGAAAA 360 

20 AAAAAGATGT TTAAGAGAAT TAATAGAGCC GTAGTCTGTA TTAGGATGTG TGTCATATGT 420 

GTGTTCTATA AACTAAGCAT CGGTGGGTTT AGAGTGTTAA AGTGTCAGCA CATTCCTTCT 480 

CCTTTTGTCT CTCAGGCTAA CATGAGAGAA AATAGAAAAG TCTTGGCTGT GGGGATTGGA 540 

25 

AGCTCAGGGG GCCAAATGTC CTTGCCAGAT CCTTAGAGCA TTACTTTGAC TCCTAAAAAT 600 

AGTAGTGTAT GTTATTTGAT GQCTTTTGTT TCCATAGTTC CATCACTGAC AAAACTGTCA 660 

30 ATACTGTTGA TQGAGCAQCA GCATAGCCTA GAGTGATGCA TTCTTACCCA GAGGTGGCAA 720 

TAGGAGAGGG TCCATGTAAA TAGGACGAGG TAGACAGTGC ATGATTGTAG GAGAAGGGTT 780 

GAAGGGAGGA CATGATTCCA AAAAAGATCG TTCTCAATGT GTCGTCTGAC TCAACCAGCT 840 

35 

GGCAGATTAC ACTTGCCAAG TCGTTCCCTT TCCTTCTAAG TCAGTTGGCT CCATATTCAC 900 

TTGAATATGC CTCTGTTTGG GCAAAGCAAG ATACCTCCAC TTAACCTTTA TCCAAQGAAG 960 

40 CTCTTGGTGT CCTCTTGGTC ATAAAGTTGT CTCCTACCTA ACCCAGTTTT ACCAAATGGA 1020 

AGTAAAAGGG GACAAACTAT GGAAGATGGA CTCCATGCCA TTGCAGTCAG CCACCATTCT 1080 

CTTTTCCATA TAAGGAGCCC CATTACATAA GCTACQGGTG AGGTTGGAAC AGCTATGTTT 1140 

45 

CATAATTTCA AGAGTGTGAC CACCCTGCTC TAGTCATCAT CATTGGATGA ATCCAGTTGA 1200 

CTCTTTGGCA AAAQGGTGAT ACTTTTCACT AAAAATGCCT ACTCTTCCTG TTGATGTTCC 1260 

50 TTTTCTGTTT TrACCTTGTC CAATTTCCAC ACTAGTCATT TTTTTTATTT TTTAGAGGAT 1320 

CA6ATTTTAG CGCTGGAAAA TGAGTTCAAA AATTTCAGTG TAATGTCATA AGGATGTTGG 1380 

GATACAGAGA TTTTrTTTTT CCTTGGAAAC AAATGGACTG GGAAGAAACA CAGCATGGCT 1440 

55 

TTGCTCTGAG TTTCAATCTG ATGATTATGA CCATGGAAGA TAGTCTTATG TAAAGGTTAA 1500 

ATGGTGTTTA CAAGTGGATA GATAAGGCGG AGATGGTGAG AAGCCGGGTT TTCTCTATGC 1560 

60 TAAATGTGTC TACTAAGAGC AGCACTTCCT ACTAGCTAAG CACAATCATA GCCCCACCGT 1620 
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GATGAGCTGC TAGTCTGAAT AACATTCCCT GACTTAOGGA AAGQCACACA AAAACATATA 1680 

AAGAATATGT CTATTTTCAT ATGTGTCATA CTGACAGAGC CATGCSTATTC CTAAAATATA 1740 

5 

GGTTTCTCTT TTTTCTTGTA TTCTTAGCAA ATTQCATTTA TTCACTACAT TACAAACXAT 1800 

CACTGATGTA TCCAAAATA6 CACACATAGT TCAGTATGAA AATAAGA6AA TAAAATCTGT 1860 

10 TATAAGCAAG TGATTTAGGT ATTTTCTTTT GTGTTTATGC ATTATCTGAC TATATTAAAA 1920 
CCTGPTTTTC TATTTACCTT CTATCAGTTT TCTCTACCAA TTATGTTTTT TCAATGCTCT ' 1980 

ATAAGAATGA ATATGGAAAT TATATTTCTT TTTTCTGTAA AAGAGTTGCA ACTACTTTAT 2040 

15 

TATATTTAGA AATCCAATAA ACTTCTTATT ACATTTAAAA AAAAAAAAAA AAAACTOGAA 2100 



20 

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1930 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
<D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

30 

CCTTTOANTG TGGTCCCGGG TGCNGATTGG CAGCGCCTCC GCCGCGGCTC GTGGTTGTCC 60 

CGCCATGGCA CTGTCGCGGG GGCTGCCCCG GGAGCTGGCT GAGGCGGTGG CCGGGGGCCG 120 

35 GGTGCTCGTG GTQGGGGCGG GCGGCATCOG CTGCGAGCTC CTCAAGAATC TCGTGCTCAC 180 

CGGTTTCTCC CACATCGACC TGATTGATCT GGATACTATT GATGTAAGCA ACCTCAACAG 240 

ACAGTTTTTG TTTCAAAAGA AACATGTTGG AAGATCAAAS GCACAGGTTG CCAAGGAAAG 300 

40 

TGTACTGCAG TTTTACCCGA AAGCTAATAT CGTTQCCTAC CATGACAGCA TCATGAACCC 360 

TGACTATAAT GTGGAATTTT TCCGACAGTT TATACTGGTT ATGAATGCTT TAGATAACAG 420 

45 AGCTGCCCGA AACCATQTTA ATAGAATGTG CCTGGCAGCT GATGTTCCTC TTATTGAAAG 480 

TQGAACAGCT GGGTATCTTG GACAAGTAAC TACTATCAAA AAGOGTGTGA CCGAGTGTTA 540 

TGAGTGTCAT CCTAAGCOGA CCCAGAGAAC CTTTCCTGGC TGTACAATTC GTAACACACX: 600 

50 

TTCAGAACCT ATACATTGCA TCGTTTGQGC AAACJTACTTG TTCAACCAGT TGTTTGGGGA 660 

AGAAC3ATGCT GATCAAGAAG TATCTCCTGA CAGAC3CTGAC CCTGAAGCTG CCTQGGAACC 720 

55 AACGGAAGCC GAAGCCAGAG CTAGAGCATC TAATGAAGAT GGTGACATTA AACGTATTTC 780 

TACTAAGGAA TQGGCTAAAT CAACTGGATA TGATCCAGTT AAACTTTTTA CCAAGCTTTT 840 

TAAAGAIXSAC ATCAGGTATC TGTTGACAAT GGACAAACTA TGGCGGAAAA QGAAACCTCC 900 
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AGTTCXX3TTG GACTQQGCTG AAGTACAAAG TCAAGGAGAA GAAACGAATG CATCAGATCA 960 

ACAGAAOGAA CCCCAGTTAG GCXTOAAAGA CCAGCAGGTT CTAGATOTAA AC3AGCTAT0C 1020 

5 ACGTCTTTTT TCAAAGAGCA TCGAGACTTT GAGAGTTCAT TTAQCAGAAA AGOGGGATOG 1080 

AGCTGAGCTC ATATQGGATA AOGATGACCC ATCTGCAATG GATTTTGTCA CCTCTGCTGC 1140 

AAACCTCAGG ATQCATATTT TCAGTATCAA TATQAAGAGT AGATTTGATA TCAAATCAAT 1200 

10 

GGCAGGGAAC ATTATTCCTG CTATTGCTAC TACTAATGCA GTAATTGCTG QGTTGATAGT 1260 

ATTGGAAGGA ITGAAGATTT TATCAGGAAA AATAGACCAG TGCAGAACAA TTTTTTTGAA 1320 

15 TAAACAACCA AACCCAAGAA AGAAGCTTCT TGTGCCTTGT GCACTGGATC CTCCCAACCX: 1380 

CAATTGTTAT GTATGTGCCA GCAAGCCAGA GGTGACTGTG CX3GCTGAATG TCCATAAAGT 1440 

GACTGTTCTC ACCTTACAAG ACAAGATAGT GAAAGAAAAA TTTGCTATQG TAGCACCAGA 1500 

20 

TGTCCAAATT GAAGATGGGA AAGGAACAAT CCTAATATCT TCCGAAGAGG GAGAGACGGA 1560 

AGCTAATAAT CACAAGAAGT TGTCAGAATT TGGAATTAGA AATGGCAGCC GGCTTCAAGC 1620 

25 AGATGACTTC CTCCAGGACT ATACTTTATT GATCAACATC CTTCATAGTG AAGACCTAGG 1680 

AAAGGACGTT GAATTTGAAG TTGTTGGTGA TGCCCCXX3AA AAAGTGGGGS CCAAACAAQC 1740 

TGAAGATGCT GCCAAAAGCA TAACCAATGG GCAGTGATGA TGGGAGCTTC AGCCCTCCAC 1800 

30 

CTYCACAGCT TCAAGGAGGC AAGATGGACG TYTCYCATAG TTGATYCGGR TGAAGAAGRT 1860 

TCTCCAATAA TTGCCCGACG TTCATTGAAG GAAGGAQGAG QAGGCCCGCC AAGAGGQGAA 1920 

35 TTTAGGNTTG 1930 



40 (2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1509 base pairs 

(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: dovible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

50 GGCCCTGGCC TCTGGGCTGA GGCTTGCTAG GGACTCGGGG TGGCTCTAAG GGGCAiGGGAT 60 

AGGGCTGGGG AGCGCCGGCC TGfTGGCCCTG ACCAGCCCCT TCTCGTGCRG GTTCCACCCC 120 

GATGCAGGTG GTCACGTGCT TGACGCQGGA CAGCTACCTG ACQCACTCCT TCCTCCAGCA 180 

55 

CCTCATGGTC GTGCTGTCCT CTCTGGAACG CACGCCCTCG CCGGAGCCTG TTGACAAGGA 240 

CTTCTACTCC GAGTTTGGGA ACAAGACCAC AGGGAAGATG GAGAACTACG AGCTGATCCA 300 

60 CTCTAGTCGC GTCAAGTTTA CCTACCCCAG TGAGGAGGAG ATTGGGGACC TGACGTTCAC 360 
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TGTGGCCCAA AAGATGGCTG AGCCAGfiGAA GGCCCCAC3CC CTCAGCATCX: TGCTGTACGT 420 

GCAGGCCTTC CAGGTGGGCA TGCCACCCCX: TGGGTQCTGC AQGQGCCCCC TGCGCCTCAA 480 

5 

GACACTCCTG CTCACCAGCT CCXSAGATCTT CCTCCTQGAT GAGGACTGTC TCCACTACCC 540 

ACTGCCCGAG TTTGCCAAAG AOCCOCOGCA GAGAGACAGG TACCGGCTGG ACGATGGCCG 600 

10 CXX3CGTCCGG GACCTQGACC GAGTGCTCAT GGGCTACCAG ACCTACCCGC i«XrCTCACC 660 

CTCGTCTTCG ATGACGTGCA AGGTCATGAC CTCATGGGCA GTGTCACCCT GGACCACTTT 720 

GGGGAGGTGC CAGGTGGCCC GGCTAGAGCC AGCCAGGGCC GTGAAGTCCA GTGGCAGGTG 780 

15 

TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG CTCATCTCGC TGTTGGCTCG CCAGTGGGAG 840 

GCCCTGTGTG GCCX3TGAGCT GCCTGTCGAG CTCACCGGCT AGCCCAGGCC ACAGCCAGCC 900 

20 TOTCGTGTCC AGCXTTGACGC CTACTGGGGC AGGGCAGCAG GCTITTGTGT TCTCTAAAAA 960 

TGTTTTATCC TCCCTTTGGT ACCTTAATTT GACTGTCCTC GCAGAGAATG TGAACATGTG 1020 

GTTAATTCTT TCTCATGTTG GGAGTGAGAA TGCOGGGCCX! CTCAGQGCTG 1080 

GTCAGCXrrCC CACAGGTGGT ACAGCCX5TGC ACACCAGTGT CGTGTCTGCT 1140 

GTTGTGGGAC CGTTGTTAAC AOGTGACACT GTGGGTCTGA CTTTCTCTTC TACACGTCCT 1200 

30 TTCCTGAAGT GTCGAGTCCA GTCCTTTGTT GCTGTTGCTG TTGCTGTTGC TGTTQCTGTT 1260 

GGCATCTTGC TGCTAATCCT GAGGCTG6TA GCAGAATGCA CATTGGAAGC TCCCACCCCA 1320 

TATTGTTCTT CAAAGTGGAG GTCTCCCCTG ATCCAGACAA GTGGGAGAGC CCGTGGGGGC 1380 

35 

AGGGGACCTG GAGCTGCCAG CACCAAGCGT GATTCCTGCT GCCTGTATTC TCTATTCCAA 1440 

TAAAGCAGAG TTTGACACCG TCAAAAAAAA AAAAAAAAAA AAAAAAAAAA ATTNCTGCOG 1500 

40 CCTCAAGGG 1509 



25 



45 (2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEXX^S: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 
55 TCGACCCCAS GCGTCCGTGC TTTTCCACAG AAGGTTAGAC CCTGAAAGAG ATGGCTCAGC 60 
ACCACCTATG GATCTTGCTC CTTTGCCTGC AAACCTGGCC GGAAGCAGCT GGAAAAGACT 120 



60 



CAGAAATCTT CACAGTGAAT QGGATTCTGG GAGAGTCAGT CACTTTCCCT GTAAATATCC 180 



W0 98A39448 



403 



PCTA;S98/04493 



AAGAACCACG GCAAGTTAAA ATCATTGCTT 
CACCAGGAGA CTCAGAAACA GCACCCGTAG 
5 GGATACATGC CTTAGGTCCX; AACTACAATC 
CAGGAGACTA CAAAGCAGAC ATAAATACAC 
ACAACCTGCA AATCTATCGT CGGCTTGGGA 

10 

CTGTGAACAG CACCTGTAAT GTCACACTGA 
TGACATACAA TTGGAGTCCC CTGGGAGAAG 
15 CTGAGGACCA AGAQCTGACT TACACGTGTA 
ACTCCATCTC TGCCTGGCAG CTCTGTGCAG 
CCX3GGTTGCT GAGCGTGCTG GCTATGTTCT 

20 

TTTTGTTCCG TITGTTCAAG AGAAGACAAG 
ATATCATGGC TTCAAQGAAC ACCCAGCCAG 
25 AGTCCAAGGT GCTTCCCTCC AAGGAAGAGC 
TTGCTGATAA GATX3GGGAAA GCXIAGCACAC 
ATGAAA1TGT GATCTAGGCT GCTGGGCTGA 

30 

CCAATACTGG CAGGTTCCCT GGATCCAGAT 
CAAACTQCCA CATCTCAGCC TGTAAGCAAA 
35 QCCTAAATGG ACAAATGGAT GCATACXXHT 
AAAGCAGGTT ACCTAGTATA GTTTTCCCAA 
TAATATTTTT ATGGCACACT GGGATAAACA 

40 

ATGACTAGAG GCCTCTTGTG ACTQGAQGTA 
GGATCAATAT TTTGCACACC TGTAATAGGC 
45 CAGTCAGTAT GTGTGAAGAT CCCTGGTGCG 
GGAAAATGTA CCXriTCGCTT GAGGCAGATG 
AGCAGAATGT GGGCTGCATA TAAGCACACT 

50 

GCATAACAQG CTTAGTAAGT CCAAACACAG 
TTGTQGCTCT CAGCCATGTA GACACACTCT 
55 GCAGGGTCTA GAGACTGCTG GGACACTTTT 
ATTTTCTTTC TGGCCAAGAT TTCCTTCTGT 
AGCCATGCCC AGTATTCCCA CTCTCCAAAA 

60 



GGACTTCTAA AACATCTGTT GCTTATGTAA 240 

TTACTGTCAC CCACAGAAAT TATTATGAAC 300 

TGGTCATTAG CGATCTGAGG ATQGAAGAOG 360 

AGGCTGATCC CTACACCACC ACCAAGOGCT 420 

AACCAAAAAT TACACAGAGT TTAATGGCAT 480 

CATGCTCTGT AGAGAAAGAA GAAAAGAATG 540 

AGGGTAATGT CCTTCAAATC TTCCAGACTC 600 

CAGCX:CAGAA CCCTGTCAGC AACAATTCTG 660 

ACATCGCAAT GGGCTTCCGT ACTCACCACA 720 

TTCTGCTTGT TCTCATTCTG TCTTCAGTGT 780 

ATGCTGCCTC AAAGAAAACC ATATACACAT 840 

CAGAGTCCAG AATCTATGAT GAAATCCTGC 900 

CAGTGAACAC AGTTTATTCC GAAGrTGCAGT 960 

AGGACAGTAA ACXnCCTGGG ACTTCAAGCT 1020 

ATTCTCCCTC TGGAAACTGA GTTACAACCA 1080 

CTTCTCTGCC CAACTCTTAC TGGGAGATTG 1140 

GCAGGAAACC TTCTGCTGGG CATAGCTTGT 1200 

CCTGAAATGA CTCCCTTCTG AATGAATGAC 1260 

ACTTCTTCCC ATCATAGCAC ATGTAGAAAA 1320 

AGCAAGATTG CTCACTTCTG GAAGCTGCAT 1380 

ACAACCCTGC (XAGTAACTG TGGGAGAAG6 1440 

CATQGCACAC CAGCCAAGAT GCTCTGCTCA 1500 

TQGCCTTCAC CACGCATCTT GAGCAAATTA 1560 

CAGCCCTTCC CCCGAGTGCA TGGCTTGGAG 1620 

CATCCXTTTTG TCTGGGAATC TTTGTGCAGG 1680 

ATGACAGTGC TGTGTQGGTC TCTGTCAGAG 1740 

CCAAATGGAG TGTTGGAAAA TGTTCTTTCT 1800 

CTTQGAGTGC TACITCAGAA GCCTTATAGG 1860 

ATCACTCCAA GCAGCCTCAG CAGAAGAAGC 1920 

GGAACTGACC AGCTTATATT TCTCACACTT 1980 
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CTCGGGAACT GQGTATAATC CAACCATCAA 
TCTCCAGAAG GAACTTGGGA GATGATGGTG 
5 CAAAGACTCA GAGAACTAGA GTTTAAGCTG 
ACAAACAGAT CACCAGCCAG CTTACACAGG 
TCACAACTGA GCAAGACATT CATATGATCA 

10 

AAGTATAATC GGCTAACTCC TAAATCCCAA 
TGCAATTAGG CAGATAAAGA CATCAGTCCC 
15 CAACTACCAT TAGCACTATG TTAGGAGCTG 
GTCTQCTCTT GTGTAGCTCA GGAGACAATT 
TCCAQCTQCA AGTAATAGCA TGAAC7\GTCA 

20 

AGCTQGGCCT TGAAGGATGG ATGAAATTTG 
AAGTGAGAGA AGCATGAAAA ATGAGCAGGG 
25 CTCrcCAGAT GCACCATGCA TGCTCACAGT 
CCAGATGTGT GCCCCCACCC CATGTCCATT 
TACCTCTTCT GTAAAGCTTT CXXTTGCTATC 

30 

CACTGCTGTT TTTTCCTCTT TGGTCCTTCT 
AGCATAACTA ATTATTTGTT TTCCTCACTA 
35 GAAGCCKGCT GGGGTGGTGG CTCACGCCTG 
GCGGATCACC TGAGGTCAGG ARTTCGAGAT 
OTACTAAAAA TACGAAATTA GCCAGGTGTG 

40 



404 

AATAGAAGAC CTTGCAAGAA GCAGAGTCAT 2040 

CAGATGATGA AACTGGGTTC ATCCCAGTTC 2100 

AGGCAGAGTG CCGCCACCXTT GGCATGCCCC 2160 

CATTAACTCT CXTTCAATGAG GAAGAATCAT 2220 

TTTAAGGAAG TGTTTCCCTT ATGTGTTAGC 2280 

TGAATAGTCC TAGGCTQGAC AGCAATQGGC 2340 

AGTAAATGAA TCCATAGACT CATCTAGCAC 2400 

CAAGGCCCCA AAGTAGAAGA TGTQCATAAT 2460 

OCAQCACAGA CACTACAGTT AACXXTTGAAC 2520 

GAAAAATACC TTATGAGGGG GCAGGQCTGA 2580 

GATAGAGAAT GAGGAAGACA GAGGGCCTCC 2640 

GCCTGGATCA GTGGGGTGTA TTCAGAGCAC 2700 

CCCTTGCCTA TGTGTGGCAG AGTGTCCCAG 2760 

TACATGTCCT TCAATGCCCA CCTCAAAAGG 2820 

AGGAATCAAA ATTAATCAGG GATCTTTTCA 2880 

ATCACTAAAA CTCATCTCAT TGAGCCTTAC 2940 

CATTGTACAT GTGQGAATTA CAGATAAACG 3000 

TAATCCCAAC ACTTTQQGAG GCCAAGGCAG 3060 

TARTCTGGCC AACATGGTGA AACCCCATNT 3120 

GTGGCACACA TCTGTAGTCC GAG 3173 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 175: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDE£)NESS: double 
50 <D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

AAATTCGGCA CAGCTGAGAG GAGACACAAG GAGCAGCCCG CAAGCACCAA GTGAGAGGCA 60 

TGAAGTTACA GTCTGTTTCC CTTTGGCTCC TGGGTACAAT ACTGATATTG TGCTCAGTAG 120 

ACAACCACGG TCTCAGGAGA TGTCTGATTT CCACAGACAT GCACCATATA GAAGAGAGTT 180 



60 TCCAAGAAAT CAAAAGAGCC ATCCAAGCTA AGGACACCTT CCCAAATGTC ACTATCCTGT 



240 
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CCACATTGGA GACTCTGCAG ATCATTAAGC 
TCCTGGCGTT CTACGTGGAC AGGGTGTTCA 
TGAGAAAAAT CAGCAGCATT GCCAACTCTT 
G?rCAGGAACA GAGGCAGTGT CACTGCAGGC 
ATGACAACTA TGATCAGCTG GAGGTCCACX? 
ACGTCTTTCT AGCCTGGATT AATAAGAATC 
GAACCTGTAT AGTGATCCAG GGATGAACAC 
CACCTTGAAG GGGAAGGAGA TGGGGAAOGC 
CTCAGGCTGT CTTATTCCGC TTGAAAATAG 
ACTCTATCTG CTGAAAGGGC CTGCAGGCCA 
AATTTATTGT GAAGTCATAT AGTCCATGTC 
ACACATTGTA CTGAGTGGTT TTTCTGAATA 
AAAAACTCGA GQGGGGGCXIC GTACCCAATT 



CCTTAGATGT GTGCTGCGTG ACCAAGAACC 
AGGATCATCA QGAGCCAAAC CXTCAAAATCT 
TCCTCTACAT GCAGAAAACT CTGCGGCAAT 
AGGAAGCCAC CAATGCCACC AGAGTCATCC 
CTGCTGCCAT TAAATCCXTO GGAGAGCTCX3 
ATGAAGTAAT GTCCTCAGCT TGATGACAAG 
CCCCTGTGCG GTTTACTGTG GGAGACAGCC 
CCCTTGCAGC TGAAAGTCCC ACTQGCTGGC 
CCAAAAAGTC TACTGTGGTA TTTGTAATAA 
TCCTGGGAGT AAAGGGCTGC CTTCCCATCT 
TGTGATGTGA GCCAAGTGAT ATCCTGTAGT 
AATTCCATAT TTTACCTAAA AAAAAAAAAA 
T 



(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 176: 
ACAGCCCTCT T0GQAGCCT6 AGCCCGGCTC TCCTCACTCA CCTCAACCCC CAGGCGQCCC 
CTCCACAGGG CCCCTCTCCT GCCTQGACGG CTCTGCTGGT CTCCCCGTCC CCTQGAGAAG 
AACAAGGCCA TGGGTCGGCC CCTGCTGCTG CCCCTRCTGC YCCTGCTGCW GCCGCCAQCA 
TTTCTGCAGC CTRGTGGCTC CACAGGATCT GGTCCAAGCT ACCTTTATGG GGTCACTCAA 
CCAAAACACC TCTCAGCCTC CATGGGTGGC TCTGTGGAAA TCCCCTTCTC CTTCTATTAC 
CCCTGGGAGT TAGCCAYAGY TCCCRACGTG AGAATATCCT GGAGACGGGG CCACTTCCAC- 
GQGCAGTCCT TCTACAGCAC AAGGCCGCCT TCCATTCACA AQGATTATGT GAACCGGCTC 
TTTCTGAACT GGACAGAGGG TCAGGAGAGC GGCTTCCTCA GGATCTCAAA CCTGCGGAAG 
GAGGACCAGT CTGTGTATTT CTGCCGAGTC GAGCTGGACA CCCGGAGATC AGGGAGGCAG 
CAGTTGCAGT CCATCAAGGG GACCAAACTC ACCATCACCC AQGCTGTCAC AACCACCACC 
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ACCTGGAOGC CCAGCAGCAC AACCACCATA GCCGGCCTCA GGGTCACAGA AAGCAAAGGG 660 
CACTCAGAAT CATGGCACCT AAGTCTGGAC ACTGCCATCA GGGTTGCATT GGCTGTOGCT 720 
5 G'raCICAAAA CTGTCATTTT GGGACTGCTG TGCCTCCTCC TCTGTGGTGG AGGAGAAGGA 780 
AAGGTAGCAG QGCGCCAAGC AGTGACTTCT GACCAACAGA GTGTGGGGAG AAGGGATGTG 



10 



20 



25 
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(2) INFORMATION FOR SEQ ID NO: 177: 



840 



TATTAGCCCC GGAGGACGTG ATGTCAGACC CX3CTTGTGAG TCCTCCACAC TCGTTCCCCA 900 



960 



TTCGCAAGAT ACATGGAGAG CACCCTCaGG ACCTTTAAAA GGCAAAGCCG CAAQGCAGAA 

QGAGGCTOGG TCCCTGAATC ACCGACTCGA GGAGAGTTAC CTACAAGAGC CTTCATCCAG 1020 

15 GAGCATCCAC ACTTGCAATGA TATAGQAATG AGGTCTGAAC TCCACTGAAT TAAACCACTG 1080 

GCATTTGGGG GCTGTTYATT ATAGCAGTGC AAAGAGTTCC TTTATCCTCC OCAAGGATGG 1140 

AAAATACAAT TTATTTTGCT TACCATACAC CCCTTTTCTC CTCGTCCACA TTTTCCAATC 1200 

TGTATGGTOG CICTCTTCTA TGGCAGAAGG TTTTGQGGAA TAAATAGCGT GANATGNTNC 1260 

TGACTNAAAA AAAAAAAAAA AAAAACTCGA 1290 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 
TGGGGCCCCT TTTCGATGCT CTQQGTGTTT TTGCCAAGAG TTACAGGATG TCAAGTGTGG 60 
40 GGAGCTCAGC ACCCTTGCTG TGGACCAGTG AAGGCTGTTC CAGACCAGGT GCTTCCAGAC 120 
ATTTCCAGGC TCCAGGAGAG AGGCTGQGAG CCCCCACAGA AAGCACAGGA AAATGCAAAA 180 
AAAAAACAGT CrTTTT T rT T TTTTTQCTTT TTATTATGAA AACAAAACAA ATGCCCCAGG 240 
AGAAGGGTCC ATGATTACCA GAAACATCAA AGAGTACTTT CTACCATTTT TATTCTGTTG 300 
TGTTGAGGCC AQCATTGCAA TAAACAAGCT AAACTACTTA CATTGGACTC ATTTTCAGTA 360 
50 ACTCACATTT ACAGGAATAT ACTAGAAACG GCACTAAAAA GTTTAAGAAA AGTTACGGTA 420 
AACTTGCATG CACATCATAC AGAAAAGTAA CATTTTAAAT ATAAAAAAGA AAAACTTCCT 480 
GGAAGCATTA TGCCAGTATT AAGGAACAGT GCTACTCTGG ATGTGACAAA TTCTGTATGT 540 
GGGTGTTACT CTTTCCCAAA AGACTGTCAG AQGCGTGAGT GCTGCAAAAG AACAACAACA 600 
AAAACAAACA CACAAAAAAA TGTGTCTTAC AGTTTGTAAG CAAGATGACA CTGCCCAACA 660 
60 CAAAGAGGGG TCTGGAGTTC AGTTCACGCC CGAAGCCTGC CCCCTCGGCC TCCAGGGGTC 720 
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ATTCAGAGTG TTCTCAAATC CAATTCOSAC ACACX3ACTTG TCACTACTCC TCTCCCCTTG 780 

AAAAAAGCAT GTTAGAAGCT GCCCTACAGG TCTCAGCAGT GGGACAATCT AATTGAATCA 840 

5 

CCGCAQCCTT CTAATACAGA AGAAACGGAC GTGACTGTCA CCCTCAGCCC GCCAGCAAGG 900 

GCGCTGAGGA AGTCATTAAT CCTTCGAAAC TCTGAAAAGA AACCAGTGTT GAAGTCTGGA 960 

10 CAGAAAGCCT TAAAAAAGTG ACAGCACCAA TGCAGCTGCT CAGTGTACCC NCCGTGGGCT 1020 

GTCAQGGTCA GTGQCTTCTT TCTAGATGAA AGGAGCAGAG GCGAGCCGAC GCX»CCQTCA 1080 

CAGAGAACCA GCCGAGAAGG AAAGGCCCCA CX».TGCTCCC TGTGCGCTOC CCCCACAOCC 1140 

15 

GGCCGCTCCC CCGACGGCTC ACACAGQCAG CACCTCACTG CCCTOPGGCT QGAGGGGCAT 1200 

TGCAAGGAGC GCCCCCCAS3C CCCAGGCACC CXXX3GCTTAG GGTOTACGTA TCACCCAGCC 1260 

20 CTGTGCTQGC AGCACGTTAC CAACCAGCCT GCGTGAAGAC CTGTCAACTG TCGTGTGTGA 1320 

ATTCCTTAAA TTCGGTTTAA ATAGTCCATT AAAGATCTGT TTAGAAAATA CCTTTGAAAA 1380 

CGAGGGTAAC TTTAAAAAAT GGAAACTTTC AAATCCATTT ATATTTTTAT TATAAACAAA 1440 

25 

ACTTAATTAA AAGTTTAACA AACTGGCTGA AAACTCACCA AGTGrTCAGAC TCACCAGCAA 1500 

TTTAAAAAAT GATAATTTAC CAGCATCTCC TCATCAGAGT TCCCTCTCCA GTAAGGGTAT 1560 

30 ACCTACATCT GTAAGGGTCA GTQGACTCTG AATCAATTTT ATGGTTGTTT TAAAATCACC 1620 

GTCTATTAGG ATACTAATGA TAGTCCCTAT ATCCATCX^G AAATGCTGGC AGAAAQCACT 1680 

GGCCACCATA CAGGACAGAC CACACCACAG CTCCATACCC AGCGTCTGCC TGGAGGCTCX: 1740 

35 

CXTACGCTGA GGTCCGGGAG AATGCCTGGT TTCAGTCATT TCCQGACTAA CTGTGACAAC 1800 

GCGTGAGCAG GGAGCACCGT GCGAGTCTCC GGGAGGGAAT CCTCCTGGGG CCCAGAGACT 1860 

40 CCICCACCCX: TGOGGAGGGC AGACAGQCTC GGGARGGCCT GGCCAGGCCA CTGGAGGCTG 1920 

GCAGGGAGCA GGCATGTCCA CCCGCAAGCC TGGGAGGCTA ACTCTGGCAT TCCTGGCCGG 1980 

AGCCGCCATG CTCATTGGTG GGCCAGTTTG GGACATCCCC GTACTCAAAG ACCATATGGC 2040 

45 

AGCCTCTGGG AAAACAAAAC CAAAACATCA CCTTCTATTA AACTCTGTAT ATTATTATTT 2100 

TTTACAATAG AAAGTTAAAA ATCAAGACTT AGATTTACTA TACTOTTTTT CTCTCAGATT 2160 

50 ACAAAGTrTA TATTATATAA CTGQGGTTCC CTAAATTGAT TTCTTTTAAA ACAGTCTTAA 2220 

AGAGACCAGA AGTGAATACA AAAGAACTAA ACAAAATAAA AAATTAGAAT GTGCTGTAGC 2280 

TGAAACCTGT 2290 

55 



60 



(2) INFORMATION FOR SEQ ID NO: 178: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



10 



20 



30 



45 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 178: 
GGCACGAGCC ATGCCTGGCC TCTCCTTGAT TCTTACAGTC ACTTTGTTGG CTGTTTCTGA 
CrCAGCAGCT ACCTGCATTS TGGCCAAAGG ATGACCTATT CCTTCTCAGG AGGGCAAAAA 
TGTGGAATAG TGTCTGTCCA TQCCTCTCCT CATGGGCTAC CACCTCTGCC ACCGTQGTTA 



55 



(2) INFORMATION FOR SEQ ID NO: 179: 



60 
120 
180 



15 AlCAGTAACA ACCAGGAGAG AAGCTGCTGG AACTGACCTC TGGGAACTCC CTGGGATQGT 240 



TTOGTCCAQG AATGTAGTAG GCATACACGT GGTTQCGTGG ATCTQGQCCC TCCTGATGTG 300 

ACyPAGAGAGG TAAAAGGCCA CCATCTCCTT GACCTCTQGG GAACTCATCC ACAAAGAAGA 360 

TGTTTCCAAG ATGCTTCTCA AGATTGCCTA AAAATAGCCG GTTTCCACCC CCGTGAATGC 420 

ATCCATTCTA GAATGCTCCT TCACCAQGAC CAGAGAACTG ATTTACAGAA GTGACATGAA 480 

25 AACATTCCAT CCCAGAATTT GCAGTAGCTC AAATTAAGTT TCTAGCTATT AAAAAGAAAA 540 
AAAAAAAAA 



549 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 

GGCACGAGGG CICATTCATT CCGCGCCGQG CCTGCCAGAC ACCTGCGCCC TTCTGCAGCC 60 
GCCCGCCGCA TCCGCCGCCG CAGCCCCCAG CATGTCGGGC CCAGACGTCG AGACGCCGTC 120 
CGCCATCCAG ATCTOCCGGA TCATGCGGCC AGATGATGCC AACGTGOCCG GCAATGTCCA ISO 
CGGGGGGACC ATCCTGAAGA TGATCGAGGA GQCAGGCGCC ATCATCAGCA CCCGGCATTG 
50 CAACAGCCAG AACGGGGAGC GCTGTGTGGC CGCCCTGGCT CGTGTCGAGC GCACC6ACTT 

CCTGTCICCC ATGIXSCATCG GTGAGGTGGC GCATGTCAGC GCGGAGATCA CCTACACCTC 360 
CAAGCACTCT G'TCGAGGTGC AGGTCAACGT GATGTCCGAA AACATCCTCA CAGGTGCCAA 420 
AAAGCTGACC AATAAGGCCA CCCTGTQGTA TGTGCCCCTG TCGCTGAAGA ATGTGGACAA 480 
GGTCCTCGAG GTCCCTCCTC TTGTGTATTC CCGGCANGAG CAGGAGGAGG AGGGCCGGAA 540 
60 GCGGTAIXSAA GCCCAGAAGC TGGAGCGCAT GGAGACCAAG TGGAGGAACG GGGACATCGT 



240 
300 



600 
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cx:agcx:agtc CICAACCCAG AGCCGAACAC TGTCAGCTAC AGCCAGTCCA GCTTGATCCA 660 

CCTGGTG5GGG CCTTCAGACT GCACCCTGCA CGGCTTTGTG CACX3GAGGTG TGACCATGAA 720 

5 

GCTCATGGAT GAQGTCGCCX5 GGATCGTGGC TGCACGCCAC TGCAAGACCA ACATCGTCAC 780 

AGCTTCXXSTG GACGCCATTA ATTTTCATGA CAAGATC3«3A AAAGGCTGCG TCATCACXAT 840 

10 CTCGGGACGC ATGACCTTCA CGAOCAATAA GICCATGGAG ATCGAGGTGT TGGTGGACGC 900 

CGACCCTGTT GTGGACAGCT CTCAGAAGCG CTACCGGGCC QCCAGTGCCT TCTTCACCTA 960 

CGTC?rCGCT6 AOCCAQGAAG GCAOGTCGCT GCCTGTOCCC CAGCTOGTGC CCGAGACCGA 1020 

15 

GGACGAGAAG AAGCGCTTTG AGGAAGGCAA AGGGCGGTAC CTGCAGATGA AGGCGAAGCR 1080 

ACAGGGCCAC GCXSGASCYTC AGCTCTAGAC TCCCTCCTCC TGCXIACrGGT GCCTCGACTA 1140 

20 GCTATGGCAA CGGGCCCAGT GTCXAGTCAC TTAGAAGTTC CCCCCTTGGC CAAAAACCCA 1200 

ATTCACATTG AGAGCTGGTG TTGTCTGAAG TTTTCGTATC ACAGTGTTAA CCTGTACTCT 1260 

CTCCTGCAAA CCTACACACC AAAGCTTTAT TTATATCATT CCAGTATCAA TGCTACACAG 1320 

25 

TGTTGTCCCG AGCGCCGGGA GGCGTTGGGC AGAAACCXTTC GGGAATGCTT CCGAGCACGC 1380 

TGTAGGGTAT GGGAAGAACC CAGCACCACT AATAAAGCTG CTGCTTGGCT GGAAAAAAAA 1440 

30 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1500 

AGAAAAAAN 1509 



35 



(2) INFORMATION FOR SEQ ID NO: 180: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1316 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI^SS: double 
<D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

AGCTGTATCA TAGGAAAGAT GGCCACACCG GCGGTACCAG TAAGTGCTCC TCCQGCCACG 60 

CCAACCCCAG TCCCGGCGGC GGCCCCAGCC TCAGTTCCAG CGCCAACGCC AGCACCGGCT 120 

50 

GCGGCTCCGG TTCCCGCTGC GGCTCCAGCC TGCATCCTCA GACCCTGCGG CAGCAGCGGC 180 

TGCAACTGCG GCTCCTGGCC AGACCCCGGC CTCAGCGCAA NTCCAGCGCA GACCCCAGCG 240 

55 CCCGCTCTGC CTGGTCCTGC TCTTCCAGGG CCCTTCCCCG GCGGCCGCGT QGTCAGQCTG 300 

CACCCAGTCA TTTTGGCCTC CATTGTGGAC AGCTACGAGA GACGCAACGA GGGTGCTGCC 360 

CGAOTTATCG GGACCCTGTT GGGAACTGTC GACAAACACT CAGTGGAGGT CACCAATTGC 420 

60 
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TTTTCAGTGC CGCACAATGA GTCAGAAGAT 
AATATGTATG AACTGCATAA AAAAGTTTCT 
5 ACGGGCCATG ACATCACAGA GCACTCTGTG 
CCCCAACCCC ATCCACCTCA CTGTGGACAC 
AGCCTACGTC AGCACTTTAA TGGGAGTCCC 

10 

TCTGACAGTG AAATACGCGT ACTACGACAC 
GACCTGCTTT AGCCCCAACA GAGTGATTGG 
15 GGCATCAGCT CX3CATCCAGG ATGCCCTGAG 
GTCTGGAAAG GTGTCAGCTG ACAATACTGT 
AGTACXX5AAA ATAGTTCCCG ATGACTTTGA 

20 

TTTGATQGTG ACCTACCTGG CCAACCTCAC 
TGTAAACCTG TGAATGGACC CCAAGCAGTA 
25 CTCAGAAGTG AAGGAGAAAT GGGrTTTTTG 
GTGTGTGACT CTAATAAACG GAGCCTACCT 
SGRQGGGGGG CCCGGTCCCA TTSSCCCTTT 

30 



GAAGTGGCTG TTGACATQGA ATTTGCTAAG 480 

CCAAATGAGC TCATCCTGGG CTGGTACGCT 540 

CTGNATCCAT GAGTACTACA GCCGAGAGGC 600 

AAGTCrrcCAG AACGGCCGCA TGAGCATCAA 660 

TCGGAGGACC ATGGGAGTGA TGTTCACGCX: 720 

TGAACGCATC GGAGTTGACC TGATCATGAA 780 

ACTCTCAAGT GACTTGCAGC AAGTAGGAGG 840 

TACAGTGTTG CAATATGCAG AGGATGTACT 900 

GC3GCCGCTTC CTGATGAGCC TGGTTAACCA 960 

GACCATGCTC AACAGCAACA TCAATGACCT 1020 

ACAGTCACAG ATTGCACTCA ATGAAAAACT 1080 

CACTTGCTGG TCTAGGTATT AACCCCAGGA 1140 

TGGTCTTGAG TCACACTGftG ATAGnCAGTT 1200 

TTTGTAAATT AAAAAAAAAA AAAAAAACC2J 1260 

NGTAATTCGT NTTACAATCC CCNGGC 1316 



{2) INFORMATION FOR SEQ ID NO: 181: 

35 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
40 (D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 





GGCATGWKCA GACATGACTT CTATTGCCAG GCTGGTCAAG TQGCAGGGTC ATGAGQGAGA 


60 


45 








CATCGATAAG GGTC3CTCCTT ATGCTCCCTG CTCTGGAATC CACCAGCGQG CTATCTQCGT 


120 




TTATCGGGCT GQGGACTAGA ATTGGATGCT TCAAAACCAT CACCTGTTQG CCAACAAGTT 


180 


50 


TGACCCAAAG GTAGATGATA ATGCTCTTCA GTQCTTAGAA GAATACCTAC GTTATAAGGG 


240 




CCATTCTATT GGGACCTGAA CTTTGAAGAC CACAMTATTG AAGAGGCGTT GCTTACCYGT 


300 




TGGGGGCCAA GAGGCATGTT ACCAAACATG GYYCARGAAM YTTGGYKGQG AMCARKKKKG 


360 


55 








GKKGGGARRM CMRGGGVTTG SCAAWTTCSK KGGCMWCCYT TTAGGGTAAR RRGGGCKGTO 


420 




ATTAGATTCT GGGTAAAGTA GGATCTTTTG CCCTTGCAAA TTTGCTGCCT GGGTGAATGY 


480 


60 


TGCTTGTTCC TTCTCMACCC CTAACCCTAG TAGTTCCTCC ACTAACTTTC TCACTAAGTG 


540 



wo 98/39448 



411 



PCTAJS98/04493 



10 



AGAATGAGAA CTGCTGTGAT AOGGAGAGTG AAOGAGGGAT ATGTGGTAGA GCACTTGATT 600 

TCAGTTGAAT GCCTGCTGGT AGCTTTTCCA TTCTGTGGAG CTGCCGTTCC TAATAATTCC 660 

AGGTTTQGTA GCGTGGAGGA GAACTTTGAT GGAAAGAGAA CCrTCCCTTC TGTACTOTTA 720 

ACTTAAAAAT AAATAGCTCC TGATTCAAAG TAAAAAAAAA AAAAAAAAAA AAAAAAA 777 

(2) INFORMATION FOR SEQ ID NO: 182: 

15 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEQNESS: double 

(D) TOPOLOGY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
GGCACAGATA ACTATGTACA TGTATTCCTT AAATGTTTTT TTAAGTTTTA TATTCTTGQC 60 
25 ACTGGTCTTC AAATGTGTAC ATGTGTGCCA GGGAGCAAAT GCCTTCTTGT TTCTGAAATT 120 
GGTCTTTTAG ACTGTTCTTT TTTCCCATCT TCTCACCTCC TGCCCCTCCT TCAGGGTACT 180 
TCCGTGGCCA GAACCCCTCC AGGTCAGAGG CAGAAGAGAA GCCTCATGOG TCACAGCAGC 240 

30 

AGATGTQGGC TGGAGATCTA TTCATTTGGT TTTGGCTTGA ATTTTCTGRA TGGTTTACTT 300 
GATCYTGGGA AAGANATATC TTGCCAGGAA AAATGATAGN CCTTGACAAT GTTGAATGAT 360 
35 CCTGCACCAC CTTGAAAGAC ATTTCTAATA TGGTTTGTCA GGCAAAGTGG TTAGTAGTCA 420 
TTTGTGQCCT GAGGTAGAAG TCCTCAGAAA TCAGCAGACT TCACTGATAA AATGCTGACT 480 
TGCCCCTGGA CTGGGCTCTG TGAGAGTGGC CTTCTGCACT GTGCACAGTA GGTGTGAACA 540 

40 

CACCACACCT ACAGGGACCA CGTQGTGGGC TGTGGACTAG CGGCCAAGCT CCCTGCAGGC 600 
CCACTAATAG AATTCAGCTT TTAGCATGGG CTGTTTCATA CTGTTCTGAT GAAACTGATT 660 
45 TGGTTTCTTT CCTCCATACC CCTTCTGCAT TTCAGTGTTT TTGTTTAGTT TTCCTGGTTT 720 
TTAATTATAA CTACAAAATA AAATCTTTAG GCTATTCACC TTAGCTTAGT AAAAAAAAAA 780 
AAAAAAAACT C 791 

50 



55 



(2) INFORMATION FOR SEQ ID NO: 183: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1405 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : double 

60 (D) TOPOLOGY: linear 
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txi> SEQUENCE DESCRIPTION: SBQ ID NO: 183: 
AAATTCATTA ACAGCITCAA AGAAGGCTCT GGTTTTGAAG GCCTAGATAG CAGCACTGCC 60 

^ AGTAGCATCG AGCTGGAAGA ACTTCOGCAT GAGAAAGAGA TGCAGAGGGA GGAAATACAG 120 

AAGCIGATGG GCCAGATACA TCAGCTCAGA TCCGAATTAC AGGATATGGA CX3CACAGCAA 180 

10 GITAATGAAG CAGAATCAC5C AAGAGAACAG TTACAGGWTC TGCATGACCA AATAGCTGGG 240 

CAGAAAGCAT CCAAACAAGA ACTAGAGACA GAACTGGAGC GACTCAAGCA GGAGTTCCAC 300 

TATATAGAAG AAGATdTTA TCGAACAAAG AACACATTGC AAAGCAGAAT TAAAGATCGA 360 

GACGAAGAAA TTCAAAAACT CAGGAAICAG CTTACCAATA AAACTTTAAG CAATAGCAGT 420 

CAGTCTCAGT TAGAAAATCG ACTCCATCAG CTAACAGAGA CTCTCATCCA GAAACAGACC 480 

20 ATCCTCGAGA GTOTCAGCAC AGAAAAGAAC TCCCTGGTCT TTCAACTGGA GCGCCTCGAA 540 

CAGCAGATCA ACTCCGCCTC TGGAAGTAGT AGTAATGGGT CrTCGATTAA TATGTCTGGA 600 

ATTGACAATG GTGAAGGCAC TCGTCTOCGA AATGTTCCTG TTCPTTTTAA TGACACAGAA 660 

25 

ACTAATCTCG CAGGAATCTA CGGAAAAGTr CGCAAAQCTG CTAGTTCAAT TGATCAGTTT 
ACTTATTCGCC TOGGAATTTT TCTCCGAAGA TACCCCATAG CGCGAGTTTT TGTAATTATA 
30 TATATCGCTT TOCTTCACCT CTCGGTCATG ATTGTTCTGT TGACTTACAC ACCAGAAATG 840 
CACCACGACC AACCATATCG CAAATCAACC AAGCCCAGTT CTTQCAGrGA TrGGTTGTCT 

rrrrcTAGAC tigggatctc caagaaggcc aattgcctaa aatttctgag aacagtocac 
35 

AAGATTATTT TATCACTACA AGCTTTTAAC TTTTTAAGTr ATTGTACAAG TATTCTACCT 1020 
AAATCTTCCA ATTTCCTTTA AATCGTAAGA GTTTCTAAAA CAGACAATAA TTTAACAAGC 1080 
40 TCAGcrcroc tttatcigag tttagtcgic CTAATATATA TGTAGAGAAA GATGGTCGGG 1140 
TTOITCACCT CTOTACAGAC CATCTOTATXS TTAOGTGACA TTGATTATGG GTTATAATCA 
GGGAAACTAA TTOTATTTAG TCACAAAAAT AAAAAGTTTT rmTTATAA TTCAGTCTGC 

45 

•mTOGATTT TCATATATTT AACTTTGCAA AAAGATTTAC TTrGTACATG TTACAGGCTr 



720 
780 



900 
960 



GATTGGTCTA AAlCTTTrTA TAAATACATA AATAAAAGNA AAATATCCAT TmCTTTrC 
50 TAAAAAAAAA AAAAAAAAAA CTCGA 



1200 
1260 
1320 
1380 
1405 



55 (2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 
go (C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(xi) SBQUQICE DESCRIPTION: SEQ ID NO: 184: 

GTCATGCAGT GCX3CCQGAGA ACTGTGCTCT TTGAGGCOGA OGCTAGQGGC CCQGAAGQGA 60 

AACTGCGAGG CGAAGGTGAC CGGQGACCX3A QCATTTCAGA TCTQCTCGGT AGACCTGGTG 120 

CACCACCACC ATGTTGGCTO CAAGGCTGGT GTGTCTCCX3G ACACTACCTT CTAQGGTTTT 180 

CCACCCAGCT TTCACCAAGG CCTCCCCTGT TGTGAAGAAT TCCATCACGA AGAATCAATG 240 

GCTGTTAACA CCTAGCAGGG AATATGCCAC CAAAACAAGA ATTQQGATCC GGCGTGGGAG 300 

15 AACTGGCCAA GAACTCAAAG AGGCAGCATT GGAACCATCG ATGGAAAAAA TATTTAAAAT ' 360 

TGATCAGATG GGAAGATGGT TTG1TGCTGG AGGGGCTGCT GTTGGTCTTG GAGCATTGTG 420 

CTACTATGGC TTC3GGACTGT CTAATGAGAT TGGAGCTATT GAAAAGGCTG TAATTTGGCC 480 

20 

TCAGTATGTC AAGGATAOAA TTCATICCAC CTATATGTAC TTAGCAGQGA GrATTGGTTT 540 

AACAGCTTTG TCTQCCATAG CAATCAGCAG AACGCXTTCTT CTCATGAACT TCATGATGAG 600 

25 AGGCTCTTGG GTGACAATTG GTOTGACCTT TGCAGCCATG GTTGGAGCTG GAATGCTGC3T 660 

ACGATCAATA CCATATGACC AGAGCCCAGG CCCAAAGCAT CTTGCTTGGT TGCTACATTC 720 

TGGTGTGATG GGTGCAGTGG TGGCTCCTCT GACAATATTA GGGQGTCCTC TTCTCATCAG 780 

30 

AQCTGCATGG TACACAGCTG GCATTGTGGG AGGCCTCTCC ACTGTGGCCA TGTGTGCGCX: 840 

CAGTGAAAAG TTTCTGAACA TGGGTGCACC CXrTGGGAGTG GGCCTQGGTC TCGTCTTrGT 900 

35 GTCCTCATTG GGATCTATGT TTCTTCCACX: TACCACCGTG GCTGGTGCCA CTCTTTACTC 960 

AGTGGCAATG TACGGTGGAT TAGTTCTTTT CAGCATGTTC CTTCTGTATG ATACCCAGAA 1020 

AGTAATCAAG CGTQCAGAAG TATCACCAAT CTATQGAGTr CAAAAATATG ATCCCATTAA 1080 

40 

CTCGATGCTG AGTATCTACA TGGATACATT AAATATATTT ATGCGAGTTG CAACTATGCT 1140 

QGCAACTGGA GGCAACAGAA AGAAATGAAG TGACTCAGCT TCTGGCTTCT CTGCTACATC 1200 

45 AAATATCTTG TTTAATQGQG CAGATATGCA TTAAATAGTT TGTACAAGCA GCTTTCGTTG 1260 

AAGTTTAGAA GATAAGAAAC ATGTCATCAT ATTTAAATGT TCCGGTAATG TGATGCCTCA 1320 

GGTCTGCCTT TTTTTCTGGA GAATAAATGC AGTAATCCTC TCCCAAATAA GCACACACAT 1380 

50 

TTTCAATTCT CATGTTTGAG TGATTTTAAA ATGTTTTGGT GAATGTGAAA ACTAAAGTTT 1440 

GTGTCATGAG AATGTAAGTC TTTTTTCTAC TTTAAAATTT AGTAGGTTCA CTGAGTAACT 1500 

55 AAAATTTAGC AAACCTGTGT TTGCATATTT TTTKGGAGTG CAGMMTAWTG TAATTARAGC 1560 

ATTCCAGTAA NAGTGTNTTT AAAGTTGWTC TATATN 1596 

60 
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(2) INFOPMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

GCGCAGAGCC CGYACGAGCA GGACGACGAC GACAAGGGCG ACTCCAAGGA AACGCGGCTG 60 
ACCCTCATCG AGGAAGT3CT CCTGCTGGGC CTCAAGGACC OCGARQGTTA CACATCATTT 120 
TCGAATCACT GTATATCATC TGGATTACGT GGCTGTATGT TAATTGAATT AQCATTGAGA 180 
QGAAGGTTAC AACTAGAGGC TTC3TGGAATG AGACGTAAAA GTCTATTAAC AAGAAAGGTA 240 

20 ATCTC3TAAGT CAGATGCTCC AACAGGGGAT GTTCTrCTTG ATGAAGCTCT GAAGCATC3TT 
AAGGAAACTC AGCCTCCAGA AACGGTCCAG AACTGGATTG AATTACTTAG TC3GTGAGACA 
TQC3AATCCAT TAAAATTCCA TTATCAGTTA AGAAATGTAC C3C3GAACGATT AGCTAAAAAC 420 

25 

CTGOTGGAAA AGGGICTATT GACAACAGAG AAACAGAACT TCCTACTTTT TGACATGACA 480 
ACACATCCCC TCACCAATAA CAACATTAAG CAGCGCCTCA TCAAGAAAGT ACAGGAAGCC 540 



300 
360 



600 
660 



30 CTTCTTGACA AATGC3GIGAA TCACCCTCAC CGCATGGACA GGCGCTTGCT GGCCCTCATT 
TACCTCQCTC A1X3CCTCQGA CGTCCTGGAG AATGCTTTTG CTCCTCTTCT GGACGAGCAG 

TATGATTTOG CTACCAAGAG AGTCCGQCAG CTTCTCGACT TAGACCCTGA AGTGGAATGT 720 

35 

CTCAAGGCCA ACACCAAIXSA GGTTCTGTGG GCGGTGGTGG CGGCGTTCAC CAAGTAACTC 780 

TGCTCGGGGT GAACCATTCT CCTTTCTCTC AAGTAAACCA GTAGmTTC TTCTGTTGAC 840 

40 TTCTGOTrrr CTCTAATITC TACTTTCCCA CACTATAATT GGCTTCTGTT TTACAAAATG 900 

GTCGCyrGGCT r m trmTT TGTACGTGTA CAGGATTCTG CTGGTACGAG AGGCCTTCCT 960 

CTTTCTOPTT TTAAAAAAAG TTTTACTCCC ATATTQGCAT TCCATTCCCT GTTGCCATCC 1020 

45 

TCACTOTTAC CTCITITCGG TTTCTGGrrCT ACTTTGACTT TCAAAGTACC TCCAGCCTCC 1080 

TCATACGCAC AGCITITOGA TCACCTCAGC TTGAGlTrCT CCATATGTGC ATGTACATCT 1140 

50 AGCATTCTGC CTACAGTTCA GACAGAAGTC ACAAAAAGGC CTTCAACTCA CCAAAQGTAA 1200 

ATATCTGTAT CTATTAGGAC ATrTTTTACA TAGACTTCAG TTGAGATGTA TACTTAGCAA 1260 

AATTATTTIT AAATTGAAAC AGCACAGTAA ATACTTAATA TAAAATGTCC CTTGCATITr 1320 

GCTTCCCATG TAAATCTATT GTATTATTAC ACTTGTTATA ATTTTAACTA TAAAGGTCCA 1380 

ATrcmCAC AGAGCCAGTT TGGGATGGGC 1X3CATTCCAT TTATGCTGTA TATAGTTTGA 1440 

60 ATTATATATA AATTACCCCT TCTTCTOGCC ACCCCTGCTC CCATCTTAGT ATTTTGCAAG 1500 
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ATCTAATCAG TTCTACACCT GGTGCCCCTC GCTTGCTTCA ATCATC5GTTA TTTGATGGCA 1560 
AAATOGACCT CTTGTCGCTC AAGGAGAGAG AAAAGATGTG TGPTCTCATTG GTCCTGGGAT 1620 

5 

TTTTTGAGCT GTGCCATTTA TGGTACTCTT TGCCTATGCA TCCCCTTTTT AGATTTTTTT 1680 

TAAATTTTAT CTTACTGTTT TTATAATTTC TATTCGGAAG AGGCTTGrTGA CCAGTACCAA 1740 

10 TCTTGAGTTT CTTTTTCTGT CCACAAGTAA ATTAATATCT GCTCTGAAAT GTCATTTATC 1800 

TACTCACACA TTCTTGGGGA AAAAAATCAA ATGTCACTCC TAGCAGATGT TGCATGTAAA 1860 

TTQGTAGCAA GTAATGATTA CAACOCAGAG GATTAAGAAT TTTGTAACAG AAAGCTCTAT 1920 

15 

GTTrTAATTT TTTATATACA ATTAGGATAA TTAGCATTGT CAC3ACTATAA ACCTTTGCTT 1980 

nTAAAGTTT ATTTTTACTA TTrCTTTATC ACTTTATTGT ATCATCACCA TTGGTTTCAT 2040 

20 AATGTAAATA CTATATGTTG AACAAATTAA ATGOCAAAAT TTTTTATTAC CATAGTCCAT 2100 

GTTAATAGTG GGGCTTTCAG GTC3TTTAGAG ATTITTTTTG TTGTTGTTAA CATTCATTGC 2160 

AAAAGTACTA GATGGTGTAT AACTCTAGAG TTGAATTTTA AGGGATTCCC TAATATGTAT 2220 

25 

ACTATCTTTT TATCTGAAGT AATAAATAAA CAATGATCTT GAAAGTGCCY RAAAMAAAAA 2280 

AAAAAAAAAA AAA 2293 

30 

(2) INFORMATIOI FOR SEQ ID NO: 186: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEraOESS: double 

(D) TOPOLOGY: linear 

40 

(xi) SEQUENCE DESCRIPTIC»): SEQ ID NO: 186: 
GGCACGAGGC GAGCCGGCGC ACCGTACGCT GGGACGTGTG GTTTCAGCTC GTGOGCCTCC 60 
45 CCGTGGGTTT GCGACGTTTA GCGACTATTG CGCCTGCGCC ACGCCGGCTG CGAGACTQGG 120 
GCCGTQGCTG CTGGTCCCGG GTGATGCTAG GCGGCTCCCT GGGCTCCAGG CTGTTGCGGG 180 
GTGTAGGTQG GAGTCACGGA CGGTTCQGGG CCCGAGGTGT CCGCGAAGGT GGCGCACATG 240 

50 

GGCGGCAGGG GAGAGCATGG CTCAGCGGAT GGTCTOGGTG GACCTGGAGA TGACAGGATT 300 
GGACATTGAG AAGGACCAGA TTATTGAGAT GGCCTGTCTG ATAACTGACT CTGATCTCAA 360 
55 CATTTTGGCT GAAGGTCCTA ACCTGATTAT AAAACAACCA GATGAGTTGC TGGACAGCAT 420 
GTCAGATTGG TGTAAGGAGC ATCACGGGAA GTCTGGCCTT ACCAAGGCAG TGAAGGAGAG 480 
TACAATTACA TTGCAGCAQG CAGAGTATGA ATTTCTGTCC TriGTACGAC AGCAGACTCC 540 

60 
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TCCAGQGCrC TOTCCACTTC CAGGAAATTC AGTTCATGAA GATAAGAAGT TTCTTCACAA 600 

ATACATC30CX CAOTTCATCA AACATCTTCA TTATAGAATA ATTGATGTGA GCACTGTTAA 660 

AGAACrorcC AGAOSCTCGT ATCCAGAAGA ATATGAATTT GCACXIAAAGA AGC3CTGCTTC 720 

TCATAGGGCA CTTGATCACA TTAGTGAAAG CATCAAftGAG CTTCAGTTTT ACCGAAATAA 780 

CATCrrCAAG AAAAAAATAG ATX3AAAAGAA GAGGAAAATT ATAGAAAATG GGGAAAATGA 840 
GAAGACCGTG AGTTCATXXTC AGTTATCATG CTGCCACTAC ATCGTTATCT GGAGGCAACT 
TCTCOTGCnT TTTTTTTCTC ACGCTGATGG CTTGGCAGAG CACCTTCGGT TAACTTGCAT 



10 



45 



900 

960 



15 CTCCAGATTC ATTACTCAAG CAGACAGCAC ACGAAATACT ATTTTTCTCC TAATATGCTG 1020 
TTTCCATTAT GACACAGCAG CTCCTTTGTA AGTACCAGGT CATGTCCATC CCTTGGTACA 1080 
TATATOCATT TOCnTTAAA CXJVTTTCrPr TGTTTAAATA AATAAATAAG TAAATAAAGC 1140 

20 

TAGlTCTATr GAAATCCAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1200 

1212 

AAAAAAAAAA AN 

25 

(2) INFORMATION FOR SBQ ID NO; 187: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 
GCTTCCGGAA GTTCCTTTTG TCCAAACATC CGGGCTTCTC CTTTTTGTGT TCCGGCCGAT 60 
40 CCCACCTCTC CTCGACCCTG GACGTCTACC TTCCGGAGGC CCACATCTTG CCCACTCCGC 120 
GCGCQGGGCT AGCGCGGGTT TCAGCGACGG GAGCCCTCAA GGGACATGGC AACTACAGCG 180 
GCGCCQGCGG GCGGCGCCCG AAATGGAGCT GGCCCGGAAT GGGGAGGGIT CGAAGAAAAC 
ATCCAGGGCG GAGGCTCAGC TCTGATTGAC ATGGAGAACA TGGATGATAC CTCAGGCTCT 
AGCTTCGAGG ATATXSGGTGA GCTOCATCAG CGCCTQCGCG AGGAAGAAGT AGACGCTGAT 360 
50 GCAGCTGATG CAGCTCCTGC TGAA6AGGAG GATGGAGAGT TCCTGGGCAT GAAGGGCTTT 420 
AAGGGACAGC TGAGCCGGCA GGTOGCAGAT CAGATGTGGC AGGCTGGGAA AAGACAAGCC 480 
TCCAGGGCCT TCAGCITCTA CGCCAACATC GACATCCTCA GACCCTACTT TGATGTOGAG 540 

55 

CCTGCTCAGG TGCGAACAGG GCTCCTGGAG TCCATGATCC CTATCAAGAT GGTCAACTTC 



CCCCAGAAAA TTGCAGGTGA ACTCTATGGA CCTCTCATGC TQGTCTTCAC TCTGGTTGCT 
60 ATCCTACTCC ATO3GATGAA GACGTCTGAC ACTATTATCC GGGAGGGCAC CCTGATGGGC 



240 
300 



600 
660 
720 
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ACAGCCATTG GCACCTGCTT CX3GCTACTGG CTGGGAGTCT CATCCTTCAT TTACTTCCTT 780 

GCCTACCTGT GCAACGCCCA GATCACCATG CTGCAGATGT TGGCACTQCT GGGCTATGGC 840 

CTCTTTGGGC ATTGCATTGr CCTGTTCATC ACCTATAATA TCCACCTCCA CGCCCTCTTC 900 

TACCTCTTCT GGCTGTTGGT GGGTGGACTG TCCACACTGC GCATGGTAGC AGTGTTGGTG 960 

TCTCGGACOG TQGGCCCCAC ACAGCGGCTG CTCCTCTGTG GCACCCTGGC TGCCCTACAC 1020 

ATGCTCTTCC TGCTCTATCT GCATTTTGCC TACCACAAAG TGGTAGAGGG GATCCTGGAC 1080 

ACACTGGAGG GCCCCAACAT CCCGCCCATC CAGAGQGTCC CCAGAGACAT CCCTGCCATG 1140 

CTCCCTGCTG CTCGGCTTCC CACCACCX?rC CTCAACGCCA CAGCCAAAGC TGTTGCGGTG 1200 

ACCCTGCAGT CACACTGACC CCACCTGAAA TTCTTGQCCA GICCTCTTTC CCGCAGCTGC 1260 

AGAGAGGAGG AAGACTATTA AAGGACAGTC CTGATGACAT GTTTCGTAGA TQGQGTTTGC 1320 

AGCTGCX:ACT GAGCTGTAGC TGCGTAAGTA CCTCCTTGAT GCNTGTCGGC ACTTCTGAAA 1380 

GGCACAAGGC CAAGAACTCC TGGCCAGGAC TGCAAGGCTC TGCAGCCAAT GCAGAAAATG 1440 

GGTCAGCTCC TTTGAGAACC CCTCCCCACC TACCCCTTCC TTCCTCTTTA TCTCTCCX:AC 1500 

ATTGTCTTGC TAAATATAGA CTTQGTAATT AAAATGTTGA TTGAAGTCTG GAAAAAAAAA 1560 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAG 1605 

(2) INFORMATION FOR SEQ ID NO: 188: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»4: SEQ ID NO: 188: 

ATTCGGCATG AGGQGGTCAC GTGGTGGCTG GGCCGGGGAA ATGGCGQCTT CAGGAGAGAG 60 

CGGGACTTCA GGCGGCGGAG GCAGCACCGA GGAAGCATTT ATGACCTTCT ACAGTGAGGT 120 

GAAACAAATA GAGAAGAGAG ACTCGGTTCT AACTTCGAAA AATCAGATTG AAAGACTGAC 180 

CCGTCCTGGT TCCTCTTACT TCAATTTGAA CCCATTTGAG GTTCTTCAGA TAGATCCTGA 240 

AGTTACAGAT GAAGAAATAA AAAAGAGGTT TCGGCAGTTA TCCATCTTGG TGCATCCTGA 300 

CAAAAATCAA GATGATGCTG ACAGAGCACA AAAGGCTTTT GAAGCTGTGG ACAAAGCTTA 360 

CAAGTTGCTA CTGGATCAGG AGCAAAAGAA GAGGGCCCTG GATGTAATTC AGGCAGGAAA 420 

AGAATACGTG GAACACACTG TGAAAGAGCG AAAAAAACAA TTAAAGAAQG AAGGAAAACC 480 
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TACAATTOTA GAGGAGGA1G ATXXTOAGCT C3TTCAAACAA GCTGTATATA AACAGACAAT 540 
GAAACTCTTT GCAGAGCTCG AAATTAAAAG GAAAGAGAC3A GAAGCCAAAG AGATGCATGA 600 
5 AAGGAAACGA CAAAGGGAAG AAGAGATTGA AGCTCAAGAA AAAGCCAAAC GGGAAAGRGA 660 
GTCGCAGAAA AACTTTGAGG AAAGTCGAGA TGGTCGTGTG GACAGCTGGC GAAACTTCCA 720 
AQCCAATACG AAGGGGAAGA AAGAGAAGAA AAATCGGACC TTCCTGAGAC CACCGAAAGT 780 

10 

AAAAATGGAG CAACGTGAGT GACCGCCCAA QGTCACAGGC ACAGAACCTT TCCCCTGCTA 
TCTCCCTTCC TGCTPCGAAG GACTCATTCT TTCCTCCCAC TTCCACCCCA ACATAGAGTA 
15 OTATITOCTT OTTAffrCCAT TTKnTTTCA ATACGATTTA ATATCGATCA GAGTAATTCT 
TTICTACATT GAAATCAGGG GCTTGGTTTA AAAAAAGACC TTTCCCTCTC CCTOCXXCTA 
GAACAACXIAG TATTAGAAGG TOCCACCATT GGTGCTQCCT TCTCTTCCCA CAGCCTGTAA 

20 

CTCAcnOTTT TOTACTTCAC TCAATTGTGA TQGTTAGAAA CTTCGTGGAT AGTTTGfrGGA 1140 

AATCATCCAA TTAAACATAC TCCTTAAAAC AGTGTTGCTG TGACITCAGA GACAAGCCTG 1200 

25 GAAGGGGCAC CTTAGGAAGC CCCTTCGCTT CAGTTQCTCG CTTCTGGGTG TGCTCCCTTC 1260 

GAAGGCCCAG ATAAGACAGG GAACACTTGT GAGCACACAG AGCAGCATCT GATGCCCTGT 1320 

GCTCTPTGGC ATOTGCCCCC TCTCTACTGA CCAATCAGTG TGGCATGAGG CCCACGCCAC 1380 

30 

CCAAACCrrr CACnrcCAA AGAGCTAGCC GTCCTCCACC CAGTACCATG TCCTAGCCTG 1440 

TCTGCATTO; TTACnGGTAA TATTCTTTAT GTATAATAAA TTTTTATACC CAAAAAAAAA 1500 

35 AAAAAAAAAA ACTCGA ^^^^ 



840 
900 
960 
1020 
1080 



40 (2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 base pairs 

(B) TYPE: nucleic acid 
45 <C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 189: 
50 GCTCCCATCT TGCTCGCTGT CCGTACATCA CCCTGTCCCC TGCAGGAGGG OGCTACAGGC 60 
CATCTCCCTC CIGTAGGCCT CTGACTCCCC TCCACTTTTG QGCCCTCAGC TTATCTCGQG 120 
CAGGGGACCA TTCCAGCATC CTCCCCTCCT CNGGACTCAA GGTGCIGAGG TATAAGCCCT 

55 

GGGCCCCAGA TCCCTGRTKA CACCTTCCTG GAGAAGACTC TCAAAAGTGA CTGTATATTT 
GA£?rTCACCA GCAATAACTC CCCACACTCG AAGCAGGTCC AAACCCMAGG ATCCCAGGGT 
60 CCTTGGGCTC TGTGGCACTG TCTTCCCAAG ATCCITCCTG TTGCACAATG GGAAACCTAA 



180 
240 
300 
360 
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GAGGAAAAAG ACAGGGGCCT GCTTGCXXTVG CXSITGCGAGG GATTCCATGC CCACCTGCXT 420 

TCTGYCTGCC TCGCTOGAAT GTCGGCCCCT GCTCCCOGTC AGGTTGTGCT GTCTCTGACC 480 

5 

TATOrrTACA TCCCCX3AGGG GTTTCTGCCT CCTCCCCACC CAGGTCAGGG TGTQGTCX»G 540 

CAGCTTGCTG TGGGGTGCTG ACATC5TGTCA CCACTGCCCC CCTTGCCCCX: GGGGGGGTrCA 600 

10 TGGTCTCCTC CTGGATGCTG CTCCTTGAAT YTTTTTTVTT GAWAAACCYT TTAMAATTAA 660 

AAAAAAAAAA AAAAAACTCG A 681 



15 



(2) INFORMATION FOR SEQ ID NO: 190: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1014 base pairs 

(B) TYPE: nucleic acid 

(C) STRANCCCNESS: double 
<D) TOPOIOGY: linear 

25 ixi) SEQUENCE DESCRIPTIC^l: SEQ ID NO: 190: 

GCCTCAAGCC ACGCATATGA TAATTTTCTG GAACATTCAA ATTCAGTGTT TCTACAGCCA 60 

GTTAGTCTAC AAACCATTQC AGCAGCACCA TCAAACCAGA GTCTGCCACT TTTTC3TCATC 120 

30 

GCTGGATGAT TGCTGGGCAA AGGTGGCCTT TTAGAGCTCT TAAAAGCCCA CAAAAAQGCT 180 

ATTCGTAGAG CCACAGTCAA CACATTTGGT TATATTGCAA AGGCCATTGG CCTCATGATG 240 

35 TATTGGCTAC ACTTCTGAAC AACCTCAAAG TTCAAGAAAG GCAGAACAGA GTTTGTACCA 300 

CTGTAGCAAT AGCTATTGTT GCAGAAACAT GTTCACCCTT TACAGTACTC CCTGCCTTAA 360 

TGAATGAATA CAGAGTTCCT GAACTGAATG TTCAAAATGG AGTGTTAAAA TCGCTTTCCT 420 

40 

TCTTCnTGA ATATATTQGT GAAATGGGAA AAGACTACAT TTATGCCGTA ACACCGTPAC 480 

TTGAAGATGC TTTAATGGAT AGAGACCTTG TACACAGACA GACGGCTAGT GCAGTGGTAC 540 

45 AQCACATGTC ACTTGQGGTT TATGGATTTG GTTGTGAAGA TTCGCTGAAT CACTTGTTGA 600 

ACTATGTATG GCCCAATGTR TTTGAGACAT CTCCTCATGT AATTCAGGCA GTTATGGGAG 660 

CCCTAGAGGG CCTGAGAGTT GCTATTQGAC CATGTAGAAT GTTGCAATAT TGTTTACAGG 720 

50 

GTCTGTTTCA CCCAGCCCGG AAAGTCAGAG ATGTATATTG GAAAATTTAC AACTCCATCT 780 

ACATTGGTTC CCAGGACGCT CTCATAGCAC ATTACCCAAG AATCTACCAA CGATGATAAG 840 

55 RACACCTATA TTCGTTATGA ACTTGACTAT ATCTTATAAT TTTATTGnW ATTTKGTGKT 900 

TAATGCACAS TACTTCACAC CTTAAACTTG CTTTGATTTG GTGATGTAAA CTTTTAAACA 960 

TTGCAGATCA GTGTAGGACT GGTCCATAGG GGAAGAGCTA GGAAMTCCAT AGGC 1014 

60 
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180 

240 
300 



(2) INFORMATION FOR SEQ ID NO; 191: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2779 base pairs 

(B) TfPEi nucleic acid 

(C) STRANDEDNESS: double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TCGCAGCAGG GIGTCTCCAG ATGGTCAGTC TCTGGTGGCT AGCCPGTCCT GACAGOGGAG 

AOTTAAGCTC CCGYTCICCA CCGTCCCGGC TGGCCAGGTG GGCTGAGGGT GACCGAGAGA 120 

CCAGAACCTG CITCCTCGAG CTTAGTGCTC AGAGCTGGGG AGQGAGGTTC CGCCGCTCCT 

20 CTOCIGTCAG CGCCGGCAGC CCCTCCCGGC TTCACTTCCT CCCGCAGCCC CTGCTACTGA 

GAAGCTCCGG GATCCCAGCA GCCQCCACGC CCTGGCCTCA GCCTQCGGGG CTCCAGTCAG 

GCCAACACCG ACGCGCANTC GGAQGAAGAC AGGACCCTTG ACATCTCCAT CTGCACAGAG 360 

25 

GTCCTOGCTG GACCGAGCAG CCTOCTCCTC CTAOGATGAC CTCACCCTCC AGCTCTCCAG 420 
TTTTCAGCTr GGAGACATTA GATQGAGGCC AAGAAGATGG CTCTGAGGCG GACAGAGGAA 480 
30 AGCIXSGATTT TOGGAGCGGG CTGCCTCCCA TGGAGTCACA GTTOCAGGGC GAGGACCOGA 540 
AATTCGCCCC TCAGATAAGA GTCAACCTCA ACTACCGAAA GGGAACAGGT GCCAGTCAGC 
CGGATCCAAA CCGATTTCAC CGAGATCGGC TCTTCAATGC GGTCTCCCGG GGTGTCCCCG 

35 

AGGATCK3GC IKSGACTTCCA GAGTACCTGA GCAAGACCAG CAAGTACCTC ACCGACTCGG 720 
AATACACAGA GGGCTCCACA GGTAAGACGT GCCTGATGAA GGCTGTGCTG AACCTTAAGG 780 
40 ACGGGCyrcAA TGCCTGCATT CTCCCACTGC TGCAGATCGA CCGGGACTCT GGCAATCCTC 840 
AGCCCCTCGT AAATCCCCAG TGCACAGATG ACTATTACCG AGGCCACAGC GCTCTGCACA 
TCGCCATTGA GAAGAGGAGW CTCCAGTGTG TGAAGCTCCT GGTQ6AGAAT GGGGCCAATG 

45 

IXSCATCCCCG GGTCTCCGGC GCTTCTTCCA GAAGGGCCAA GGGACTTGCT TTTATTTCGG 
TGAGCTACCC CTCTYTTTOG CCGCTTGCAC CAAGCAGTGG GATGTGGTAA GCTACCTCCT 
50 GGAGAACCCA CACCAGCCCG CCAGCCTOCA GGCACTGACT CCCAQQGCAA CACAGTCCTG 1140 
CATGCCCTAG TGATCATCTC GGACAACTCA GCTGAGAACA TTGCACTGGT GACCAGCATG 1200 
TATGATGGGC TCCTCCAAGC TCGGGCCCGC CTCTGCCCTA CCGTCCAGCT TGAQGACATC 1260 
CCXTAACCrcC AGGATCTCAC GCCICTGAAG CTGGCCGCCA AOGAGGGCAA GATCGAGATT 1320 
TTCAGGCACA TCCTCCAGCG GGAGTrTTCA GGACTGAGCC ACCTTTCCCG AAAGTTCACC 1380 
60 GAC?roC?rcCT ATCGGCCTCT CCGGGTCTCG CTCTATGACC TGCCTrcrGT GGACAGCTGT 1440 



600 
660 



900 
960 
1020 
1080 
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GAGGfiGAACT CAGTGCTGGA GATCATTGCC TTTCATTGCA AGAGCCCGCA CXXSACACOGA 1500 

ATGGTCGTTT TCGAGCXXTCT GAACAAACTG CTGCAQGCGA AATGGGATCT GCTCATCCCC 1560 

5 

AAGTTCTTCT TT^AACTTCCT C^TGTAATCTG ATCTACATGT TCATCTTCAC CGCTGTTGCC 1620 

TACCATCAGC CTACCXTOAA GAAGCAGGCC GCCCCTCACC TGAAAGCGGA GGrTCGAAAC 1680 

10 TCCATGCTGC TGACGGGCCA CATCCTTATC CTGCTAGGGG QGATCTACCT CCTCGTGGGC 1740 

CAGCTGTGGT ACTTCTGGCG GCXSCCAOSTG TTCATCTGGA TCTCGTTCAT AGACAGCTAC 1800 

TTTGAAATCC aX,TlXX"llJlT CCARGCCCTG CTCACftCTGG TGTCCCARGT GCPGTGTTTC 1860 

15 

CTGGSCATCG AGTGGTACCT GCCCCTGCTT GTGTCTGCGC TCGTGCTQGG CTQGCTGAAC 1920 

CTGCTTTACT ATACACGTQG CTTCCAGCAC ACAGGCATCT ACAGTGTCAT GATCCAGAAG 1980 

20 CCCTGGTGAG CCTGAGCX:AG GANNTTQGCG CCCCGAAGCT CCTACAGGCC CCAATGCXAC 2040 

AGAGTCAGTG CAGCCCATGG AGGGACAGGA KGACGAKGGC AACGGGGCCC AGTACAGGGG 2100 

TATCCTQGAA GCCTCCITGG AGCTCTTCAA ATTCACCATC GGCATGGGCG AGCTGGCCTT 2160 

25 

CCAGGARCAG CTGCACTTCC GCQGCATGGT GCTGCTGCTG CTGCTGGSCT ACGTGCTOCT 2220 

CACCTACATC CTGCTGCTCA ACATGCTCAT CGCCCTCATG AGCGAGACCG TCAACAGTGT 2280 

30 CX3CCACTGAC AGCTQGAGCA TCTGGAAGCT GCAGAAAGCC ATCTCTGTCC TGGAGATGGA 2340 

GAATQQCTAT TGGTGGTGCA GGAAGAAGCA OCGGGCAGGT GTGATGCTGA CCGTTGGCAC 2400 

TAAGCCAGAT GGCAGCCCSG ATGAGCGCTG GTOCTTCAGG GTGGAGGAGG TGAACTGGGC 2460 

35 

TTCATGGGAG CAGACGCTGC CTAOGCTGTG TGAGGACCCG TCAGQQGCAG GTGTCCXrPCG 2520 

AACTCTCGAG AACCCTGTCC TGGCTTCCCC TCCCAAGGAG GATGAGGATG GTGCCTCTGA 2580 

40 GGAAAACTAT GTGCCCGTCC AQCTCCTCCA GTCCAACTGA TGQCCCAGAT GCAGCAQGAG 2640 

GCCAGAGGAC AGAGCAGAGG ATCTTTCCAA CCACATCTGC TGGCTCTGGG GTCCCAGTGA 2700 

ATTCTGGTGG CAAATATATA TTTTCACTAA CTCAAAAAAA AAAAAAAAAA AAAAAAAAAA 2760 

AAAAAAAAAA AAAAAAGGC 2779 



45 



50 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOIiOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 
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ACCCGCTCCG CTCCGCTCCG CTCGGCCCXX5 CX3COCXXXX3T CAACATGATC CGCTGCQGOC 60 

TCGCCTGCGA GCGCTCCCGC TGGATCCTQC CCCTOCTCCT ACTCAGCGCC ATCGCCTTCG 120 

5 AC»TCATCX3C GCTGGCCGGC CGCX3GCTGGT TGCAGTCTAG CGACtACXSGC CAGACX3TCCT 180 

CC3CTGTGGTG GAAATGCTCC CAAGAGGGCG GCGGCAC5CGG GTCCTACGAG GAGGGCTGTC 240 

AGAGCCTCAT GGAGTACGCG TGGGGTAGAG CAGCGGCTGC CATGCTCTTC TGTQGCTTCA 300 

10 

TCATCCTQGT GATCTGTTTC ATCCTCTCCT TCTTCGCTCT CTGTQGACCC CAGATGCTTG 360 

TCTTCCTGAG AGTGATTGGA GGTCTCCTTG CCTTGGCTGC TGTGTTCCAG ATCATCTCCC 420 

15 TQC3TAATTTA CCCCGTCAAG TACACCCAGA CCTTCACCCT TCATGCCAAC CSTGCTOTCA ' 480 

CTT7WCATCTA TAACTCGGCC TACGGCTTTG OSTGGGCAGC CACGATTATC CTGATVGGCT 540 

GTCCCTTCTT CTTCTGCTGC CTCCCCAACT ACGAAGATGA CCTTCTGGQC AATGCCAAGC 600 

20 

CCAGGTACTT CTACACATCT GCCTAACTTG GGAATGAATG TGGGAGAAAA TCGCTGCTGC 660 

TGAGATQGAC TCCAGAAGAA GAAACTGTTT CTCCAGGCGA CTTTGAACCC ATTTTTTQGC 720 

25 AGTGTTCATA TTATTAAACT AGTCAAAAAT GCTAAAATAA TTTQQGAGAA AATATTTTTT 780 

AAGTAGTGTT ATAGTTTCAT GTTTATCTTT TATTATGTTT TGTGAAGTTG TGTClTiTCA 840 

CTAATTACCT ATACTATGCC AATATTTCCT TATATCTATC CATAACATTT ATACTACATT 900 

30 

TGTAAGASAA TATGCACGTG AAACTTAACA CTTTATAAGG TAAAAATGAG GTTTCCAAGA 960 

TTTAATAATC TCATCAAGTT CTTOTTATTT CCAAATAGAA TGGACTCGGT CTGTTAAGGG 1020 

35 CTAAGGAGAA GAC5GAAGATA AGGTTAAAAG TTGTTAATGA CCAAACATTC TAAAAGAAAT 1080 

GCAAAAAAAA AGTTTATTTT CAAGCXTTTCG AACTATTTAA QGAAAGCAAA ATCATTTCCT 1140 

AAATCCATAT CATTTGTGAG AATTTCTCAT TAATATCXTTG AATCATTCAT TTCAGCTAAG 1200 

40 

GCTTCATGTT GACTCGATAT GTCATCTAGG AAAGTACTAT TTCATGGTCC AAACCTGTTG 1260 

CCATAGTTGG TAAGGCTTTC CTTTAAGTGT GAAATATTTA GATGAAATTT TCTCTTTTAA 1320 

45 AGTTCTTTAT AGGGTTAGGG TGTGGGAAAA TGCTATATTA ATAAATCTGT AGTGTTTTGT 1380 

GTTTATATGT TCAGAACCAG AGTAGACTGG ATTGAAAGAT GGACTGGGTC TAATTTATCA 1440 

TCACIGATAG ATCIGGTTAA GTTGTGTAGT AAAGCATTAG GAGGGTCATT CTTGTCACAA 1500 

50 

AAGTGCCACT AAAACAGCCT CAGGAGAATA AATGACTTGC TTTTCTAAAT CTCAGC3TTTA 1560 

TCTGGGCTCT ATCATATAGA CAGGCTTCTG ATAGTTTGCA ACTGTAAGCA GAAACCTACA 1620 

55 TATAGTTAAA ATCCTQGTCT TTCTTGCTAA ACAGATTTTA AATOTCTGAT ATAAAACATG 1680 
CCACAGGAGA ATTCGGGGAT TTGAGTTTCT CTGAATAGCA TATATATGAT GCATCGGATA 1740 
GGTCATTATC ATTTTTTACC ATTTCGACTT ACATAATGAA AACCAATTCA TTTTAAATAT 1800 
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CAGATTATTA TTTTGTAAGT TGTOGAAAAA GCTAATTGTA GTTTTCATTA TGAAGTTTTC 1860 
CXIAATAAACC AOGTATTCTA AAAAAAAAAA AAAAAAAC^M GAGOGGGGGC CCGGTACCCA 1920 
5 ATT 1923 



10 (2) INFORMATIC»I FOR SBQ ID NO: 193:- 



(i) SEQUENCE CHARACTERISTICS: 

(A) hmom-. 2346 hase pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) TOPOIOGY: linear 

(xi) SEQX3ENCE DESCRIPTION: SBQ ID NO: 193: 



20 AGGCTCAGGG GGACACTCTC AAAATTACAC AGCTTTTAAC AGGTGGCAGA ATTGQGGTTC 60 



AGACCCAGAT CTGGGTTCAA GTCACTCATG GTGTC3ATTGC GGCATTCCTT CCCGCATCTG 120 



GGCCTTGCCA TCTCTCTCTC CGAiGTGGACA TGGAGAGGAC GC5C3GGCCCAG CAGCTGGATG 180 

25 

GCTGCAGGGG ATCAAGTCTT CTCTGGGGCT GGGCACGTAN AAGAGCATGT GGCTGGTGGA 240 



CGGCATGCCT GGCTCCTCAC CTGGCAGTCT GCCTC3CCCTG CTAACCGGCT GTCTCTTGTT 300 



30 CCCCTAGTGC CCTCGGCTAG CATGACCCGC CTGATGCGWT SCCGCACAGC CTCTGGTrCC 360 



AGCGTCATTC TCTGGATGGC ACCCGCAGCC QCTCCCACAC CAGOGAGGQC ACCOGAAGCC 420 



GCTCCCACAC CAGCGAGGGC ACCCGCAGCC GCTCGCACAC CAGCGAGGGG QCCCACCTGG 480 

35 

ACATCACCCC CAACTCGQGT GCTQCTOGGA ACAOJGCCGG GCCCAAGTCC ATGGAGGTCT 540 



CCTGCTAGGC GGCCTGCCCA GCTGCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC 600 



40 CTCCCCGGCC CCTTTTCGCC CCCTGCCTGC CATACTGCGC CTAACTCGGT ATTAATCCAA 660 



AGCTTATTTT GTAAGAGTGA GCTCTQGTGG AGACAAATGA OGTCTATTAC GTGGGTGCCC 720 



TCTCCAAAGG CGGGGTGGCG GTGGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC 780 

45 

CTCTTCCATT AACCAGTGGC CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT 840 



GCCAAAAACA AGACGCGTAC AGCACACACT TCACAAAGCC AAGCCTAGGC CGCCCTGAGC 900 



50 ATCCTGGTTC AAACGGGTGC CTGGTCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT 960 



AACTGAGGAG AAGCTGATCC AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG 1020 



GGTAATAGTG ACATGCAGGC ACCTCTTTTA AACAGGCAAA ACAGGAAGGG GGAAAAGGTG 1080 

55 

GGATTCATGT CGAGGCTAGA GGCATTTGGA ACAACAAATC TACGTAGTTA ACTTGAAGAA 1140 



ACCGATTTTT AAAGTTQGTG CATCTAGAAA GCTTTGAATG CAGAAGCAAA CAAGCTTGAT 1200 



60 TTTTCTAGCA TCCTCTTAAT GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG 1260 



M^O 98/39448 



424 



PCT/US98/04493 



10 



15 



20 



25 



30 



35 



40 



ACAAAAATAT TTCAGCAAAC GTTGGGCATC ATGGTTTTTG AAGGCTTTAG TTCTGCTTTC 1320 

TCCCTCTCCT CCACAGCX:CC AACCTCCCAC CCCTGATACA TGAGCCAGTG ATTATTCTTG 1380 

TTCAGGGAGA AGATCATTTA GATTTGTTTT GCATTCCTTA GAATGGAGGG CAACATTCCA 1440 

CAGCTCCCCT GGCICTGATG AGTOTCCnTG CAGGGGCCC3G AGTAGGAGCA CTGQGC3TGGG 1500 

QGCGGAATTC GGGTTACTCG ATGTAAGGGA TTCCITGTTG TTGTGTTGAG ATCCAGTGCA 1560 

CTTOTCATTT CTGTQGATCC CAGCTTOGTT CCAGGAATTT TGTGTGATTG GCTTAAATCC 1620 

AOTTTTCAAT CTTCGACAGC TGGGCTGGAA CX5TGAACTCA GTAGCTGAAC CTGTCTGACC 1680 

CGGTCACGTT CTTGGATCCT CAGAACTCTT TGCTCTTGTC GGGGTGQGC3G TGQGAACTCA 1740 

CGTGGGGAGC GGTGGCTGAG AAAATOTAAG GATTCTGGAA TACATATTCC ATGQGACTTT 1800 

CCTTCCCTCT CCTGCTTCCT CTTTTCCTGC TCCCTAACCT TTCGCCGAAT GGGGCAGCAC 1860 

CACTCAOSTT TCTQQGCX3GC CAGTGCGGCT GCCAGGTTCC TGTACTACTG CCTTGTACTT 1920 

TTXyVTTTTQG CTCACCGTCG ATTTTCTCAT AGGAAGTTTG GTCAGAGTGA ATTGAATATT 1980 

CTAAC5TCAGC CACTCGGACC CGAGGATTTC TGGGACCCCG CAGTTGGGAG GAGGAAGTAG 2040 

TCCAGCCTTC CAGGTGGCGT GAGAGGCAAT GACTCGTTAC CTGCCXXTCA TCACXHTOGA 2100 

GGCCTTCCCT GGCCTTGAGT AGAAAAGTCG GGGATCGGOG CAAGAGAGGC TGAGTACX3GA 2160 

TQGGAAACTA TTOTGCACAA GTCTTTCCAG AGGAGTTTCT TAATGAGATA TTTGrTATTTA 2220 

TTTCCAGACC AATAAATTTG TAACTTTGCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2280 

AAAAAAAAAA AAAAAAAACT CGAOGGGGGC CCGTACCCAA OTCGCCGTAT ATGATCGTAA 2340 

ACAATC 2346 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 194: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 
TATCTGAACC ACCCTTTATT CTACATATGA TAGGCAGCAC TGAAATATCC TAACCCCCTA 
AGCTCHAGGT GCCCTGTGGN ACGAGCAACT QGACTATAGC AGGGCTQGGC TCTGTCTTCC 
TX3GTCATAGG CTCACTCTTT CCCCCAAATC TTCCTCTGGA GCTTTGCAGC CAAGGTGCTA 
AAAGGAATAG GTAGGAGACC TCTTCTATCT AATCCTTAAA AGCATAATGT TGAACATTCA 



60 
120 
180 
240 
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TTCAACAGCT GATGCCCTAT AAOXCTGCC 
AGCAAGATCT TTACATAATT CAGAGTGGrT 
5 CTCCATTTAT TTGACTAAAG CATCACACAG 
AATACAGTGC TTTATGGCTC TAACATTACT 
GATGGCAGCC TCAGGGCTTC CTTATC3TCCT 

10 

CTTTTTCCCC TA'TCCTGTTC TTCCCCTCCC 
GGTTCTTGGG CTAGGTAGTG GGGACCAAGT 
15 AAACTACGGT ACCAGTGTTA GTGGGAAGAG 
TACTCCTACC TGGTCAACCC GCTGCTTCCA 
TGATAAC3GGA GAGOGAAATA CAAGGAGGGC 

20 

CACAASCCAT AAACCAATAA AACAAGAATA 
ATTCCCACTG CACTTGGTGC TGCrTTGGCT 
25 GACAGGAAGA CTGGAGACTG*TXXACTTCTA 
AGAACTGCTA CCATGAAGTG AAAATGCXIAC 
GAAAAACTGG CTTTTTCCCA GOCCTTTCCA 

30 

GTCCCATCAG CCTATTATTT TTTTAAAGAA 

CTTCCTTCCT GCCCCAAAAT TATAAACTCT 
35 CTrGCTTGTA AAAATATGTA TTATACATCT 
CTCTCCCATT CTCCACTCAC TGCATTTGGG 
CATTGCAGGC CAGTGGACAG AGGGAGAAGG 

40 

TTTCTGACTG ATCCTGAACA AGAAAGAGTA 
CTCCAAAACA CTTATCCTCC TGCAAGAGTG 
45 TAAGCXXXXT CCTCACCCCT TCXTTTTTTTC 
TOGAAAAGAA ACAATATGCT TTACACTCAT 
AAAAATACGG CAGGTGQCCT AAGGCTGCTG 

50 

TACAAGATAA AAAACGAATC CCCTAAACAA 
(XACCTTTCC TGTTCATGAC AGCTACTAAC 
55 AAAGTGGGTC ACCTGACCTC TGAAGAGCTG 
GATGCCAAGG AGGTCCCAQG AAGTCCAGCT 
GCAAGTGAGG CAAGAGAAAT GAGGAAGAAT 



TGGATTTCTT CCTATTAGGC TATAAGAAGT 300 

TCATTGCXOT CCTACCCTCT CTAATGGCCX: 360 

TGGCACTAGC ATTATACCAA GAGTATGAGA 420 

GCCTTCAGTA TCAAGGCTGC CTGGAGAAAG 480 

CXa^CCACAAG AGCTCCTTGA TGAAGGTCAT 540 

CXSCrrCCTAAT GGTACGTGGG TACCCAGGCT 600 

TCATTACCTC CCTATCAGTT CTAGCATAGT 660 

CTGGGTTTTC CTAGTATACC CACTGCATCC 720 

GGTATGGGAC CTGCTAAGTG TGGAATTACC 780 

CTCTQGTGTT CXTOGCCTCA GCCAGCTGCX: 840 

CTGAGTCAGT TTTTTATCTG GGTTCTCTTC 900 

GACTGGGAAC ACCCCATAAC TACAGAGTCT 960 

GCTOGGAACT TACTGTGTAA ATAAACTTTC 1020 

ATTTTGCTTT ATAATTTCTA CCCATGTTGG 1080 

GGGCATAAAA CTCAACCCCT TCGATAGCAA 1140 

AACTTGCACT ' mrm 'i' cn ' tttacagtta 1200 

AAGTGTAAAA AAAAGTCTTA ACAACAGCTT 1260 

GTATTTTTAA ATTCTGCTCC TGAAAAATGA 1320 

GCCTTTCCCA TTGGTCTGCA TGTCTTTTAT 1380 

GAGAACAGGG GTCGCCAACA CTTGTGTTGC 1440 

ACACTGAGGC GCTOGCTCCC ATGCACAACT 1500 

GGCTTTCCAG GGTCTTTACT GGGAAGCAGT 1560 

TTTCTTTACT CTTTTGGCTT CAAAGGATTT 1620 

TTTCAATTTC TAAATTTGCA QGGGATACTG 1680 

TAAAGTTGAG GQGAGAGGAA ATCTTAAGAT 1740 

AAAGAACAAT AGAACTGGTC TTCCATTTTG 1800 

CTGGAGACAG TAACATTTCA TTAACCAAAG 1860 

AGTACTCAGG CCACTCCAAT CACCCTACAA 1920 

CCTTAAACTG ACGCTAGNCA ATAAACCTGG 1980 

(XATCTGTGA GGTGACAGGC AAGGATGAAA 2040 
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GACAAAGAAG GAAAAGftGTA TCAAAGGCAG AAAGGftGATC ATTTAGITOG GTOIGAAAGG 2100 

AAAAGTCTTT GCTATCCGAC ATGTACTGCT AGTACCTGTA AGCATTTTAG GTCCCAGAAT 2160 

GGAAAAAAAA ATCAC3CTATT GGTAATATAA TAATGTCCTT TCCCTGGAGT CAGTTTTTTT 2220 

AAAAAGTTAA CTCTTAGTTT TTACTTGTTT AATTCTAAAA GAGAAGQGAG CTGAGGCCAT 2280 

TCCCTOTAGG AGTAAAGATA AAAGGATAGG AAAAGATTCA AAGCTCTAAT AGAGTCACAG 2340 

CTTTCCCAGG TATAAAACCT AAAATTAAGA AGTACAATAA GCAGAGGTGG AAAATGATCT 2400 

AGTTCCTGAT AGCTACCCAC AGAGCAAGTG ATTTATAAAT TTGAAATCCA AACTACTTTC 2460 

15 TTAATATCAC TTTGGTCTCC ATTTTTCCCA QGACAGGAAA TATGTCCCCC CCTAACTTTC 2520 

TTCCTTCAAA AATTAAAATC CAQCATCCCA AGATCATTCT ACAAGTAATT TTGCACflGAC 2580 

ATCTCCTCAC CCXIAGTQCCT GTCTGGflGCT CACCCAAGOT CACCAAACAA CTTGGTTGTG 2640 

20 

AACCMAACTC CCTTAACCTT CTQQGGGAGG GGGATTAGCT AGACTAOGAG ACCAGAAGTG 2700 

AATGGGAAAG GGTGAOGACT TCACAATGTT GGCCTGTCAG AGCTTGATTA GAAGCCAAGA 2760 

25 CAC3TGGCAGC AAAGGAAGAC TTGGCCCAGG AAAAACCTGT GGGTTGTGCT AATnCTGTC 2820 

CAGAAAATAG GGTGGACAGA AGCTTGTGGG GTGCATGGAG GAATTGGGAC CTGGTTATGT 2880 

TCTTATTCTC GGACTGTGAA TTTTGGTGAT GTAAAACAGA ATATTCTGTA AACCTAATGT 2940 

CTCTATAAAT AATGAGCGTT AACACAGTAA AATATTCAAT AAGAAGTCAA AAAAAAAAAA 3000 

AAAAAACTCG AGGGQGQGCC CGGTACCCAA TTTNCXaJUVP AGAGATNC3TA TTAC 3054 



30 



35 



55 



(2) INFORMATION FOR SEQ ID NO: 195: 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

45 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 
QGCAGAGCTC GTGGCCGNAA CTTTTTCTGC TCCTQGCTGC CACCTACTQG CTGGCCGCGG 60 
50 CCCTQGCCTG GQCCTGCACC AGCCTGCQilG CGGGCTCCCA CAGCAGCCCC CTTCCAAGCA 120 
GCGTCCCCAC ACCGCGCACC TTCTGCGGGA ACGTGCTCGC CGTQCCGGGG AOCATATQGA 180 
CGGAAQGCTT TGTGCTCACC TACAAGCTGG GTGAGCAGGG TGCCAGCAGC CTGTTGATCC 240 
TCTTGGCTCC TGCTGGAGCA CGAGCGGCGT TTCTGCTCCC GAGTTGGGAC TGTGGAATGG 300 
TGTGGGTX3CT GTGGTCTGCT CCATCGCTGG CTCCTCCCTG GGTGGGACCT TGCTGGCCAA 360 
60 GCACTGGAAA CTGCTGCCTC TGTGAGGTCG GTGCTGOGCT TCCGCCTCGG GGGCCTAGCC 420 
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TGTCAGACTG CTTTGGTCTT CCACCTTQGA CAOCCTGQGG GCCAGCATGG ACGCTGGCAC 480 

AATCTIGAGA GGGTCAGCCT TGCTGAGCCT ATX3TCTQCAG CACTTCTTGG GARGCCTGGT S40 

5 

CACCACAGTC ACCTTCACTG GGAATGATGC GCTGCAOCCA GCTGGCCCCC AGGGCCTTGC 600 

AiGGCCACACA CTACAGCCTT CTGGCCACGC TGGAGCTGCT GGGGAAGCTG CTGCTGGGCA 660 

10 CTYTGGSCGG AGGGCCTGGC TGAtGGGTTG GGGCCACATC CCTGCTTCTT GCTCCTGCTC 720 

ATCCTCTCTG CCTTTCCCGT TCTGTACCTG GACCTAGCAC CCASCACCTT TCTCTGAGCT 780 

GAGTOGCTQG AGTGGTCAAT AAAGCCACAT GTGCXTOTGG OCCAAAAAAA AAAAAAAAAA 840 

15 

AAAAAAAAAA AAAAAAACTG GAGGGGGQGC CCGGTACCCA AATCGCCGGA TATC3ATCGTA 900 

AACAATC 907 

20 



<2) INFORMATION FOR SEQ ID NO: 196: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 
GGCACGAGGA GGGACAGGGA GTGGGCAAGG GGAAGAAGCA GCTTATTTGA CTAACCAGCC 60 
35 CCTCTGTGGT CCACCAGCGT CTTGGCTTGG TGGGAGGGCT CTCAATCAGC AGGGCCCCAG 120 
KAGGGCAAGA AGAAGTOGGG CAAAGCCTGG CGCTCGGCCG CGGTCGCGGC AGCTTTGCMA 180 
TCTGGAQCCA CGCCTCCTCC AGGCXMGCT CCTTGAACTT GGAAATGTCA ACCGGAGCCC 240 

40 

TTAACACCAG CCCTCCAGCA TCTAATAGAC TTGAATCTAC TCTAAACGAA TATTTAATCC 300 
AACCTCAACT ACATTGTAQC TCAGTCCAAC GACTAACCCT GAAATGGGGG TGTTCCAGCC 360 
45 TTCAGCGAGA TGGCCAAGCG GTCCCCTGGG QGCTGTQGCA GCQGGCTTAT CCTTCTCTGT 420 
TGCCAACCTT GCCGTCCGAC CTCCTCCGCC CCCATGCGGT GACCCCGTCC GTGTCTGTGT 480 
CTGTCCATAC GTGTGAGTCC AGCTAAAAAG ACAAAACAGA ACCCGTGGGC CCAGCTCGGA 540 

50 

AGGTGCGTGG AGAAGGCTCC GACGTCTCCG AAGTGCAGCC CTTGGGATGG CATTCCGTTG 600 
TGTGCCTTAT TCCTGGAGAA TCTGTATACG GCTCGCCTAT AAGAAATATA GCCTCTTCAT 660 
55 GCTGTATTAA AAGGACTTTT AAAAGCAAAA AAAAAAAAAA AAAAACTCGA GGGGGGGCCC 720 
GGTACCCAAT TCGCCCAATA GTGAGTCGTA TTACAATTCA CTGQGCCGTC STTTTAACAA 780 
CGTCGTGAAC TGGGAAAACC CTGGCGTTTA CCCAACTTAA TCGCCTTGCA GCACATCCCC 840 
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CTTTCGCCAG CTGGCGTTAA TAGCGAAAAA NGCCCXXIACC CGAATCGCXX: TTCCCAACAG 



15 AQGGGANCCC CCCATTTAAA ATTTTNGGTN 



900 



TTIOCGCAGC CCTGAATCGC GAAATGGCAA ATTGTAAGCG TTTAATATTT TKKTTAAAAT 960 

5 TCCNcx?nm A w mrxtjrr taaatcarct caattttttt aacccaataa gsccgaaatc 1020 

CXSGCAAATCC CCYTTATTAA TTCCAAAAAA ATAAACCSAA AAWQGGTTTG AATTTTTTKT XOBO 

iTCCCx:AyrT TTGGAAACAA AWTYCCCCCT TTTTAAAAAA GTTGGAACCC CCAMCCYTCC 1140 

AAAGGQGAAA AAACSYTTTT VTGGGGGGNA ANQGC3GCCCC CNTACTTTNA ACAYCCCCCC 1200 

CCAAWCAATT TTTTTQGGGG GTCCCKAAAG GTCCCCCTAA AANCTTTTTT CGGAACXXm 1260 



1290 



20 (2) INFORMATION FOR SEQ ID NO: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SBQOENCE DESCRIPTION: SEQ ID NO: 197: 
30 GCTGK3CCTC GATQGTCGTG TAGGTGAGTT TTACCAAGGA TTATGGTAfiC AAATGAGTGA 60 

GACCICTATC GAGAAAATAT TCAAGNNCAT TAAAGAAGAC CTCATANTAG GAGftGAATGT 120 

SCTTTCGAGG ATPICTATrG AGCTTTTACA GTATTCATTT TTCAACTCAA GGCAATGGCT 180 

35 

TTCTACACCA ACTCTAATCC ATAAACGGGT CTTATGACAT CTATGAAGTA GTAGCAAGAC 240 

ATCCTTAGTG TGTATTTCTC TCTTTGAGAC ACTGTAATTT CTACCAGAAA TTTCCAGAGC 300 

40 ATTATOTAGG TAGAAAAAAA TGCAAGCAAG CTGITAAAGA TCTTGGATCC CATTATATAG 360 

TATGTATAGC TGAAATCTGT AATTCAATCA CTTTTTCTCT TTTATCCTCT AACCAAAAAA 420 

TTCTTTAATT TTGCATCCCA AATGTTTTTA ATCTTTGTAT ATTTTTTAAA AAYCCTTTTC 480 

45 

TCCTCATCAT IXXXTTrrTTT GTGGTTGTAA ATAGACTTAC TTGCACTTTG AAGATGAGTT 540 
ACTCCTTGTC ATCTTACAAA TATOTGATAT GGTAATTTTC ATAACAGATG TCAGTTTTGA 600 
50 ACCAAGAATT QOTGATITCT TTATAAGAAA AAAACTGGCT TCATTTCTGT GAAATTQCTC 660 
TTTGAAAATr TCTTTrTACA CGTGTAAGCC AACTGAGATA CCGTGATGGT GTTGATTTCT 720 
TTCAATGATG CITACCATCT ATTTTAGCCA CTGAGCCTTT TATTATTTGT CTATTTGTAA 780 
AGTTTArrrG TCTTAACTCA TTTAATAAAT ATACTGTTTA TCTGTTTCTG AATGGGGACT 840 



GAACTmTG GATATTGATA TTGATTTGAA AATATTTTGG AATTTTTTCT ACTTGAAATT 



900 



60 TTAGAAATCT AATKGAAAAT TCTATAATGT ACTGAAAGTA WGGTTGTGTA CAGTGAKCAC 960 
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TCTCTAATAA TATGATGNCT TGCCXTTAAAN GAGGNGGGAC ATGTCCCACT TTCCACCACG 1020 



5 

.(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 
10 <A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

AATTCCCGAA GCTGAGGGTT GTGTGCCNTC GGGCGAGCCA AGTCTTTTGA CCGGACCCTT 60 

CCCGGCGCAG AAGANCTGAA GTTGATTTGA GAGCCTGTKT TTGGGGTTRA GCCGAGCTGC 120 

20 

TGCGGGCrry GTCGCCGGCC AGGACACAAG YTACTTGCAA CGGGGCGGCG CCTGGCTTAT 180 

GATGTTCCTC AACCCAGGGG CGGCCTCTGC CCTCTACTCG TQCCAGGCCC ACTTGCCAGG 240 

25 CAGGAGCCCT CCCCAAGCCT TCAGQGCTGC TCGGAGTCAC CTGTTGGAAT GGACTAAAAG 300 

GACCCTTGTG TGGGAACAGG TGCTCCAAAC ACCCTGCTGC TGGCTGCCAG GCAGGCCCTC 360 

TGGAAGGGAA GGGGCAGGAC TCATCAGGAC CTCCCTGGAC CCTGCAOGGC AOGCAGTTGG 420 

30 

CCCGftGCCCA AGCATTTGGC TCTGCTTGCC CCAACX3GGAC AGGAAGCCTC TTQQGCCTCT 480 

TCCCTTCCTG GACAAGGCCC CCTOCCTTTG CCTCACATAA ACTG 524 

35 



(2) INFORM^TIC^J FOR SEQ ID NO: 199: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
{D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 
GTGATACAAG GAAGGGTGAT CATCATCTGT CACCATGCAA TTCCTGCTCA CAGCCTTTCT 60 
50 GTTGGTGCCA CTTCTGGCTC TTTGTGATGT CCCCATATCC CTAGGCTTCT CCCCCTOCTA 120 
GAAGQGCTTC TTGATAGATT AGAAAATAAG AATGAGTGAC ATTTCCTATG TGCATATAAG 180 
AAQGAGCCAC AAGACATGTC TTTTAAATAA AAGGACAGTG TCCATCCTTT TAGCTGCCGA 240 

55 

ATAGAACCTT GGTCTCATCC TCCTGGAGCT AGGSCTTAAA ACAGCTTCTG TGTTTCTSAT 300 
TKGTCTCART GTTTTGCCAA GGTTTTATTC GG 332 

60 
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20 



(2) INFOPMATION FOR SEQ ID NO: 200: 

(i) SBC^JENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDECNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

CCAGGGAAGC CCCARGCCTG TCCTGAATTG ACATCAGTGC TTCCCTGAAC TGCCTCCCCC 60 

15 ACCCCTGGGC ATTATCCCAG GAAACTTATG TTTTCTAGAA GCTAAGCAGC TGCTQGGACT 120 

CAGGGACTGG TGCAGGTAGG CTGAGTGGCA GCTCAGTCCT AGAAGGTCTC TGAAGATCTG 180 

GACTGAGGAC CYTGCTACTC CCCAAGCCAG AGCCCATCAG CCAGGCCTGC TGTGAGCCAC 240 

CK3CCTGTGG AGTGCTGAGC TCAACCAAAG GCTQGCAAGC TCTGQGCCTC ATTTAAGQGA 300 

TTCTGATGAG CCGATGQGCC CTGGAGGCAG CCCATTAAAG CATCTGGCTC GTTTTTGGAA 360 

25 AAAAAAAAAA AAAAAG 376 



30 (2) INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS: double 

<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 201: 

40 CCCAGTATAT TTCTATAACA TTTATTTTAG TGAACTTATA ATGITTCTTT GTATTAAATT 60 

ATTAGATTAT ATCTTTAGAT AATATTGTTA CTNAATTAGT AGGTAATATA TATTTTATTC 120 

AAAAATAAAT TGTGCATCTA ATGTCTACCA ATTAATGTAC TTGTAGATGT ATCTTATCTT 180 

45 

AACTTCAGTC TTTGCTQCCC CTAATGAGGT GTGAAGGACT CTTCTCCCCT QQGGAAGTTT 240 

TTCTmrCA GGAGGGAGGA GGGCTTTCCC AGGTAATGTG TCTAGAGTGT TGGGCAGAAR 300 

50 AATCTGGGAC CACACCAC7«: CAGTTCTCTC CTTAATCCAC GTCATTTGOC TTCTATCCCA 360 

GCTATGTTTC CAGTGTCCTC TQGGTGTTTC CAAGAGCAAC AAGAAAYGAA TAAATCTCTG 420 

KTGAGTTGTT TATTTGTTCT TCACTTT G TT TTACACTGTA WTTTCTGAGT TTATGGGTGT 480 

55 

CTGTGAATTA AAAAGGAAAA GTRGAAATAA GTAAAACTCA GGTTGAAGGA AATATACATA 540 

AATAAGATAA AGCTGACCTG TAGATATARR CAGGTTATAA RAGCTTAGAG TTGTCTAAGT 600 

60 TCRGTGCAAA KTTTCCTCTG ATCTTTCTGA TGCCGARACA AAAAAGGCAG TCATGTTTGT 660 
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WATGTGATTG GAATGGAACC CGAFAAGAGA GCAYGCTGTG TTCTTGGGGA CAGGAAAGCT 720 

TGYGTGCACC AAGTCTKAAC CACCACCTTC ATGQGACATA GRTTATGTGC TGGAACATAT 780 

5 

TTCACACCGG CCTGGCAGTA AACACTTGTA GTGTTGTGCA GTGGAAACGG TCATCTTCCX3 840 

CTAAAGCACG GCGTGTTGTG CAGCX3GAAAT GGTCATCTGC TGCTAAAACA CAGCTTCCAT 900 

10 CX3TAATGTAT GCTCCTTACT CAAAGAGTGT QGTCCCAAAC AGCCTTTGGG AGGTCCTCCT 960 

TGATTCATGG ATGAAACCTG GAACATCTTG AGGACTGAOT TAACCATAQG TCCTTAAATA 1020 

ACTCTCCACA COTITTTCTT AGTTTATCTC TACATGCAGG GTGTGCAGCA GCCTGTTCAA 1080 

AGTCATATTT TCTGGGAAAT ATTTCCAGTG TTTATTTGCA CTTTAGCCCA CTCTGTGTAG 1140 

CCTTATTTCT TCTAAACTCA CCATTAATCT GAATAATAGT CAAATTTAGG GG 1192 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 202: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) L£I9GTH: 589 base pairs 

(B) TfPB: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202: 
ATCTTGGGCT ATCTTTGACA GGGGATTCTT GCAAGTTGAT GCTTTCTACA AGTGAATATA 60 
35 GTCAGTCCCC AAAGATGGAG AGCTTGAGTT CTCACAGAAT TGATGAAGAT GGAGAAAACA 120 
CACAGATTGA GGATACGGAA CCCATGTCTC CAGTTCTCAA TTCTAAATTT GTTCCTGCTG 180 
AAAATGATAG TATCCTGATG AATCCAGCAC AGGATGGTGA AGTACAACTG AGTCAGAATG 240 

40 

ATGACAAAAC AAAGGGAGAT GATACAGACA CCMGGGATGA CATTAGTATT TTAGCCACTG 300 
GTTGCAAGGG CAGAGAAGAA ACGGTAGCAG AAGATGTTTG TATTGATCTC ACTTGTGATT 360 
45 CGGGGAGTCA GGCAGTTCCG TCACCAGCTA CTCGATCTGA GGCACTTTCT AGTGTGTTAG 420 
ATCAGGAGGA AGCTATGGAA ATTAAAGAAC ACCATCCAGA GGAGGGGTCT TCAGGGTCTG 480 
AGGTGGAAGA AATCCCTGAG ACACCTTGTG AAAGTCAAGG AGAGGAACTC AAAGAAGAAA 540 
ATATGGAGAG TGTTCCGTTG CACCTTTCTC TGACTGAAAC TCAGTCCCA 589 



50 



55 

(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEr»JESS : double 
<D) TOPOLOGY: linear 

(xi) SBQUEtKlE DESCRIPTION: SBQ ID NO: 203: 

GGCACGAGCG CAAGCTGCTG GCCGCCATCA ACGCGTTCCG CCAGGTGCGG CTGAAACACC 

QGAAGCTCCG GGAACAAGTG AACTCCATGG TGGACATCTC CAAGATGCAC ATGATCCTGT 

ATGACCTGCA GCAGAATCTG AGCAGCTCAC ACCGGGCCCT GGAGAAACAG ATTGACACQC 

TGGCX3GQGAA GCTGGATGCC CTGACTGAGC TGCTTAQCAC TQCCCTGGGG CXrGAGCAGCT 

TCTAGAACCC AQCCAGCAGT CCAAGTAQCT GGACCCACGA QGAGGAACCA QGCTACTTTC 

CCCAGTACTG AGTGGTGGAC ATCGTCTCTG CCACTCCTGA CCAGCCTGAA CAAAGCACCT 

CAAGTGCAAG GACCAAAGGG GGCCTGGCTT GGATGGGTTG GCTTGCTGAT GGCTGCTGGA 

GGGGACX3CTG GCTAAAGTGG GGAGGCCTTG GCCCACCTGA GGCCCCAGGT GGGAACATGG 

TCACCCCCAC TCTGCATACC CTCATCAAAA ACACTCTCAC TATGCTGCTA TGGACGACCT 

CCAGCTCTCA GTTACAAGTG CAGQCX3ACTG GAGGCAGGAC TCTTGGGTCC CTGGGAAAGA 

GGGTACTAGG GGCCCGGATC CAGGATTCTG GGAGQCTTCA GTTACCGCTG GCCGAGCTGA 

AGAACTGGGT ATC3AGGCTGG GGCQGGGCTG GAGGTGGOGC CCCCTGGTGG GACAACAAAG 

AGGACACCAT TTTTCCAGAG CTGCAGAGAG CACCTGGTGG GGAQGAAGAA GTGTAACTCA 

CCAGCCTCTG CTCTTATCTT TGTAATAAAT GTTAAAGCCA GAAAAAAAAA AAAAAAAAAA 

AAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDFXINESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204: 
ACAAACATAC TCGCAGGAAG GAGTCTCATG CTGCCCQCAG CATCAGCGCA ACNCNTGGCC 
GCCATCAACG CGTTCCGCCA GGTGCGGCTG AAACACCGGA AGCTCCGGGA ACAAGTGAAC 
TCCATGGTGG ACATCTCCAA GATGCACATG ATCCTGTATG ACCTGCAGCA GAATCTGAGC 
AGCTCACACC GGGCCCTGGA GAAACAGATT GACACGCTGG CGGQGAAGCT GGATGCCCTG 
ACTGAGCTGC TTAGCACTGC CCTGGGGCCG AGGCAGCTTC CAGAACCCAG CCAGCAGTCC 
AAGTAGCTGG ACCCACGNAG GAGGAACCAG GCTACTTTCC CCAGTACTGA GGTGGTGGAC 
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ATNCGTCTCT TGCCftCTCCN TC3NACCCAGC 
AAAGGGGGCC CTGGCTTOGA GTGGGTTGGC 
TAAAGTGGGK AOGGCTTOGC CCACCTGAGG 
TGCATACCCT CATCAAAAAC ACTCTCACTA 
TACAAGTGCA GGCOACTGGA GGCAGGACTC 
CCCGGATCCA GGATTCTGGG AGGCTTCAGT 
GAGGCTGGGG CGGGGCYGGA GC3TGGCGCCC 
TTCCAGAGCT GCAGAGAGCA CCTGGTGGGG 
CTTATCTTTG TA 



433 

CCTGAACAAA GCACXTTCAAG TGCAAGGftCC 
TTGCTGATGG CTGCTGGAGG GGACGCTGGC 
CCXXAGGrrGG GAACATQGTC ACCCCCACTC 
TGCTGCTATG GACGACCTCC AGCTCTCAGT 
CTGGGTCCCT GGGAAAGAGG GTACTAGGGG 
TACCGCTGGC CGAGCTGAAG AACTGGGTAT 
CCTGGTGGGA CAACAAAGAG GACACCATTT 
AGGAAGAAGT GTAACTCACC AGCCTCTGCT 



(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) Lmmii 1354 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 
GATTCGGCAC GAGGCTTGCT GGAGCAGC5AG AAGTCTCTRG CCGGCTGGGC ACTGGTGCTG 
GCASGARCTG GCATTGGACT CATGGTQCTG CATQCAGAGA TGCTGTGGTT CGGGGGGTGC 
TCGGCTGTCA ATGCCACTQG GCACCTTTCA GACACftCTTT GGCTGATCCC CATCACATTC 
CTGACCATCG GCTATGGTGA CGTGGTGCCG GGCACCATGT GGGGCAAGAT CGTYTGCCTG 
TGCACTGGAG TCATGGGTGT CTGCTGCACA GCCCTGCTGG TGGCCCrGGT GGCCCGGAAG 
CTGGAGTTTA ACAAGGCAGA GAAGCACGTG CACAACTTCA TGATGGATAT CCAGTATACC 
AAAGAGATGA AGGAGTCCGC TGCCCGAGTG CTACAAGAAG CCTGGATGTT CTACAAACAT 
ACTCGCAGGA AGGAGTCTCA TGCTGCCCGC AGGCATCAGC GCAANCTQCT QGCCGCCATC 
AACQCGTTCC GCCAGGTGCG GCTGAAACAC CGGAAGCTCC GGGAACAAGT GAACTCCATG 
GTGGACATCT CCAAGATGCA CATGATCCTG TATGACCTGC AGCAGAATCT GAGCAGCTCA 
CACCGGQCCC TGGAGAAACA GATTGACACG CTGGCGGQGA AGCTGGATGC CCTGACTGAG 
CTGCTTAGCA CTGCCCTGGG GCCGAGGCAG CTTCCAGAAC CCAGCCAGCA GTCCAAGTAG 
CTGGACCCAC GAGGAGGAAC CAGGCTACTT TCCCCAGTAC TGAGGTGGTG GACATCGTCT 
CTGCCACTCC TGANCCCAGC CCTGAACAAA GCACCTCAAG TGCAAGGACC AAAGGGGGCC 
CTGGCTTGGA GTGGGTTGGC TTGCTGATGG CTGCTGGAGG GGACGCTGGC TAAAGTGGGK 
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AGGCCTTGGC CCACCTGPJSG CCXCAGGTGG GAACATGGTC ACCCCCACTC TGCATACCCT 960 

CATCAAAAAC ACTCTCACTA TGCTGCTATG GACGACCTCC AGCTPCTCAGT TACAAGTQCA 1020 

5 

GGCGACTGGA GGCAQGACTC YTGGGTCCCT GGGAAAGAGG GYACTAGGGG CCCGGATCCA 1080 

GGATTCTGGG AGGCrTTCAGT TACCX3CTQGC OGAGCTGAAG AACTGGGTAT GAGGCTGGQG 1140 

10 CGGGGCTGGA GGTQGCGCCC CCTGG?rQGGA CAACAAAGAG GACACCATTT TTCCAGAGCT 1200 

GCAGAGAGCA CCTGGTGGGG AGGAAGAAGT GTAACTCACC AGCCTCTGCT CTTATCTTTC 1260 

TAATAAATGT TAAAGCCAGA AAAAAATAAA AAAAAAAAAA AAAAAACTCG AOGGGGOCCC 1320 

15 

AGACCCAATC TCCCTATAGT AAGNCGCCNN ANAN 1354 



20 

(2) INFORMATION FOR SEQ ID NO: 206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOUXJY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

30 

TCCCCAGGTG CACAGCCAGG GCCCTCCTGT CTGCAGGAGA ATTCACAGCT GGTC?rGGGAC 60 

TCAGCCCCTA GNCCATTCAA AGCCTTAATG TTGTAATCAT ATCTTACGTG TTGAAGACCT 120 

35 GACTGGAGAA ACAAAATGTG CAATAAOGYG AATTTTATCT TAGAGATCTG TGCAGCCTAT 180 

TTCTGTCACA AAAGTTATAT TGTCTAATAA GAGAAGTCTT AATGGCCTCT GTGAATAATG 240 

TAACTCCAGT TACACGGTGA CTTTTAATAG CATACAGTGA TTTGATGAAA GQACGTCAAA 300 

40 

CAATGTGGCG ATGTCGTGGA AAGTTATCTT TCCCGCTCTT TGCTGTGGTC ATTGTGTCTT 360 

GCAGAAAGGA TGGCCCTGAT GCAGCAGCAG CGCCAGCTGT ANATAAAAAA TAATTCACAC 420 

45 TATCAGACTA GCAAGGCACT AGAACTGGAA AAGACCACAG AAAACAAAGA ATCCAACCCT 480 

TTCATCTTAC AGGTGAACAA ACTGTPGATGA TGCACATGTA TGTGTTTTGT AAGCTGTGAG 540 

CACCGTAACA AAATGTAAAT TTGOCATTAT TAGGAAGTGC TGGTGGCAGT GAAGAAGCAC 600 

50 

CCAGGCCACT TGACTCCCAG TCTGGTGCCC TGTCTACACC AGACAACACA GGAGCTQGGT 660 

CAGATTCCCC TCAGCTQCTT AACAAAGTTC CTCGAACAGA AAGTGdTAC AAAGCTGCCT 720 

55 TCTCGGATAC TGAAAGGTCG AGTTTTCTGA ACTGCACTGA TTTTATTGCA GTTGAAAAAA 780 

AAAAAAAGCT ATTCCAAAGA TTTCAAGCTG TTCTGAGACA TCTTCTGATG GCTTTACTTC 840 

CIGAGAGGCA ATGTTTTTAC TTTATGCATA ATTCATTGTT GCCAAGGAAT AAAGTGAAGA 900 
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AACAGCACCT TTTAATATAT AGGTCTCTCT GGAAGAGACC TAAATTAGAA AGAGAAAACT 960 

GTGACAATTT TCATATTCTC ATTCTTAAAA AACACTAATC TTAACTAACA AAAGTTCTTT 1020 

TGAGAATAAS TTACACACAA TGGCXaCAGC AGTTTGTC?rT TAATAGTATA GTGCCTATAC 1080 

TCATGTAATC GGTTACTCAC TACTGCCTTT AAAAAAAAAA ACCAQCATAT TTATTGAAAA 1140 . 

CATGAGACAG GATTATAGTG CCTTAACCGA TATATTTTGT GACTTAAAAA ATACATTTAA 1200 

AACTGCTCTT CTGCTCTAGT ACCATGCTTA GTGCAAATCA TTATTTCTAT GTACAACTGA 1260 

TGCTTGTTCT TATTTTAATA AATTTATCAG AGTGAAAAAA AAAAAAAAAA AAAAAAAAAA 1320 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAGAA NAAAANAA 1378 



(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC5TH: 1166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUXSY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

AANCCACTGC ANTTTAAACC CCCTCCCCTC CAAGAAAGTT CACAACCGGC CATGGATGAC 60 

CCTCATTTTA GATGGGCCNC AATATTTAAG ATGGACTGRG GMCCCCARAG ACTGACCCTT 120 

GAAAGGQGGA CTCAGAAGAA AGATCCTTGA CATTGCCMAA CATGCTQGGC TTGTCCAACA 180 

CAGTGATC3CG GCTCATCGAG AAROGGGCTT TCCMAQGACA AGTACTTTAT GATAGGTGGG 240 

ATCCTGCTGA CCTGTGTGGT CATGTTCCTC GTQGTGCAGT ACCTGACATG AGCCAGCCAC 300 

GCTCAGTCGC TGAACAGCAT TCCCACAGCC TGCAAGTGTG TGTGTGTGTG AAAGAGAGAG 360 

GGGGCCCAGA GGCCGCCTTT TGAAATGTTT GCCTGTCTGA ACTGTGAAGA CACTTGGGAG 420 

TGATTGTXSGT CTAATTTCCA ACCTGCTCTG TTTTCTGTGA CATCTTGGAG GGGGAGCTAG 480 

TGCCAMCACC ATGCGCGGTG CTTAGGAAAT GAAASAAGTC CCGGGTCTGT CTCTCTCACT 540 

CTCGCTCTCA MTGGGGGAGG GAAAGAATGG CTTTGGTQGC TTTGTTCACA CAGCTGATGC 600 

GTGSCCTGGG AAGGTGTCCA CAGTGAGCCC TGTGTGCAGG ACTGTCCACN ACGGTTCACA 660 

CCTTCTCACC ATCAGGCCTT TCTGGCTCCT GATAGGGTGG AGCAAAAGTG GAAAGGAAAG 720 

GAAAGAGGCY TTTTCTCACA GCCATTATAT TAAATAGTAG GTCGATTCAC ATCYTCGTGC 780 

TCCTGGCCAC CCTCCCCTGT GCCTCAGTGA CATGTAGATG ACTGACTGCC AATACTTGTC 840 

ACCATTCCCT GGAAGCAGCT ACCTAGGGGA AACAAGATGT AGTGCTATTG CCGATAACAA 900 

GTAAGATTTT CCACACTACA GCTGGGTGTT TCTCTTTTCT AAAGTGAGGC CAGTGTTATT 960 
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TCCCQGGAGT GTTCAGTCTT GACCCTAGTC ACTGATTTTT TCTAGTTGTT AATAGAGTGG 1020 

TTQGGCTTTT AAGGTTCAGA GACTGTOQGC TTGGGCACCT GCXXXXIAGQG STTTTGTGQG 1080 

GGCTTTTGCC CCTTAGRAAA GTAGCTTTTA GGGGCAAAGA TTTOITGATT TTCCCCATTA 1140 

CAGTCTTCAG CTCNAGGGTT TTAAAA 1166 

(2) INFORMATION FOR SEQ ID NO: 208: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 697 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SBQ ID NO: 208: 
TACTTCTAQG ATTATAAGGA ATTAACATTG AGATGACATT TCCATTTGAG AAGGAAAATA 60 
25 GTTGCTTTCA GTGCCTTTTA TTTGATTCCT GGAGAGAGCA GACTCGCACS AACATTCAAC 120 
CCCAGCGCTG ATATGACAGT AATCCTCAGA GGCAGAGCCC AGCACAAAAC AGCAATGCTA 180 
GAAAGTTACA ATTGGAAAGT TTCCTGCCAG CTTCGGGAAT GACACTGCAA AGCTGATGCC 240 

30 

AGAAACTGCC AGRGTAATTC TCCTCATTAC TGCTCTACCC ACCCACTTTC AGCTCCCCAA 300 
ATTAACTAGT GCASTTGACT AATTCTCTTT ACCTTTATCA TTTARGGTGA RGCATTQCAC 360 
35 AAAAACTCTC GACTTTGCCA TATAAGGGCT GTGGTTCTCT GTGGTCCCCT GGATAAGAGG 420 
CATCACCATT ATCTOGAAAC ATGCAGTAAA TGCAGATTNT TCATCTTCTC CCCAGACCTC 480 
CTGAGTTAGA AATTCACAAG TTCTCCAGGT GATCTCATAC ATGCTAAAGT TTGAGAACCA 540 

40 

TTGAGTAAAG TTAATGCATT AflGAAGAGAT TAGATAGGGA TGGTQGCGTA TCTTCCTACA 600 
GTTTCCCTGT TAACAAGAAA GTCAGAGGTC AGTTGATCAG ACATTAGATT ATTTATTGCT 660 
45 AAAACTAAAA AAAATTAAAA AAAACTGGAG GGGGGCC 697 



50 (2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) Lm^TH: 932 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: double 

(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



60 



CGTGAGTCAC CTCTCTATAG TGQGCGTGGC CGAGGCCGGG GTGACCCTGC CGAAGCCTCC 60 
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GCTGCCAGAA ACGATGTTCA AQGTAATTAA AAGGTCCGTG GQGCCAQCCA GCCTGAGCTT 120 

GCrCACCTTC AAAGTCTATG CAGCACCAAA AAAGGACTCA CXTTCCCAftAA ATTCCGTGAA 180 

5 

GGTTGATGAG CTTTCACTCT ACTCAGTTCC TGAGGGTCAA TCGAAGTATG TGGAGGAGQC 240 

AAGGAGCCAG CTTGAAGAAA GCATCTCACA GCTCCGACAC TATTGCGAGC CATACACAAC 300 

10 CTCGTGTCAG GAAACGTACT CCCAAACTAA GCCCAAGATG CAAAGTTTGG TTCAATGGGG 360 

GTTAGACAGC TATGACTATC TCCAAAATGC ACCTCCTGGA TTTTTTCCGA GACTTGGTGT 420 

TATTGGTTTT GCTGGCCTTA TTGGACTCCT TTTQGCTAGA GGTTCAAAAA TAAAGAAGCT 480 

AGTGTATCCG CCK3GTTTCA TGGGATTAGC TGCCTCCCTC TATTATCCAC AACAAGCCAT 540 

CCTGTTTCCC CAGGTCAGTG GGGAGAGATT ATATGACTGG GGTTTAOGAG GATATATAGT 600 

20 CATAGAAGAT TTGTOGAAGG AGAACTTTCA AAAGCCAGGA AATGTGAAGA ATTCACCTGG 660 

AACTAAGTAG AAAACTYCAT GYTCTGCCAT CTTAATCAGT TATRGGTAAA CATTGGAAAC 720 

TCCATAGAAT AAATCAGTAT TTCTACAGAA AAATGGCATA GAAGTCAGTA TTGAATGTAT 780 

TAAATTGGCT TTCTTCTTCA GGAAAAACTA GACCAGACCT CTGTTATCTT CTGTGAAATC 840 

ATCCTACAAG CAAACTAACC TGGAATCCCT TCACCTAGAG ATAATGTACA AGCCTTAGAA 900 

30 CTCCTCATTC TCATGTTGCT ATTTATGTAC CT 932 



25 



35 (2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 661 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEONESS: dOljJsle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

45 GTCATTCTTT AAATAAAAGC TTTCCTGTTT AAAQCTITTC AAAGGAGCAG ACCACCTTGA 60 

AGATTCCCCC TAQGGTTGAT ATGTGTCTAA TTCATTTTAT AAAAATTATT CTTGTCTTCA 120 

TTTTAAAQCT TTQGCTATAT AGTCAGAAAT GTCCTAAATA ACAAACTATT TTGTATTTAA 180 

TTTAGGGAAG ACTAAAGGGA AGAAAAATGA AAACTCAGTC TTTATGTAAG CTCCAAGGAT 240 

ATTAGGGCTT AAAGGGCTTT TCTAGTTTTA TGAGAATTTG TACTACTGAT TTTTATATAT 300 

55 TCCTGTTTTT GAGATGAACA GATCTCTGGG GAAATTGTTG AGTTACAATG GCATTTCACT 360 

GTGATCCCTC TCAAGCTCAG ATCAGTTCTA TAACCCAATG ACAACCTGTC TCTTTGGTTT 420 

ACTGTCCTGT GAAATGTCAG CTCAAGTTTC CCAGAAGTCG TGTGTTTATG ATGAGTCAGA 480 



50 
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GTQCTTTTCC TCGGTGGGAC AGTTGCTGGC CCTCTTAATT TTGGTGTATG TQCTTCCAAG 540 

TATCTAAACC TCCAGTCTGA TCTGTATATG CTATCCTAAC TGTTAATTGT ATTATTGATT 600 

ATGTTGATTA TCTTGCrTGA AGGTTCATAC TTTTCAATTT GATAGAAATA AAGTTTTTTT 660 

C 661 

(2) INFORMATION FOR SEQ ID NO: 211: 



(i) SBOTNCE CHARACTERISTICS: 
15 (A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GAAACTGACA TTGTTAAACA CACTAAAACA GAAGTACTTA CCTCTTGAAG ATTTAATATA 60 

TAATGGTTGA CATGATACAT GTACATGAAT GGAATGACCA GATGCTTATG GTCTACATTT 120 

25 

TCCTTTATCC TGTTAGTATT ACCTTCCTTA ATCTTTGTTC CTTAACATGC TAAATTCCTC 180 

TTCAGTGTTT ATTTTCTAGT GACAGAATGC TAACATTTCT TACACCCTGG CAGAAGGGAG 240 

30 AGAAATGTGT TTTQGGGTGG GTAACTAAAT TTTTGAGTGA AATATCATAA GATGAGAATG 300 

GAAAGAGGGA GACACAAAGA GTTATAACAA AAAAACAATG GTTTTTTTAG CCATTTGACT 360 

GGCTCTTTAA ATAGTCTACA AGACATTCAC GTTNAACATC ACTTTTAGTG AAATAAAATG 420 

35 

TGCCATACTA GTATGTGCTT CAAAAGGGCA AATGTGCTTT AGTGCCCTAA GGCTAAATTT 480 

TGGTCATTTG ACATCAGAGA TGTTGTAAGT ATTGCACTTA ATACGCACCT ATTTCTCAAT 540 

40 AGTGOTATTT TTTTGGCTAG CATTTNCTTT ACCACTAACC TTGTTGGATA GC 592 



45 (2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 938 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDEWBSS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 212: 
55 TGGAGTGGCT TTCCAGCTGA ATGAATCCTA TGTCTCGCGT GCAGGTGGTT GGTTTTCAAT 60 
GTTCTTSCTA ATTTTTTTCC TATTGGCTCT TGQGAGTTTN CTTTGTTTGC TCCTGTGTTT 120 



60 



GCCCAGCTTT AATAAAACCA GGCGCAAACA AAAACCATAG CATTCTGAAA CAATAGGGGG 180 
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CCCACATTGG ACCCAGTATG TCACTTTAAT GGACTTCAAG AAAAAATCTG AATGGGAAAA 240 

TGACACTAGG AATGTATACT CCACACATTT TATGCCATAT AATGGTGTGT TTTCTTAATT 300 

TTGTTTCTTC TQGCGAAATG TGGCTTTCAA ATTAAAATGM OrrTTTCTTC TTKGAAACTT 360 

TTTGTTTKGA CTKGTATAAT TAAGGGTTTG GAAAGATTCA TAATmTGAG AGAGGTTTGC 420 

AACCAGGAGA TACAAAGAAG TCTCAGTAGT AATCTTGTTC ATGTGCTTTT ACAGCCAC5CT 480 

ACATTTAAGR ATGTATTAGT TACAGAAATT ATATGTCTGT GTATGTGTCT CTACTCAATA 540 

AAGTACATGC CTCCACATAA TGCGGTGCTG TCCATCTCGG CAAATACTGG CCAAGimrT 600 

TTATGACAC5G CACACAGAAA CCATAGCATG GTCTGGCTTT CAGAAAATGC CTCTCATCTT 660 

TCCTQGAACC TTATTTTGCT AAATGTCTGT TTTCTTGTGA TTTGTTGTAC CTCACAGCAC 720 

CATTGTGACC ATGGTGATGC CTCATTTGCA TGATATGTAC CTTGTGTTTA ATGTGAAATA 780 

CATTTTCATT GAAGAGTCTG ATGACTTGCT AGCGTTTTAT TTTTTCTGTA AGCTCAATGT 840 

GCTGAAACCA AACCAGGCTT TTAAAAACCT GTGTAGAAGA AAACCAAAAA ATCCTGTGTG 900 

GGTGTCCTTT CCCTGTCAAA CTCATTAAAA AtTCCTTT 938 



(2) INFORMATION FOR SEQ ID NO: 213: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1079 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

AGCCTGCCGG GAGAGTQGTG GCATCTRARA GGCTGGTCC5T GC3ACTGTGGT TGGGQGAGGT 60 

GGGAGCTGTT TTAACCGTGT GCCCCCTCTC CTGTQCCKGC GTGGGCATCC CCCGGGGCAG 120 

TGGAACGCGG GCGCTCCTCC AGCTTCCGAG TCCAGCCAGC CTGGGCGCGG GGCGCGCCCC 180 

CGAGACACCC GAGGAGTCCG TTCCTCCCTG GTTACGTGGA CTGTGGAGCT GGTCTCTTGT 240 

GGCTCAGCGC CGTGCGGAGG TTGAAGCGTA CCTGCGGAGG TCGCACCAGG C3GCGTGAGGA 300 

GGAGGAGGAA GGGCATGAGC CGAGCTTGAG GAATCCGTGY TCCAAACTCT ACACTCAAGG 360 

RTGCMCTGCG CAACTCTGGT GGCGATGGGC TGGGGCAGAT GTCCTTGGAG OTCTACCAGA 420 

AGAAGAAGTC TCGCTGGCCA TTCTCAGACG AGTGCATCCC ATGGGAAGTG TGGACGGTCA 480 

AGGTQCATGT GGTAGCCCTG GCCACGGAGC AGGAGCGGCA GATCTGCCGG GAGAAGGTGG 540 

GTGAGAAACT CTGCGAGAAG ATCATCAACA TCGTGGAGGT GATGAATCQG CATGAGTACT 600 

TGCCCAAGAT GCCCACACAG TCGGAGGTGG ATAACGTGTT TGACACAGGC TTGCGGGACG 660 
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TGCAGCCCTA CCTGTACAAG ATCTCCTTCC 
CCACCACCAT GCGCAGGCTC ATCAAAGACA 

5 

GOGAGCTCCT TGATGGCTCC CAGACCTTGG 
GCTCK3GAAC CTGCTCTQGG TCATPGGTGA 
10 TTGGTTTTGT GGTTGCCAGC CTCAGGTCAT 
GCCTCTACTG TCTCTCCATA GCCCTGGTGG 
AGCCACCACT GGGATGGGGA ATAAAGTTGA 

15 



AGATCACTGA TGCCCTQGGC ACCTCAGTCA 720 

CCCTTGCCCT CTGAGCGTCG CTGGATCTCT 780 

cnrrGGGAA ttgcactttt GGoccrrroG 840 

GACTTGGAAG GGGCAGCCCC CGCTGGCTTC 90O 

CCTTXTAATC TTTQCTGACG GTTCAGTCCT 960 

GGTCCCCCTT CTTTCTCCAC TOTACAC3AAG 1020 

GAACATGAGT TTGGGCTGAA AAAAAAAAA 1079 



20 



(2) INFORMATION FOR SEQ ID NO: 214: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
25 (D) TOPOLOGY: linear 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

TGAAGCAGGC GCTCTIGGCT CGGCGCGGCC CGCTGCAATC CGTGGAGGAA CGCGCCGCCG 60 

AGCCACCATC ATCCCTGQGC ACTTACAGGA AGGCTTCGGC TGCGTGGTCA CCAACCGATT 120 

CGACCAGTTA TTTGACGACG AATCGGACCC CTTCGAGGTG CTGAAGGCAG CAGAGAACAA 180 

35 GAAAAAAGAA GCCQGCGGQG GCGGCGTTQG QGGCCCTGGG GCCAAGAGCG CATCAGQGCC 240 

GCGGCCCAGA CCAACTCCAA CGCGGCAGGC AAACAGCTGC GCAAGGAGTC CCAGAAAGAC 300 



40 



50 



CGCAAGAACC CGCTGCCCCC CAGCGTTGGC GTQGTTGACA AGAAAGAGGA GACGCAGCCG 360 



420 



CCCGTCGCGC TTTAAGAAAG AAGGAATAAG ACGAGTTGGA AGAftGACCTG ATCAACAACT 

TCAGGGTCAA GGGAAAATAA TTGATAGAAG ACCAGAAAQG CGACCACCTC GTGAACGAAG 480 

45 ATTCGAAAAG CCACTTGAAG AAAAGGGTGA AGGAGGCGAA TTTTCAGTTG ATAGACCGAT 540 

TATTGACCGA CCTATTCGAG GTCGTGGTGG TCTTGGAAGA GGTCGAGGGG GCCGTGGACG 600 

•rcGAATCCGC CGAGGAGATG GATTTGATTC TCGTGGCAAA CGTGAATTTG ATAGGCATAG 660 

TGGAAGTCAT AGATCTTCTT TTTCACATTA CAGTGGCCTG AAGCACGAGG ACAAACGTGG 720 

AGGTAGCGGA TCTCACAACT GGGGAACTGT CAAAGACGAA TTAACTGACT TGGATCAATC 780 

55 AAATOTGACT GAGGAAACAC CTGAAGGTGA AGAACATCAT CCAGTOGCAG ACACTGAAAA 840 

TAAGGAGAAT GAAGTTGAAG AGGTAAAAGA GGAGGGTCCA AAAGAGATGA CTTTGGATGA 900 

GTGGAAGGCT ATTCAAAATA AGGACCGGGC AAAAGTAGAA TTTAATATCC GAAAACCAAA 960 
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TCAAGGTCCT GATGGGCAGT GGAAGAAGGG ATTTGTTCTT CATAAATCAA AGAGTGAAGA 1020 

GGCTCATCCT GAAGATTCGG TTATQGACCA TCATTTCCGG AAGCCAGCAA ATGATATAAC 1080 

CTTCTCAGCTG GAGATCAATT TTGGAGACCT OGGCXXSCCCA QGAjOGTQGCX; GCAGGGGAGG 1140 

ACGAQGTOGA CC3TCC3GCGTG GTGQGCGCCC AAACCGTQGC AGCAGGACCX5 ACAAGTCAAS 1200 

TOCTTCTGCT CCTGATGTGG ATGACCCAGA GGCATTCCCA GCTCTQQCTT AACTGGATGC 1260 

CATAAGACAA CCCTGGTTCC TTTGTGAACC CTTCTGTTCA AAGCTTTTGC ATGCTTAAQG 1320 

ATTCCAAACG ACTAAGAAAT TAAAAAAAAA AAGACTGTCA TTCATACCAT TCACACCTAA 1380 

AGACTGAATT TTATCTGTTT TAAAAATGAA CTTCTCCCGC TACACAGAAG TAACAAATAT 1440 

GGTAGTCAGT TrTGTATTTA GAAATGJTATT GGTAGCAGGG ATCTTTTCAT AATTTTCAGA 1500 

GATTATGCAT TCTTCATGAA TACTTTTGTA TTGCTGCTTG CAAATATGCA TTTCCAAACT 1560 

TGAAATATAG GTGTGAACAG TGrTGTACCAG TTTAAAGCTT TCACTTCATT TCTGTTTTTT 1620 

AATTAAQGAT TTAGAAGTTC CCCCAATTAC AAACTOGTTT TAAATATTGG ACATACTGGT 1680 

TTTAATACCT GCTTTQCATA TTCACACATG GTCAACTGGG ACATGTTAAA CTTTGATTTG 1740 

TCAAATTTTA TGCTGTCTCG AATACTAACT ATATGTATTT TAACTTAGTT TTAATATTTT 1800 

CATTTTTGGG GAAAAATCTT TTTTCACTTC TCATGATAGC TGTTATATAT ATATGCTAAA 1860 

TCTTTATATA CAGAAATATC AGTACTTGAA CAAATTCAAA GCACATTTGG TTTATTAACX: 1920 

CTTGCTCCTT GCATGGCTCA TTAGGTTCAA ATTATAACTG ATTTACATTT TCAGCTATAT 1980 

TTACTTTTTA AATGCTTGAG TTTCCCATTT TAAAATCTAA ACTAGACATC TTAATTGCTG 2040 

AAAGTTOTTT AAACTACTTA TTGTTQGTAG GCACATCGTG TCAAGTGAAG TAGTTTTATA 2100 

GGTATGGGTT TTTTCTCCCC CTTCACCAGG GTGGGTGGAA TAAGTTGATT TGGCCAATGT 2160 

GTAATATTTA AACTGTTCTG TAAAATAAGT GTCTGGCCAT TrGGTATGAT 1TCTGTGTGT 2220 

GAAAGGTCCC AAAATCAAAA TGGTACATCC ATAATCAGCC ACCATTTAAC CCTTCCTTGT 22B0 

TCTAAAACAA AAACCAAAGG GCGCTGGTTG GTAOQGTGAG GTGGGGGAGTr ATTTTAATTT 2340 

TTCGAATTTG GGAAGCAGAC AGCTTTACTT TGTAAGGTTG GAACAGCAGC ACTATACATG 2400 

AAATATAAAC CAAAAACCTT TACTGTTTCT AAATTTCCTA GATTGCTATT ATTTGGTTGT 2460 

AAGTTCAGTA TTCCACAGAA AGTGGTAATT ATCTCTTCTC TCTTCCTCCA TTAGAAAATT 2520 

AGGTAAATAA TGGATTCCTA TAATGGGAGC ATCACCACTT ATTAAAACAC ACATAGAATG 2580 

ATCAATTAAA AAAGTTTTCT AGGATTGTCT TTTATTCTGC CACATTTATT GATAAACAGT 2640 

GAAGGAATTT TTAAAAAATT TTTAAGAATT GTTTGTCACG TCATTTTTAG AAATGTTCTA 2700 

CCTGTATATG GTAATGTCCA GTTTTAAAAA TATTGGACAT CTTCAATCTT AAACATTTCT 2760 
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ATTTAGCrcA TTCCTTCTCA CATATACITC TAAAAGAAAC TTTTAlCTrA TAAGAGfTTAC 2820 
TTTTTCGATA AGATTTATTA ATCTCAGTTA CCTACTATTC TCACATTITA GGAAGC3AGGT 2880 
AATTGTTITr AATGATCGAT AAACTTGTCC TOGTGTITrc GATCTTATGA TCCIGAGCAT 2940 
CTTCTCCACT GGTGCTAATC TCTAATATAA TTTTATATTT ACACACATAC GTOCTACCCA 3000 
GAGATTAATT TAGTCCATAT GAACTATTGA CCCATTGTTC ATTGAGACAG CAACATACGC 
ACTCCTAAAT CAOTCnOTrr AGACTTTTCA AGTATCTAAC TCATTTCCAA ACATGTACCA 
TGnTTATAA ACCTCTTGAT TTCCAGCAAC ATACTATAGA AAACACCTGC TACTCAAAAC 
15 ACAACITCTC AOTGTCATCC ATTGCTGTCX5 TGAGAGACAA CATAGCAATA TCTGGTATOT 
aXSCAAGCTTT CAAGATAGCC TGAACTTAAA AAGTTGGrTGC ATTAGTTGTA TCTGATGGAT 
ATAAATTTQC CTCCTAOTTC ACTTTGTGTC AAGAGCTAAA ACTGTGAACC TAACTTTCTC 

20 

TTATTOCTTOG GTAATAACTC AAAATAAAGA TTTArmCA TQCTCACTTC TTAAAAGTCA 3420 
TAAAAACAAT CAAATAGGRT CATOTTTATr GTCATGTCTT TCCTGGKTTC TCACCTOTGT 3480 
25 GCACACCCCT GTCTGTrTAT AATTTTTAAA TTGAATTrTA TATGOGGrTT TTATTTGCTA 
AAAACCAGGC TOTTCAATCA CATTTCGGAA GGGTACTTAT CTTAATGACT AATGACTTAA 
OTOGGAAAGT TCAATTCTTO TAAAATACAA AATCCAAGGA CTTCTTGGGA TTrAATCTAA 

30 

TTOTCACrrC MTTAGOCAGA TNCACnTTT TGGATAATGG AAAGTTAAGC ATACCX5AATG 
CTACmrGG ITCACAAACG QGCCTAATAG TCCGGGGGGA AATCCCTAAC NGGTAAGGNT 
35 CCCAAGTATG G 



3060 
3120 
3180 
3240 
3300 
3360 



3540 
3600 
3660 

3720 
3780 
3791 



40 (2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1334 base pairs 
(B) TYPE: nucleic acid 
45 (C) STRANDEDNESS: double 

tD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

50 CAGTCCTCGC TCCTCCTCGG GGCGCTOCGG CCCCGGGCGT CGCCATGACC AGTGAGCTGG 

ACATCTTCGT GGGGAACAGA CCCTTATCGA CGAGGACGTG TATCGCCTCT GGCTCGATGG 

TTACTCGGTG ACCGACGCGG TGGCCCTOCG QGTGCGCTCG GGAATCCTGG AGCAGACTGG 

CGCCACGGCA GCGGTGCTNC AGAGCGACAC CATGGACCAT TACCGCACCT TCCACATGCT 

CGAGCGGCTG CTCCATCCGC CGCCCAAGCT ACTGCACCAG VTCATCTrCC AGATTCCGCC 

60 CTCCCGGCAG GCACTACTCA TCGAGAGGTA CTATGCCTTT RA1GAGGCCT TTGTTCGGGA 



60 
120 
180 
240 
300 
360 
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GGTGCTGGGC AAGAAGCTGT CCAAAGGCAC 
AACAGGCATC ACCCTCAAGA GCTGCCGGAG 

5 

GGTGGTAGAG GAAATGCGGG GCTCCCTGGT 
TCACCXSQTTG G0CAXX3GACT ATGCAGCCAT 
10 AGGGAAGAAA AAACTGCAGT ATCTGAGCTT 
GATCCAAAAC TCGACCXTTTG GAGCCGTCGA 
AGACAAGGAA TTTCTCCAGG ACTTGAAGGA 

15 

TCTQGACCTG CACAAGAGCC TGGTGTGCAC 
TGAGATGGAA OCCAACTTCA AGAACCTGTC 
20 GACCCACAAT AAAGATGTCA GAGACCTCTT 

CTGCCGCTCC GACCACTGGC CACTCAGCGA 
GTCTGTCCAC TCCCTCGATG GCTTCCGACA 

25 

CTCCGCGGCT GCCTCCTGCG CCTGTATCAT 
GCTGACAATA AAGTTGCTCT GAGTTTQGAG 
30 GGCGTGCAGA TGTGCXrrGTC TCTGTCTCTG 
GTGTNCNGTG GCTCCTTGGG AACTGAGACA 
CATCCACCAG AAGA 

35 



CAAGAAAGAC CTGGATGACA TCAGCACCAA 420 

ACAGTTTGAC AACTTTAAAC QGGTCTTCAA 480 

GGACAATATT CAGCAACACT TCCTCCTCTC 540 

CGTCTTCTTT GCTAACAACC GCTTTGAGAC 600 

CXX3TGACTTT GCXnTCTGCXS CTGAGCTCAT 660 

CTCACAGATG GATGftCATQG ACATGGACTT 720 

GCTCAAGGTG CTAGTGGCTG ACAAGGACCT 780 

TCCTCTCCGG GGAAAGCTGG GCGTCTTCTC 840 

CCX3GGGGCTG GTGAACGTQG CXX30CAAGCT 900 

TCTGGACCTC GTGGAGAAGT TTGTGGAACX: 960 

CGTGCGGTTC TTCCTGAATC AGTATTCAGC 1020 

CCAQGCCTCT GGGACCGCTA CATGGGCACC 1080 

GACTGAGGTG CCTCCCAACG CTCCGCCCAC 1140 

ACTOGTOCTC GCTCCGQGGA GCAAGTGGGG 1200 

AGCACCTGGT GrrCCGTGTAC AAGGATOGAT 1260 

TATCTCAGGG AATGGTGTCT GTGCTCAGCX: 1320 

1334 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GTCGCGGGGA TGCTGCGAGG GGGTCTCCTG CCCCAGGCGG QCCGGCTGCC TACCCTCCAG 60 

ACTGTCCGCT ATCGCTCCAA GGCTGTTACC CGCCACCGTC GTGTGATGCA CTTTCAGCGG 120 

CAGAAGCTGA TCGCTGTGAC TGAATATATC CCCCCGAAAC CAGCCATCCA CCCATCATGC 180 

55 CTCCCATCTC CrcCCAGCCC CCCACAGGAG GAGATAGGCC TCATCAGGCT TCTCCGCCGG 240 

GAGATAGCAG CAGTTTTCCA GGACAACCGA ATGATAGCCG TCTGCCAGAA TGTGGCTCTG 300 

AGTCCAGAGG ACAAGCTTCT TATGCGACAC CAGCTGCGGA AACACAAGAT CCTGATGAAG 360 



wo 98/39448 



444 



PCT/US98/04493 



RTCTTCCXrCA ACCAGGTCCT GAAGCXTCTTC 
CCCCTTITTG TGGGC5CACAA CATGCTGCTG 
5 GTACGGATCT TAAGGACTGT GCCAaTCCTG 
ATCCTCAGCA GGCAGGGCTT TATCAACTAC 
GGGGAGCTTG TAGGAGGCCT CACCTGCCTC 

10 

CAGCCCCTCC AGCTGACCAC CCTGTrOGAC 
TCTGTCATGT CQGCCAATGG GAAGCCAGAT 
15 TGTTTAGCCA GCCCTQCGCA TAAATACACT 
GACATGTGGA AGAACTTGG6 GTCGGGGAGT 
ATATTGTCAG GTATAGQQCC ACTTGGAGAT 

20 

COGCTTCGTC CTTAGTTTTC CCAACTTGGG 
CAGGTCCAAG GCAGAGATCX! TGAAAAGATA 
25 GCCTCTTGCT GCACGGGCTC CTGAGCCACC 
TAGCTCAACX AAGCAGTTGT GCTGAGAATG 
CTTTGTGCTG AGTGCTGTAC ATGTATTAGT 

30 

TCACAGAGAA GGAOCAGAGA AATTAAGTGG 
CAGAACAGGG ACTTGAACCA AGCCCTCTGC 
35 AGAGCTTCCT CATCAGGTTA CCXAGAAGTG 
TGTTAGTTCT CCACCCTCGA GGTGTACGCT 
AAAATGAAAT A 

40 



CTGGAGGATT CTAAGTACCA AAATCTGCTG 420 

GTCAGTGAAG AGCCXAAGGT CAAGC3AGATG 480 

CCGCTGCTAG GTGCSCTQCAT TGATGACACC 540 

TCCAAGCTCC CCAGCCTGCC CCTGGTGCAG 600 

acagcxx:aga CCCACTCCCT GCTCCAGCAC 660 

CAGTACATCA GAGAGCAAOG CX5AGAAGGAT 720 

CCIX3ACACTG TTCOGGACTC C3TA0CCAGCC 780 

CTGCXyrTATT GGCTGTGCTC TCCTCAATGG 840 

GTGTTTGTCA CTTQGTTTTC ACTAGTAATC 900 

GCAGAQGATT CCATTTCAGA TGTCAGTCAC 960 

ACGTGATAGG AGCAAAGTCT CTCCATTCTC 1020 

GGGCTAITOT CCXXTTGCCTC CTTGGTCACT 1080 

CCCTTGGGGC ACAACCTGCC AGTGCCACAG 1140 

GCACCTGGTG AGAGCCTGCT GTGTGCCAGG 1200 

TCCTTTACTG CTGACCACAT TGTACCCATT 1260 

CTTGCTCAAG GTCATGCAGT TAGTAAGTGG 1320 

TCTGAAGACC C3CGTCCTGAA TTTCTTCACT 1380 

GGTCCXATCC ACCATCCAGG TGTGCTTCGA 1440 

GTGAAAAGTT TC3GGAC3CACT QCTTTATAAT 1500 

1511 



45 



(2) INFORMATI<»I FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
50 (D) TOPOLCXSfY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

AGGCCTTACT TTTCCTCCCA CAAAGGAGTC GCAGCCACGC TAGCTCTGAC TTGCCACTGT 60 

GACAAAGTTC ACGTAGCAGG TCTAGGCAAA GACTGGGCAA TTGAGCAGAG GAGACGGACC 120 

TGTGAGTCTG ACCRYGAGSC GGRCCCCTTC ACCTTGGCTG GGCTGGTCCT GGTCCTTAGG 180 



55 



60 



TTTTGTCAGG TTGTCCTTGT TTQGATCCCT CAACTAGGTG ATAAGCACTG GAGGGGGATG 
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ACCCGCCTTG GACX3TGTTTC TTTAACCTCA 
GAGGTAAAGC AGGATGATGG TGTTTTAAGA 

5 

AATTTTCTCT CCTGGTAGCT GAACAAAGGT 
CGTCAGCCAG AGTTCTGAAG GCCATGCTTT 
10 TTCCTATGAA AOCACAGAOC dTAGGGGGC 
GAGCTGTC5AA CftGCAGGGGG TTGTCTGrTCT 
AATAAACCCT ATTTCTTATT TTATATTTAC 

15 



TCCATATAAT AGGQCXX5TQG GATGGTTGrTA 300 

CCAGAGCTTG GGACCAGGGC TCCTACACXTT 360 

CTAAAITAGC TTAACAAAAG AACAGGCTGC 420 

CAGTTTCCCT TGTTGACAAT TGCTCTCCAG 480 

CTGGCCACAG AACACAACCA TCTTAGGCCT 540 

GTTCTGTTTC TCTGCTTGCC GAACrTTTCTC 600 

GTWQGnGCTG GG 642 



(2) INFORMATION FOR SEQ ID NO: 218: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
25 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GGTCCCACTG TTCCATTTTA TQCTAATAGA TTCCATTCTA GGGCCCAGCC G?rCTCTTGAC 60 

30 

TCATGGTGTT CCCTTTAACC CTTGGCATGT ATAATAGAAT TTTGGTGAAT GAAAGAACCC 120 

AAATAGGCCA GATAGTCCCC CCAGGCCCTG ATATCCATAA AAGGCTTQGG AATGCATTAT 180 

35 GTAATTGTCC TTAGTCTTTT TGTTGTTTTA GAAAAAAAAA ACAAGATGGG CTCAGATGGA 240 

TGCCTACGTA AAAATWrTC CTAGCTGTGT ACTCATAACT TTTCTTTGAA TTGAGTAGTG 300 

AAAG6AAGGA QGAOGAAAGG AAATTAAATG TCCTTCTAGT ATTCTCTGGA CTCAAGTCTG 360 

40 

ACATATCRGA TAATAACCTA TATTGAAATG CCAAGAATTG TATCTGAAAC AAGRGAACAG 420 

TTTCACACAT TTATCATGCC TTCATATTAC ATATTAACTG AAACCAATTA ATAAACATAT 480 

45 GAAATATCCA TTOCACAAGG CAAAGGCACC TAAACCTTTT GTTTCTTTTT CTACATAGCA 540 

GAAATTCATT TTTTTTTTAT TTTTTTAGGG GAACCTATAT AATTATGACC CAGTGATGTC 600 

TTTTOGTGAC TTAAGCTTAT GAATTCAOGT TACAATTGAG TTGATTCTAG ATGGTTACTA 660 

50 

CCTTCAAAAG GATGTTGGTG CCTTATGTGA CACGAGCCAG AGCCTGCTGG GAATAAACAA 720 

AGCAGATTCA TCCCAACACC AACTCGTAGC TTTAGTGGCA GATGGGAGTG GTCACAGACT 780 

55 CCCAAAATGT GGGGCTTTGG ATTTCCACAC CATCCCACGT GTGTGTCATC TTCCTCTTTC 840 

ACACICTTGA TGATAATTTG AAAATGRTGA AATCACCTCT GAATTTGCCT ATAGCATGAG 900 

CACATTCTTA TGACAACATA ACAAATAGTT CATAATGTGA ATATTAGAAA CTGTTACAGC 960 

60 
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10 



CTCCAGTTAC CATAATTTTC CATGTITCTG GAATTGATAT TGAAATAGCA QGGCTAAQGA 1020 

ATTACTCGCA AGTTTTAGCC TGrTGGGTAAT ACCTTAOQGT TATTTAAATA TTrGTAATTT 1080 

TATTTAAATC TTCATGAATG TTTGAAAGGA ACAAAATTAT CAGGGATGGC TCTTTGCCAT 1140 

GGGTCTTATT TTCACCCTCT TTTCTGTAAG AAAAAAGAAC AATGTCTTAA TGTATTITTA 1200 

AAGrrTTTTGG TATAGTTTCT AATTCCAATT TTAATAAAAG T 1241 



15 



(2) INFOPMATIQN FOR SEQ ID NO: 219: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»3ESS: doijble 
20 (D) TOPOLOGY: linear 



25 



35 



45 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 

TCTTTATOTG ACCTAAAACA TACACACATG CACACACACA TACATATCCA TTCATTCATT 60 

CATTCAAGTG GTGTTTCCAG TGTCTGTGTG TCACTGTTTA TGCAGTTTCC ATTTCCCAGT 120 

GAATTATGAG TOGAGGGCAA CTTTTCTAAC CAGATTGTCT TTTCAGAACA AAGACCKGGG 180 

30 RATTGAQGAA GAGTTTQGAA AGAQGGAGAG GCAAGGAAAG AGAGCTTTAA ATTGAAAGGT 240 

TAATTTCCTA AGAGGAACCT GGGCTGAATG ACTACAGTGT TATACCCTCC AATCTTTGCA 300 

GQTGGGCATC GAACACTGCT TGTATCACTC TC3TGCACGGT ATAAATCCAT ATATCCACAA 360 

AAACACACAT CCATCCATCA ACATATACAT GGTTTGGGAT GAGCAGGTCA ATAGTTTTGA 420 

GAGGGAGTTT GTTCCTTTTT TTTICTCATT ATACTCTTAA ATTGTTGTCA GTTATCAAAC 480 

40 AAACAAACAG AAAAATTOTT TGGGAAAAAC CTTGCATACG CCrTTTCTAT CMAGTGCrrT 540 

AAAATATAGA CTAAATACAC ACATCCTGCC AGTTnTTCT TACAC3TGACA GTATCCTTAC 600 

CTC3CCATTTA ATATTAGCCT CGTATTTTTC TCACGTATAT TTACCTGTGA CTTGTATTTG 660 

TTATTTAAAC AGGAAAAAAA ACATTCAAAA AAAGAAAAAT TAACTGTAGC GCTTCATTAT 720 

ACTATTATAT TATTATTATT ATTGTGACAT TTTGGAATAC TGTGAAGTTT TATCTCTTGC 780 
50 ATATACTTTA TACGGAAGTA TTACGCCTTA AAAATACGAA AATAAATTTT ACAAGGTTTC 840 
TGTTTTGTCT GGAAGAGTAA TTGATGTTGC TAAGAATGAT GTTTGTTTTT TTGQGGTnT 900 
TGTTGTTTTT TTTTTAAATG TTACCAGCAC TTTTTTTGTA AGTTTCACTT TCCGAGGTAT 960 

TC?rACAAGTT CACACTGTTT GTGAAGTTTG AATATGAAGG AATAATTAAA AAAAAAAAAA 1020 

AAACCNCGGG GGGGGCCCGG TCCCATTGGN CCCAAGGGGG CGGTTACGGG GTCACGGCCG 1080 
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(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

TGAATTGAGG GCTTAAAGAT AAACATATGG GRTTGGAGTT GTGTGTCCAT AGGGTTTCAC 60 

15 TGCCTATTTG ATTTGAGTTT ATCCCTATTA ATTTTTTACA GTGAAATTIT ATTAAAGTAT 120 

AATCTACATA TATTTTCAGT GGATTTTGCT CTGAAGGTTC TCCAGTGGTC TGACTACGAG 180 

ATAGTGCGGC TTCAGCTGTG GGATATTGCA GGGCAGGAGC GCTTCACCTC TATGACACGA 240 

TTGTATTATC GGGATGCCTC TGCCTGTC5TT ATTATGTTTG ACGTTACCAA TGCCACTACC 300 

TTCAGCAACA GCCAGAGGTG GAAACAGGAC CTAGACAGCA AGCTCACACT ACCCAATGGA 360 

25 GAGCCGGTOC CCTQCCTGCT CTTQGCCAAC AAGTGTGATC TGTCCCCTTG GGCAGTGAGC 420 

CGGGASCAGA TTGACCGGTT CAGTAAAGAG AACOGTTTCA CAGGTTGGAC AGAAACATCA 480 

GTCAAGGAGA ACAAAAATAT TAATGAGGCT ATGAGAGTCC TCATTGAAAA GATGATGAGA 540 

AATTCCACAG AAGATATCAT GICTTTGTCC ACCCAAGGGG ACTACATCAA TCTACAAACC 600 

AAGTCCTCCA GCTGGTCCTG CTGCTAGTAG TGTTTGGYTT ATTOTCCATC CCAGTTCTGG 660 

35 GAGGTCTTTT AAGTCTCTTC CCTTTGGTTG CCCACCTGAC MATTTTATTA AGTACATTTG 720 

AATTGTCTCC TGACTACTGT CCAGTAAGGA GGCCCATTGT CACTTAGAAA AGACACCTGG 780 

AACCCAKGT6 CATTTCTGCA TCTCCTGGAT TAGCCTTTSA CATGTrGCTG RCTCACATTA 840 

40 

GTCCCAGTTA GTGCCTTCGG TGTAAGATCT TCTCATCAGC CCTCAATTTG TGATCCGGAA 900 

TTTTGTCAGA AGGATKAGAA ATCAGCACCT GCGTTTTAGA GATCATAATT CTCACCTACT 960 

45 TCTGAGCTTA TTTTTCCATT TGATATTCAT TGATATCATG ACTTCCAATT GAGAGGAAAA 1020 

TGAGATCAAA TGTCATTTCC CAAATTTCTT GTAGGCCGTT GTTTCAGATT CTTTCTGTCT 1080 

TOGAATGTAA ACATCTGATT CTGGAATGCA GAAQGAGGGG TCTGGGCATC TGTGGATTTT 1140 

TGGCTACTAG AAGTGTCCCA GAAGTrCACTG TATTTTTGAA ACTTCTAACG TCATAATTAA 1200 

GTTTCTCTTG TCTTGGGCAT CAAGANTAGT TCCAATTTTT TGGGCCGGGG CAGGGTGG 1258 



55 



(2) INFORMATION FOR SEQ ID NO: 221: 
60 (i) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: 1693 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLCXSY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 221: 
CACAATATAT GAAATAGTAC CCTCTAAAAA AGAGAAAAAA AAAATCAGGC GGTCAAACTT 
AGAGCAACAT 1CTCTTATTA AAGCATAGTT TATTTCACTA GAAAAAATTT AATATCAAGG 
ACTATTACAT ACPTCATTAC TAQGAAGTTC TTTTTAAAAT GACACTTAAA ACAATCACTG 
AAAACITCAT CCACATCACA CCCTGTTTAT TTTCCITAAA CATCTTGCSAA QCCTAAQCTT 
CTCAGAATCA TOTGGCAAGT GTGATGGGCA GTAAAATACC AGAGAAGATG TTTAGTAQCA 
ATTAAAQGCT GnTCCACCT TTAAGGACCA GCTGGGCTGT AGTGATTCCT QGGGCCAGAG 
TOGCATTATC TTTTTACAAA ATAATCACAT ATGTCACATG TTTGCATGTT TGriTCCTTG 
TTCAATTTTT GAACAGCCAG TTGACCAATC ATAGAAAGTA TTACTTTCTT TCATATGGTT 
TTTGCjnCAC TQGCTTAAGA GGTTTCTCAG AATATCTATG GCCACAGCAG CATACCAGTT 
TCCA1CCTAA TAGGAATCAA ATTAATTTTG TATCTACTGA TAACAGAATC TGGGTCACAT 
GAAAAAAAAT CATTTTATCC GTCTTTTAAG TATATGTTTA AAATAATAAT TTATGTGTCT 
GCATAITCCA GAACAGCTCT GAGAGCAACA GTTTCCCATT AACTCTrTCT GACCAATAGT 
GCTGGCACCG TTCCnCCTC TTTGGGAAGA GGAAAGGGTG TGTGAACATG GCTAACAATC 
TTCAAATACC CAAATTCTCA TAGCATAAAT AAAGTATTTA TTTTATGCCT CAGTATATTA 
TTATTTAATT nTTAGGrrAA TGCCTATCTC TTGCTTCTATT AAGGAAAGAA GCAATCAGTA 
GAGAATTCAG GATAGTTTTG TTTAAATTCT TGCAGATTAC ATOmTTAC AGTGGCCTGC 
TAOTCAGGAA AGGTATTCTT CYATACAACT TGTTTTAACC TTTGAGAACA TTGACAGAAA 
TTATGCAATG GTTK?ITCAG ATACGGACTT GATGGTGCTG TTTAATCAGr TTGCTTCCAA 
AffKSQCCTAC TCAAGAGGCC CTAAGACTGG TAGAAATTAA AAGGATTTCA AAAACTITCT 
ATTCCTTTCT TAAACCTACC AGCAAACTAG GATTGTGATA QCAATGAATG GTATGATGAA 
GAAACTTTGA CCAAAnTCT TTTTTTGTTG TTGTTGTTGT TTTGAATTTG AAATCATTCT 
TATTCCCTTT AAGAATOTTT ATGTATGAGT GTGAAGATGC TAGCGAACCT ATGCTCAGAT 
ATTCATCGTA ACTCTCCCTT CACCTGTTAC AGAGTTTCAG ATCGGTCACT GATAGTATGT 
ATTrCTTTAG TAAGAATCTG TTAAAATTAC AATGATCTTT TAAAAAGATG ATGCAGTTCT 
GTATTTATTC TGCTCTGTCT GGTCCTAAGT GGAGCCAATT AAACAAGTTT CATATGTATT 
irrCCAGTGT TGAATCTCAC ACACTGTACT TTGAAAATTT CCTTCCATCC TGAATAACGA 
ATAGAAGAGG CCATATATAT TGCCTCCTTA TCCTTGAGAT TTCACTACCT TTATGTTAAA 
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AGTTGrTGTAT AATTGTTAAA ATCTGTGAAA GAATAAAAAG TGGATTTAA?^ 1TARAAAAAA 
AAAAAAAAAA AAA 
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(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 
ACGCGTOGGT CGACCCACGC GTCCGCGACN TGGCGTGOTG GGGAAGQGAG AAGGATTTGT 
AAACCCCGGA GCGAGGTICT GCTTACCCGA GGCCGCTCCT GrTGCGGAGflC CCCCGGGTGA 
AQCCACCGTC ATCATGTCTG ACCAQGAGGC AAAACCTTCA ACTGAQGACT TCXXX3GATAA 
GAAGGAAGGT GAATATATTA AACTCAAAGT CATTQGACAG GATAGCAGTG AGATTCACTT 
CAAAGTOAAA ATGACAACAC ATCTCAAGAA ACTCAAAGAA TCATACTGTC AAAGACAGGG 
TGTTCCAATG AATTCACTCA GGTTTCTCTT TGAGQGTCAG AGAAOTOCTG ATAATCATAC 
TCCAAAAGAA CTGGGAATGG AGGAAGAAGA TGTGATTGAA GTTTATCAGG AACAAACGGG 
GGGTCATTCA ACAGTTTAGA TATTCTTTTT ATTTTTTTTC TTTTCCCTCA ATCCmTTT 
ATTTTTAAAA ATAfiTTCTTT TGTAATGTQG TGTTCAAAAC GGAATTGAAA ACTGGCACCC 
CATCTCTTTO AAACATCTGG TAATTTGAAT TCTAGTGCTC ATTATTCATT ATTGTTTGTT 
TrCAITCTGC TGATTTTTQG TGATCAAGCC TCAGTCCCCT TCATATTACC CTCTCCrTTT 
TAAAAATTAC GTGTOCACAG AGAOGTCACC TTTTTCAGQA CATTGCATTT TCAGGCTTGT 
GGTGATAAAT AAGATCGACC AATGCAAGTG TTCATAATGA CTTTCCAATT GGCCCTGATG 
TTCTAGCATG TGATTACTTC ACTCCTGGAC TGTGACTTTC AGTGGGAGAT GGAAGTTTTT 
CAGAGAACTG AACTGTGGAA AAATGACCTT TCCTTAACTT GAAGCTACTT TTAAAATTTG 
AGGGTCTGGA CCAAAAGAAG AGGAATATCA GGTTGAAGTC AAGATGACAG ATAAGGTGAG 
AGTAATCACT AACTCCAAAG ATGGCTTCAC TGAAGAAAAG GCATTTTAAG ATTTTTTAAA 
AATdTGTCA GAAGATCCCA GAAAAGTTCT AATTTTCATT AGCAATTAAT AAAGCTATAC 
ATOCAGAAAT GAATACAACA GAACACTGCT CTTTTTGATT TTATTTGTAC riTTTGGCCT 
GGGATATGGG TTTTAAATGG ACATTGTCTG TACCAGCTTC ATTAAAATAA ACAATA 
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(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 1791 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

10 (xi) SEQUENCE nESCRIPTICN: SEQ ID NO: 223: 

TCAGGGAGGT GGCAGGAAAG GCTTCGAACA GCTGCCGGAG TGACGGftGCG GCGC3CCCCGC 60 

CCGGTK3CGC TCGAGGTCGA AGCTTCCAGG TAGCGGCCCG CWSftGCCTGA CCCAGGCTCT 120 

GGACATCCTC AGCCCAAGTC CCCCACACTC AGTGCAGTGA TC3AGTC3CGGA AGTC3AAGGTG 180 

ACAGQGCAGA ACCAGGAGCA ATTPCTGCTC CTAGCCAAGT CGGCCAAC5GG GGCAGCGCTG 240 

20 GCCACACXCA TCCATCAGC3T GCTOGAGGCC CCTGGTGTCT ACGTGTTrGG AGAACTCCTG 300 

GACATGCCCA ATOTTAGAGA GCTGGCTGAG AGTGACTTTG OCTCTACCTT CCGGCTGCTC 360 

ACAcnxymG CTTATOCSGAC ATACGCTCAC TACTTAGCTG AAGCCCGCSAA TCTTCCTCCA 420 

25 

CTAACAGAGG CICAGAAGAA TAAGCTTCGA CACCTCTCAG TTGTCACCCT GGCTGCTAAA 480 

CTAAAGTOTA TCCCATATCC AGICTTOCTG GAGGTCTTGC CCTC3CGTAAT GIGOOGCAGC 540 

30 TOGAAGACCT TCTGATTGAG GCTCTC3TATG CTGACGTQCT TCGTCXSCTCC CTGGACCAGC 600 

GCAACCAGCG GCTCGAGGTT GACTACAGCA TCQGGCGC3GA CATCCAGCGC CAGGACCTCA 660 
OTSCCATTGC CCQAACCCTG CAGGAATC3GT GTGTGGC3CTG TRAGGTCGTG CTC3TCAGGCA 720 

35 

TO3AGGAGCA GOTGAGCCGT GCCAACCAAC ACAAGGAGCA GCAGCTGGGC CTGAAGCAGC 780 
AGATTGAGAG TCAGGTTGCC AACCTTAAAA AAACCATTAA AGTTACGACG GCAGCAGCAG 840 
40 CCGCAGCCAC ATCTCAGGAC CCTGAGCAAC ACCTGACTGA GCTGAGGGAA CCAGCTCCTG 
GCACCAACCA GCGCCASCCA GCAAGAAAGC CTCAAAGGGC AAGGGGCTCC GAGGGAGCGC 

CAA6ATITOG TCCAAGTCGA ATTOAAAGRA CTCTCOTITC CTCCCTGGGG ATGTGGGGTC 1020 

45 

CCAGCTGCCT GCCTCCCTCT TAQGAGTCCT CAGAGAGCCT TCTGTGCCCC TGGCCAGCTG 1080 

ATAATCCTAG CTTCATGACC CITCACCTCC CCTAACCCCA AACATAGATC ACACCTTCIC 1140 
50 TAGGGAGGAG KCAAATOTAG GTCATOTTTT TGTTGGTACT TrCTGTITrT TGTGACTrCA 1200 
TOTCTTCCAT TCCTCCCCOC TCCCAIGCTC TCTCCCTTGT TTCCTTAAGA QCTCAGCATC 1260 
TGTCCCIGTT CATTACATOT CATTCAGTAG GTGGGTAGCC CTGATGGGGG TCGCTCTGTC 1320 
TCGAGCATAA CCCACAGGCG TTITrrCTGC CACCCCATCC CTGCATGCCT GATCCCCAGT 1380 
TCCTATACCC TACCCCTGAC CTATTCAGCA GCCTCTGAAG AGCCATAGGG CCCCCACCTT 1440 
60 TACTCACACC CTCflGAATTC TOGGAGCCAG TCTGCCATGC CAGGAGTCAC TGGACATGTT 1500 
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CATCCTAGAA TCCTGTCACA CTACAGTCAT TTCTTTTCCT CTCTCTGGCX: CTTGQGTCCT 1560 

GGGAATGCTG CTGCTTCAAC COCAGAGCCT AAGAATGGCA GCCGTTTCTT AACATGTTGA 1620 

5 

GAGATGATTC TTTCTTGGCC CTGGCCATCT CGGGAAQCTT GATGGCAATC CTGGAAGQGT 1680 

TTAATCTCCT TTTGTGAGTT TGGTGGGGAA GGGAAGGGTA TATAGA1TGT ATTAAAAAAA 1740 

10 AAAAOSTATA TATGCATATA TCTATATATA ATATGACC3CA GAAATAAATC T 1791 



15 (2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2517 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

25 ACACTAGTGG ATCCAAAGAA TTCGGCACAG CGGCACAGCA TTGTTGAGCT TTTCTGTGTG 60 

TOTGQGGCCC TCAAGCGAGC TCGACTGGTC CATCCTQGGG TAGCGASGTG GTGTTTGTGA 120 

AAAAGGACGA TQCCATCACC GCATAYAAGA AGTACAACAA CCGGTGTCTG GACGGGCAGC 180 

30 

CGATCAAGTG CAACCTTCAC ATGAATQGGA ATGTTATCAC CTCAGACCAG CCCATCCTGC 240 

TGCGGCTCAG TGACAGCCCA TCAATGAAAA AGGftGAGCGA GCTGCCTCGC AGGGTGAACT 300 

35 CrcCCTCCTC CTCCAACCCC CCTGCCGAAG TGGACCCTGA CACCATCCTG AAGGCACTCT 360 

TCAAGTCCTC AGGGGCCTCT KTGACCACGC AGCCCACAGA WTTCAAAATC AAGCTTTGAG 420 



CAGGGGAGTR AGGCAGCCAG AAGTGGGGGC AGAGGAGGGT GGCTCTGTTT CCCCAAGGCA 480 

40 

AAGCTTATGA CCAATGGGCC ATCGGACTGG AGACCCCTGA TTGTGGGAAG GGTTGCCAGG 540 

GATAAAGAGC TTCCTCACTC GATGGGACCC GCCTTTCTGT GTTGTGTTCT GCCCTGTGCT 600 

45 CTTCTCTCTA CGTTAACGTT TCCTGTAGTA TGTTTCTTCA TCTCATCGCC AAGGTAGGCT 660 

TGTGTTTTTM AGTGTGTGCC TCCCCGAGCC TCAGCCCCAA GCTGATTTCT TATCTGGAAA 720 

TOGTACACTG AATTCTCTCG GTGGCTTTCT TGTQGCCCCA TGGGATGCAG CGTGGGGGCT 780 

50 

GTCTGAAGGA CCCTGCTTTT TCCAGGGGCC GAGGGGCTGC CTTTCCTTTG TGTGTATTAA 840 

GCTTTTCAAA CAATGGAGGG GATGGAGAGC CCTGGTGTCC TGACGGGAGC CAGGTCGGCC 900 

55 TGAGAGCTGT GCCGCTCCTC TGTCTTGTCA GTGGAGGTGC CTGGGTGGGG AGCAGGTCTC 960 

AGGCCTCTTG TCCTCTCCCC AGTGGCTCCA GGCCTCACTA GTGGCAAGGG CAGGATGAGG 1020 

CK3CACCGCT GGGAAGAGTC TATCTAAGCT CTTGGCTTGG AGTCCCGTGT CGTCTCCRCC 1080 
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CAGAGGAAGT TCTCCAGAGT TCACCTTTCC CTTTrCCTTG AGTTGTGCIG AATGCCCCAC 
CCCAGCTCTC TrrCCCTTCT G«n<3TCTTT GCTGGGAGGG GGCTGTGTTC TGAGCCCTCC 
5 CX3GTTCTCAC CTCGCXITCGC ACTTAACCAC ACCCTQGTTT TGTGTAQCXXS CCAGCTCTCT 
TCTOCTTCQG CTTTTGAAAG C3CTCAGCCTC CCATTGTGCA GTCCTTQQGT TTGGAGCTTA 
•PITCAATGGA AGAGC?rCAGT TTGTTCCTQG CTCTCCATTT CTGGCCTCAG TTGTCTACAG 



GACAOTGGTC AGGGATGCCT GGAGGCATAT ATCCAGCTGC CACCAAGGGG CACTCTTrGT 
TCCCACTTAT GTGAGTGACC CCATCCATCC ATGACCAGAG GATTATTTTC CTGCCTTGGC 
15 AGAGGAGGAG GAGTCAAGGG AQCAGGGCAG CTCTACCAGG CAAGGTGTTT CCCCAGCATA 
GGCGCAGACA GTTCGGACGA AACTTCAGAG CCCAGGCAGT CCCTGAATGA CCAGGCCAGT 
GTTGTCACTG AGIGGTCCCC TQCTGGTTGG GAGTCAAGAG AATCCAGGCT GGCAGAGCTG 



GAGCCAGTTG GGGAGCACGG TTCTGGGAGC TCTGCAAAAT CAGTAGCAAG TGCTQGAAAA 
QGCACATOCC GAAGATACTC AAGAGCTCCC AAGATTTQCT TGAGGCTAOC CCAGTCAAAA 
25 AAACCAGAGA CTCATCTTIC CAGQQGTCAG TCTGTCAGGC AGGAAGGACC CAGGATTTGA 
ACCCAGCTTC AGTCrTGCAGG CTCOXSAGGCT GCCCAGGACG GGAAAGTCCA AGGAAGGGQC 
CTCGTCGTGC TCCACTTGCA GTTCTTTAAA GAATGCTGCT TTTTATTCTC CTAACCCTTT 



CAAGIGGGTG CAGACTTCTC GTTAGCAGCT GGAAGACATT CCTCCCACAC TTTTCCCTTC 
CTCGCCCAAG AGAGCATCCA GAAGGCAGTA GGACCTGGTT TTTCAGGTAC TGGGAGCCGG 
35 QGGCTCACTG CTTGCACTGT GCTTAGGGTA GGGATQGTAA ATATCCTCCC TGCATGGCTT 
TATCCTCCCT CTCATCOCAA AGCAGGTATC TTCTGGTTGT CACAGAGTTT CATTGAGTCC 
AGCIGCAGCC ACGTGGCCAT CTGGAGCTQG TGCTATAGGT GACCATCTQG TACATTGAQG 



GGACCTOTTT GOCTCCTCCA CTCTATAAGC AGTCATCTTG GGAGACCXX3G AGGAGAAGGT 
GCTQGGCTAG TCCTGTGTCC TCCTCCACTT CCCATGCCTC TATGTTAOCC ATCTGTGTCT 
45 CCTOIGCAGA AQGAGAGGAA GGGGCATTAA GAGATGAAGG GTGATTATGT ATTACTTATC 
CATTTCTCAA TAAACATTTG TTATTCCTAA AAAAAAAAAA AAAAAACTCG AGGQGGG 
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(2) INF0RMATIC»4 FOR SEQ ID NO: 225: 



(i) SEQUENCE CHARACTERISTICS: 



55 



(A) I^GTH: 2424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANI^DNESS : double 

(D) TOPOLOGY: linear 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 
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TTGTANCTAA TCGAGGATTG ATTCTAATGA CAGAGTCTTT CAACACTTTG CACATGATGT 60 

A1CACGAAGC TACAGCTTGC CATGTGACTG GAGATTTAGT AE3AACTTCTG TCAATATTTC 120 

5 

TTrCGGTTTT GAAGTCTACA CX3CCCTTATC TTCAGAGAAA AGATGTGAAA CAAGCATTAA 180 

TCCAGTGGCA GGAGCGAATT GAATTTGCCC ATAAACTGTT AACTCTTCTT AATTCCTATA 240 

10 GTCCTCCAGA ACTTAGAAAT GCCTGTATAG ATGTCCTCAA GGAACTTGTA CTTTTGAGTC 300 

CCCATGATTT TYTTCATACT CTGGTTCCCT TTCTACAACA CAACCATTGT ACTTACCATC 360 

ACAGTAATAT ACCAATGTCT CTTGGACCTT ATTTCCCTTG TCRAGAAAAT ATCAAGCTAA 420 

15 

TAGGAGGGAA AAGCAATATT CX5GCCTCCGC GCCCTGAACT CAATATGTGC CTCTTGCCXZA 480 

CAATGGTGGA AACCAGTAAG GGCAAAGATG ACGTTTATGA TCGTATGCTG CTAGACTACT 540 

20 TCTTTTCTTA TCATCAGTTC ATCCATCTAT TATGCCGAGT TGCAATCAAC TGTGAAAAAT 600 

TTACTGAAAC ATTAGTTAAG CTCSAGTGTCC TAGTTGCCTA TGAAGGTTTG CCACTTCATC 660 

TTGCACTGTT CCCCAAACTT TGGACTGAGC TATGCCAGAC TCAGTCTOCT ATGTCAAAAA 720 

25 

ACTGCATCAA GCTTTTGTGT GAAGATCCTG TTTTCGCAGA ATATATTAAA TGTATCCTAA 780 

TGGATGAAAG AACTTTTTTA AACAACAACA TTGTCTACAC GTTCATGACA CATTTCCTTC 840 

30 TAAAGGTTCA AAGTCAAGTG TTTTCTGAAG CAAACTGTGC CAATTTGATC AGCACTCTTA 900 

TTACAAACTT GATAAGCXIAG TATCAGAACC TACAGTCTGA TTTCTCCAAC CGAGTTGAAA 960 

TTTCCAAAGC AAGTOCTTCT TTAAATGGGG ACCTGAGGGC ACTCGCITTG CTCCTGTCAG 102O 

35 

TACACACTCC CAAACAGTTA AACCCAGCTC TAATTCCAAC TCTGCAAGAG CTTTTAAQCA 1080 

AATGCAGGAC TTGTCTGCAA CAGAGAAACT CACTCCAAGA GCAAGAAGCC AAAGAAAGAA 1140 

40 AAACTAAAGA TGATGAAGGA GCAACTCCCA TTAAAAGGCG GCGTGTTAGC AGTGATGAOG 1200 

AGCACACTGT AGACAGCTGC ATCAGTGACA TGAAAACAGA AACCAGGGAG GTCCTGACCC 1260 

CAACX3AGCAC TTCTGACAAT GAGACCAGAG ACTCCTCAAT TATTGATCCA GGAACTGAGC 1320 

45 

AAGATCTTCC TTCCCCTGAA AATAGTTCTG TTAAAGAATA CCGAATGGAA GTTCCATCTT 1380 

CGTnrCAGA AGACATGTCA AATATCAGGT CACAGCATGC AGAAGAACAG TCCAACAATG 1440 

50 GTAGATATGA CGATTGTAAA GAATTTAAAG ACCTCCACTG TTCCAAGGAT TCTACCCTAG 1500 

COGAGGAAGA ATCTGAGTTC CCTTCTACTT CTATCTCTGC AGTTCTGTCT GACTTAGCTG 1560 

ACTTGAGAAG CTGTGATGGC CAAGCTTTGC CCTCCCAGGA CTCTGAGGTT GCTTTATCTC 1620 

55 

TCAGTTGTGG CCATTCCAGA GGACTCTTTA GTCATATGCA GCAACATGAC ATTTTAGATA 1680 

CCCTOTGTAG GACCATTGAA TCTACAATCC ATGTCGTCAC AAGGGATATC TGGCAAAGGA 1740 

60 AACCAAGCTG CTTCTTGACA TTAQGTGTAG CATGTCTACT TTTAAGTCCC TCACCCCCAA 1800 
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CCCCCATCCT GTTTCTATAA GTTTTCCTTA TTTGTTTTTG TGCTTCAGTT TGTCCAGTGC I860 

TCTCTCCTTG AATGC3CAAGA TAGATTTATA GGCTTAATTC TTC3C?rCAGGC AGAACTCCAG 1920 

^ ATCAAAAAAA CrTGCATCTT CAGTATACTT CCTAAAGGGC AATCAGATAA TQGATATGTT 1980 

rrATCTAATT AAGAGTTCAC TTTAGTOGCT TTCATTTAAT ATGGCTGTCT QGGAAGAACA 2040 

10 GQGITCCCTA GCCCTOTACA ATGTAATTTA AACTTACAGC ATTTTTACTG TGTATGATAT 2100 

GC?K5TCCTCT GTCCX:AGTTT TCTACCTTAT AGAQGCAGAT TGCCTCCGAT CXSCTCTGGTr 2160 

CTTATTATCA AAATTAAGTr TACITGTATA CGGAACAACC ACAAGAAATT TGATTCTGTA 2220 

AAGAATCCTC TTTAGCTOTG GCCTGGCAGT ATATAAATQG TGCrTTATTT AACAGAATAC 2280 

CTGTGGAGGA AATAAAGCAC ACTTGATOTA AAAATAATTG TTTTATTTTT ATTGACATGA 2340 

20 CTCATTGAIT GCTATTCTGT GCACTNAATT AAACTGATTG TGATGACTTA AAAAAAAAAA 2400 
AAAAAAAAAA AAAAAAAAAA AAAA 
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40 



(2) INFORMATION FOR SEQ ID NO: 226: 



50 



2424 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1080 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPdjOGY: linear 

35 (xi) SBQUE^3CE DESCRIPTIoa: SEQ ID NO: 226: 

ATATAC3GACG GATAATCTCT TTACATTCTG TrCTTCTCGA TGCACTCACA AGCGGGTAAC 60 

TAGGTCACAA GAAAACAAAG ATCTTATTCA AAAGAGGnCT TACAGCAACC CAACGTCTCA 120 

TCTTCCCATA GTAAAGATCA CQGCGCCTTG AGGTAAGCTA CAGGCAACAC CACTTCCGCG 180 

TTTCrCriGC GCCCIGGrcC AAGATCGCGG ATGAAGCCAC GCGACGTOTT GTGTCTGAGA 240 

45 TCCCGGTCCT GAAGACTAAC GCCGGACCCC GAGATCGTGA GTTGTGOGTG CAGCGACTGA 300 

AGGAQGAATA TCAGTCCCTT ATCCGGTATG TGGAGAACAA CAAGAATGCT GACAACGATT 360 

GC?rrCCGACT GGAGTCCAAC AAGGAAGGAA CTCGGTGGTT TGGAAAATGC TGGTATATCC 420 

ATGACCTCCT GAAATATCAG TTTCACATCG AGTTTGACAT TCCTATCACA TATCCTACTA 480 

CrcCCCCAGA AATTCCAOTT CCTGAGCTCG ATGGAAAGAC AGCAAAGATG TACAGGGGTG 540 

55 GCAAAATATC CCTGACGGAT CATTTCAAAC CTTTGTOGGC CAGGAATGTG CCCAAATTTG 600 

GACTAGCTCA TCTCATGGCT CTOGGGCTOG GTCCATGGCT GGCAGTGGAA ATCCCTGATC 660 

1GATTCAGAA GGGCGTCATC CAACACAAAG AGAAATGCAA CCAATGAAGA ATGAAGCCAC 720 

60 



455 



TGAGGCAGGG CAGAGGGACC TTTGATAGGC TACGATACTA TTTTCCTGTG CATCACACTT 
AACTCATCTA ACTGCTTCCC CGGACACCCT CCACCTCTAG TTGTTACTAA GTAGCTGCAG 
TAGGCATTGC TOGGGAAGAA ACAAACACAC ACCAAACAGT ACTGCTACTT AGTTTCTAAfi 
GCTCCACAGG GAAQGGAAAG ACTQGGCTTT GGACAATCTA GAGGTAATTT ATATCCGCCC 
CCAGGTOGAG CAACATGCGA TTCTGGAGGC ACGGGGGTAA CTGAAAGTGA CTACATATAG 
TCTTTCTGGT TTCTGGAGAT AACXX^^TCAA TAAAAGCTGC TTCCTCTGGG TAAAAAAAAG 

(2) INPC«MATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 
TTCCATTCAC AATTACTGGG AGGCAGGCAG GGOCAGTTGC ATGCTGGGQG TGGCTGCATG 
GSdGCCASC TCTCCTGGGT TTGAAGGATG CGGTACASCT GCTTCAGCTG AGCAACGATG 
TTATCCTTGA ICTCTGGGGT TGAGATCTGC AGGCGGACAC TGCCACTATC T^AAGGATCGT 
GTGAAATCAC CAGAAAACAT CTCGTAGATC ATCCGAGCCA CTACTGGAAT GACCTGAACC 
AAGATCAGTT TCCTTTCCAA TGCTTTCCCA TCTGGCCATT CTTCCCCAAA GCATAAGTAG 
ATCTCAAACG GTCGCTGCTT CTCTATCTGT CCTTTCTGGT GGGCAATGAG ATCGCTAAGG 
AATOmCCA GACAAAATAG CTTGACCTTC TTTTGTCTCT CAATCAGGTT GGGAGCAACA 
AGfTCATGGGG CACATGGCCC AGACCAGTAC ACCTTGCACT GGCACAGYCT GATQGCATAA 
ATCGCATCAC OGCTGACCTC CAGGATCAGT CCTCTGTCCA TGACGTCCAG CAGCTTGCTA 
OIXSAACAGCT TCTGCTTCTC ATTGGTAATA TGCTCAGGAC CTQQGAATTT GACCTGCTCC 
AOJCIGACGG GACCAAAGAG CTCCTCCTGG TCAGGCATG6 GACCCAGGTC CCCATAGAAG 
AGTCGGCAGC CCTGAGGGTT GCTCACGGTC ATGGTCCTGC CCGTACTCCT TCCCACOGTA 
CIX3AAACTTG ATGTCCAGGT CAGTCATTGG GAGAGAGCTG ATCCACAGTT CTGGAGAGCT 
ATAGAAOGRC TGTATAGGTG CCTGGGGifllAC TTCCATCTCC AGGGGTTCAG TTTTGGGCCA 
CACTOCCTCC GGSCTGCAGT TGCCCACACT GCAATTGCCC ACACTGGCTG GCGCCATGGG 
AGAACCATTG ATGTTCAGGA AGGGGAAGGT GTCCTGGATG GGAACATGGT GCTGCGACTG 
ATCCAGCTCA TCTTCCTCAT CTTCTTCATC CACATCATTA TCCTTCTCAT CCCAGGGAGC 
AGACCCTCTG GATCCTGGGT TAATGATCGA SCCCTGGGGC TGAGGGATGT CACACACTTG 
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ATATATCTTC ACTOCSGTrCA TCGC3CACCTC CCTTGGTGCC ATCCATACAT CCAGGTTGAA 
TTCTCTGCTC TTATTGAGAG CflCAGCGCAG CTGGGCCTTC CATTTAGCTG GGTCAGGGTC 
^ ATCCACCCCT TCCTCCSTACT TCCCTGTCTC TACAQCCCAG CSCCTTAAAAA TOGTATTTTC 
CICTTCTTOT TCAQGGCTAT GCCGGGTGGC ATGTTTCCAG QGAATCTGGA AGCGTTTAGA 
10 GTCCCTGTGT AGCCAG 



30 



40 



50 



15 (2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2043 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEDNESS: double 

(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTIOJ: SEQ ID NO: 228: 
25 TCAGCTOGTC CCITCCTTOT GTCCTGGGGG ACCTGCTQGC QGCCTCITCC TGGGAGCCAT 60 
GACCTCAGAC CCCACCCACA CTCCAGATCG AGACCCCTGC CTCCCCCCGG CAAATG?rCCT 120 
CCCGCTGCCT TGCAGCCTGC ACTTTGCACA TGCTCACCCC CAGCACAGTC CCACTOGCCC 180 
CTCAICTCCC CITCCCTCAG CTCCTTCCCA AGGACTCCTG GTCACTGCCT GCTGTGCAKT 240 
CAGAGGCCCA GGGTCCAGCA GCCCGGSGGG AACGGGTGCT GCCTSTTCCT CCAGTTAGCT 300 
35 CCAGYTCAGG TCTGAGACCC GTCYTGAGTA AAGGTCTGAG CAMCGACCGT GCCCTCTGCC 
CAGGGCTCGG TCCTCAGCAG CTOmTTCC TGCAGGAAGG TIGGAGCAAG CAAAGTCCTT 
CTCTCCCCTC AGGGTCAGCT GCCCAGACTG GGGCGGATGC AGAGAGQCAG GTGGGCTGTG 
GCTGGACTOG -TCCXSGAGCTG GCTTCCTTAC CAGAAAAGCC TCAGCCTTCC TCTGGAAGCA 540 
TCCCCCCTTTC TO3GCAAGGG GGAAGGGCTC dTTAAGGGG TGTGCTTTCC CAGTGGGGAG 
45 CASTCTOGCC CTCCCCCCTA CTAAAGCCTC TGCTCTCAGC ACTTTCCCCC AAGTCCTTGT 

AACTTCCTTG AAGCTTOGGTr CTCGCTGCCA GCCAGTCCCT GGACAAACTC TCCTQCCCCT 720 
TTTAAATTTC ACTCATTTTG TATAAACCCA GCAQGCTQGT GTTTACTTAG CCCTGTAGCT 780 
TTTTTCATTT TTTCriTCCG TCTTTCTTCT TGAGTTCACG GTTCAATATT GCCTCCTCGC 
CCTGGTCAGG GGAGGTGCTG CTTTTCTGCC CCACCTGCCG GCTQGTTCCA GCAGCGCTGG 
55 NGCCCAQCTG GGQGGCCGGG ATCGGGGCTT CTCTCTCTGG GAGGGGTGCA GGTGCCCTCC 
CCAGGCTOGG AGGGTTCCTT CCCTAGCTCC CCATCTGCCC CCGCTGGTGA GAGTTGGGCT 
TCTTOGTCTr GGAACTCCCT GGCATTOGGA ACAGAGCATT TCCAGCATTT GTTGTrGTTG 1080 
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TTTTACTCAC CTAACCCTTA GAAAATGAAT GTTAGAAGGT GCCTGCCGAG GCGGGACAGA 1140 

GTOTTroCTC GCGCTGGAGA AGGCTCTGCT CAGCCCTGAG AGTCCCTTCC TGCCCCACCG 1200 

ATACTOGCAC TTTAAAAAQG AAGCTGACCG CACAGTGTCC AGACGAATTG GCXTCXTCAGAA 1260 

GATCGGGAGT TCTOTCCTGC CCTTCTGTGT CTGCXnCACC TCACCCAGCC TAGGAGGGAG 1320 

GTOCATTCAG GGTAGATTTG CCTCTCATTC AAAGTTCTGG QGCTTTGGGY GGAAAACAGC 1380 

CACXOTTGGC GCTOTTGGGG AGACTCCTCC AGACCAGGAA CXCCAGAAGG AGACAGAGCC 1440 

TCCCACATCC TCCCACGCCA GGCCCTGGGC CAGGGTGATT GGACTGAGAA TrTGGCCACA 1500 

ACCAAATTGA TQCTGGCTGG AACCAGAGGC CAGAAAGCCT GGCCTTGTCC CCATGTGGGA 1560 

GCCCTOTCCT CAGCCCTCTT GTCCCXTTTGA GCTCAGTGAA TTCCCACCAG GTGCCCACAG 1620 

CTCCTCGACT TCAAATTCTA TATA1TGAGA GAGTTGGAGA GTATATCAGA GATATTTTTG 1680 

GAAAGGAGTT GGTCTATGCA ATGTCAGTTT GGAATCTTCT TGAAAGTTTA ATGTTTTTAT 1740 

TAGGAGATTT AAAGAAAATA AAQGTCTACA ATATCTTTAG GTTTTTTTTT TTTCCTGTTT 1800 

ACCGCACAAA CTGACCACAT GGCATGTCTA TCAGGATGGA GGGTGTCCAT GTTCTCCTCT 1860 

GTCTTTAGGG AOGTGATAAG GAGATGGSCG RAGGGGrTGTT TTTTTCTTTG ACTCCCCTCC 1920 

TTTCTAACAG AATOTTCCCA CCACTGCTT6 AGTQGGCTGT GrmGTPCCT CTGTCCCAGC 1980 

TTCTGTTOTA GAAAATAACA TTGTTAQOQG AACTCAGGCT AGTGTCAGCG TCTTQGTTTG 2040 

GGG 2043 
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(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 
TAAAAAGAAG CGGGAGAATC TGGGCGTCGC TCTAGAGATC GATGGGCTAG AGGAGAAGCT 
GTCCCAGTGT CGGAGAGACC TGGAGGCCGT GAACTCCAGA CTCCACAGCC GGGAGCTGAG 
CCCAGAGGCC AGGAGGTCCC TGGAGAAGGA GAAAAACAGC CTAATGAACA AAGCCTCCAA 
CTACGAGAAG GAACTGAAGT TTCTTCGGCA AGAGAACCGG AAGAACATGC TGCTCTCTGT 
GGCCATCTTT ATCCTCCTGA CGCTCGTCTA TGCCTACTGG ACCATGTGAG CCTGGCACTT 
CCCCACAACC AGCACAGGCT TCCACTTGGC CCCTTGGTCA GGATCAAGCA GGCACTTCAA 
GCCTCAATAG GACCAAGGTG CTGGGGTGTT CCCCTCCCAA CCTAGTGTTC AAGCATGGCT 
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TCCTGGCGGC CCAGGCCTTG CCTCCCTGGC CTGCTC3GGGG GTTCCGGGTC TCCAGAAGGA 480 
CATGC?K3CTG GTCCCTCCCT TAGCCCAAGG GAGAGGCAWT AAAGACACAA AGCTGGAAAT 540 

(2) INFORMATION FOR SEQ ID NO: 230: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
15 (D) T0P0UX3Y: linear 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 
AATTGTCAAA TATTAGAATA TTGITACTAT TTGACCCAAC TCAAAATCTC CATGGGAAAA 60 
TACCTCTCGA TACCCACAGT ATTGTTGAAA ATAATCAGAT GCAGTATCAC AGCTGTGTCA 120 
GACTCTAGTA CCAGTTCGGC AATCAAGGCA CAGCTAAAAA TTGAAAACAA AGATCTGGAC 180 
25 AACAAAACAG CCAAAQGTQG GQGTCAAGAA GCTCTGAOGT GTACCTAGCT CTTAGAATGCT 
A'TOCACACGT GCCAGGTOTA GTGTGCATAT CCAGGAAAAA CTGCAGAGAG CCCCAGTCTT 
CAMCTCTCGT TGACCATCAG CTCTGTGTAA GCAGGAAGTG AAGGCTAAGG CAGATTTAAG 360 
CTCTGAAAGC ATTCCACAAC ATACACACAA ATCGTGCAAA GCATTAAGGA AATCTTGTTA 420 
CTGCTAAGTG TTGCTGACCC AGGAACAA 



30 
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300 
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35 

(2) INFORMATION FOR SEQ ID NO: 231: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTIC»J : SEQ ID NO: 231: 
GTATGCTCCC CCAAACCAAT ATGTGTGGCT GCCTTTWACC TGACTTCTCC AACATGTAGC 60 
50 CCCAAGAGGA GGCCTCTAGA CTRAGGGAGG GGCTGGTGAC CCAGGTGTGG TGGGGCTGCA 120 
TGARACTACC AGAGAGACAG ACATTCTGGA ACTCACCCTG GGGGATCCAG TGGATCTGCC 180 
TATQGTCTGG TCCACCCCAG ACCTGTGAGA TGTTCCTCAT GAGGATGCAC TTGTGCTTCT 240 

55 

GCAAOTATTG CTCCAGCTTC ATAGTGACTC CCACCAGCAC CAGCAATACA GYTAGCTACC 300 
TGTGGCCTTC GATCTCAGCC AQCATCGCTG GGAGAGGGAG CARCTGGGCA TGTACCCTAA 360 
60 ATXXTCTTAC CAGGGAAGGA CTCCCAGAGT GAAGACAAGT AGGGACT 407 
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5 (2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 
10 (C) STRANDEE»J£SS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 

15 GTATTTGATT TCAGGCTGCT AAATGGGCTC ATTTAGCATT CATTCCTTGA TGTAfiACATT 60 

AAAAAAAAAA CTGAATAGCA TTCTTTCCAG GNTAACTAAT AAAGCAGACA TGCTAAGCCT 120 

ATAAATACAT CAGCACTGCA GCACACGTTT AAGGTTGCCA CGGACAAGGA TCACACAATA 180 

GAGAACACTG TAGTrCOGTC TGCTCACAAG ACCCAGAACA TTGATCAGTT TTTGTTGTTG 240 

GTTTATTATT TTTCTGTTAA AAAATTGTGA AAAGTTTGTT TTAGCTAGAT GATATTTTAA 300 

25 TAGCTGCGAG TGCTTTOGAA CTATAAAGAT GTCACTACTT AACACACATA CCTTATGTTT 360 

TGTTTTGTTT TGTTTTACAC TCAGTATAAA TCAGGAGAAG TTAGCCAACC ATCTAGCATT 420 

TAGAATCCTC TTTTTTATTG TCTTCTAAGG ATATGGATGT TCCCATAACA GCAACAAAAC 480 

AGCAACAAAA ACATTTCATA AATATCACTT GATAGACTGT AAGCACCTGC TTAACTTTGT 540 

GTNCCAAATA TTTAGTGTGT ATATATATAT ATATATACAC ACACACACAC ATATATATTC 600 

35 AACAAATAAA GCAAAATATA ACATGCATTT CACATTTTGT CTTTCCCTGT TACGATTTTA 660 

ATAGCAGAAC TGTATGACAA GnTAGGTGA TCCTAGCATA TGTTAAATTC AAATTAATGT 720 

AAAACAGATT AACAACAACA AAGAAACTGT CTATTTGAGT GAAGTCATGC TTTCTATTAT 780 

AATAACTTGG CTTOGGTTAT CCATCAAATG CACACTTATA CTGTTATCTG 830 



30 



40 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 233: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 932 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

CCAGAAGAAA GACCAATCTA GAATATGGAA CTCTAATCAC TTCTAGTATT TCAACTTCCT 60 

AGCAGAAATG AACTTGGCCC TAGACCTAGG GGATAAGCAA TGTTCTTTAT GTAGCCAATG 120 

60 CTACGGAAAC AAAAGAGGTG AAAGAGACCC TTTTTTTATA CTTAATGTAC ATATATTGAC 180 
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-nriTCAGCA AGAAIXSCCAG AAATAGCCTT CATTTCTACC CTGCAAAATA ATCCAGATCT 240 

GCTTTCTAAA ATCRANTCAG TTTCTAAAGT GAAACATGCA ATATTTATGC TCTGACTGAC 300 

^ TCCTCAATTG GARGAGGAAG RACTTCTGTT TACAGAAAAC YGTAITOTTA TATATGTCAG 360 

GCTOTOTATT GTCACTATCA GCATTCTOGT GCAAATGAAC TTTTCTCCAT CATCXSACTGT 420 

10 GGAAAATTGA TACTTTTAAA GCATATTCTT CTATGAGCAC AGGTCCTCCT AGTGAAACTr 480 

AATTIGACAA AGGOKSTCAT ATGCTTTCCT AACCTGfiWrT GTATTAACAT TCACAGftGCC 540 



15 



TACATTTTCT CATTAGQC?rT OTGATGCTCA GTATCTTTCC AAGTGCCAGG CAGRQCTTNC 600 

CTTTTCIGAT CAAACATACC ATTTTTTGTA TTTCACAACT ATAGACAGTC ACTTCTGCAG 660 

TCCCAATTTA AAAATGCAGA ACTOClTrAT OCAAGAATGC TGAAAAATAC TGTTCTATCC 720 

20 AGGTTTCOTA AACTATAAAA GCAGATTTTG CTrTTGlTrG TTAKECATAG GCATGGCCGA 780 

GCATTGTCGA TTAGCC1>5AG GCTTAAAATC ASATGCATGT CTGGTAAGAT GACCACTGTC 840 

TCACTAICAA GAGCCTGCAG AGCCATTTTC CAGACCTGTG ATTGCCCAGA ACACATAGTC 900 
CCCACGTTTC TAATTTGGAG CAAATCTT^ AG 



25 



30 



40 



(2) INFORMATICfN FOR SEQ ID NO: 234: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEWGTH: 2786 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



932 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
TTAGCAGQOT GAGCTGTTAA AACAGCACAC ATCTCTCATC CCCTCTTCCT ITATTCCCCC 
CTCGOTPTCA GAAAGGAAGG ATATATGGGG ACCACCTCCC CCTTCTTTGA TCCCAGCATC 
45 TCAOTCCCCC TCCCAACCCT CCATATGGCT CTCAATQGTG CTCACTTGCT TGGAAGCAQG 

CTCCCAATAG GGAGGGGSCT GCCCTCTACA GflCTCTTTGA CTGTAAGACA QGGCTCTGTA 240 
TCAGTGAGAC GATGAGAAAA GTCCCAGGCT AATGGCAGAA ATTTGCACTT TGAACATGTG 300 
TGTTmGTG TTCTGGAACC TGAGATTCCT TATTTATTAA CAGGAAGTCT GATTTTTTTT 360 
TTrTGGAGTC nTCITCCTA TATTTTGTGG GGCTGGGAGA GAGAGA1TAG ATTATTTTGA 420 
55 CATGGGATCC CTTCCATAAC AGGTACTTTC AAGGCAAGAC ATAGGGrTGA AGAAGCACAA 480 
CCAGCCTCTG AAATCATAGC TCTCCAGTGG cnTTAAAGA AAGCTGGTCC TCAGCACTAA 540 
CAAAATCACT ACAATAGCCT AGTGCTTTTT TGGAAGCCTT TTTAGGGAAG AATGTTAGGT 
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TCATGGTAAC TAGTATGCTC TTTGAGATTT TTACAGTGTT GAAACTTAAG AATTTTGAGA 660 

GGGTGAGGAG GGTTCTTCAG AATCTAAATT ACAGATAGAT GATTGTTTCT TGTGAATTTG 720 

5 TTTCTTTTCC TTTTTTTTTG TCCCTACCAT TTCCTTACAT TTCCXnTQGG GCCCATCTCT 780 

GGcrccnTCC rnTm ' iTii: ttqctttgct ttatcagttc attccagctc cctgttagtg 840 

AAGGACACTC CTCTTAGrTGA AQGAACAAAG TCTATCAGTC CTAAAATTTT AAGTCAAAGA 900 

AAACIGCTCT GTTTCCCCTT TAGTAACACT TCTGAAGAGG AAAAACTTCA ATAGCCAAAG 960 

TTAATAATCC TATATAATAA TTGCTTTGGC TTTCACCTAA AATTCTGGC3C ATCACAATTT 1020 

15 CCTTGGGATA GftGGTTCTGT TGGGGAATAG ATTGCTTATT GCTGTTCACT QGAGAGAAAA 1080 

GGTACnCTTT TTCTACAAGG TCATACCGCC AGAAGCXXXIA AATCCTATTT TQGCTCATCT 1140 

TCAGGTAAAG AGTAATTCCT ATCCTGTGTG CCTCAGAAGC TAGAATCGAA GGCTTACCCT 1200 

ATTCAOTCTT TATTGTCAGA AATGCATGAT GGCTCTTGGA AAGAATGACX3 TTTTGCTGGA 1260 

AAAAAAAAAA AGAACACTTIT GTGTTTCACA AACATGGCTT ATCAATTTTT TCAAAGAATT 1320 

25 CmriTCCC AAAAAGAGGA GTAACAAAAT GTCATTTCTG AAAGAGGCTT ACTTTATACC 1380 

AACTAGTCTC AGCATTTCGG ATGCCAGC3GA ACAGAGAGTG AGACACCTAC AATCACCACT 1440 

CICAAATCCG CTATTGTTTC TTTTCAGAGT. GTTGCAGATT TGCCATPrCT CCATAATATG 1500 

GGGATAGAAA AIXSGAATAAA GATAGAAGGG ATGTAGAATA TGCTTTCCTG CCAACATQGT 1560 

TTOGAGTCGA CTTTQGTATA TTGACTAGAT TTGAAAATAC AAGATTGATT AGATGAATCT 1620 

35 ACAAAAAAGT TGTCCTCCTC TCAQGTCCCT TTTACACTTT TTGACTAACT AGCATCTATA 1680 

TTCCACACTT AGCTTTTTTG TCACACTTAT arrTTGTCTC OGTAAATTTC ATTTGCAGTG 1740 

GTTAGTCATC AGATATTTTA GCCACCTACA CAAAAGCAAA CTGCATTTTT AAAAATCTTT 1800 

CTGAGATCGG AGAAAATCTA TTCTCCTTTC CTATACCGCT CTCCXrAACAA AAAAACAACT 1860 

AGTTACyrrCT ACTAATTAGA AACTK3CTGT ACTTTTTCTT TTCTTTTAGG GGTCAAGGAC 1920 

45 CXrrCTTTATA GCTACCATTT GCCTACAATA AATTATTGCA GCAGTTTGCA ATACTAAAAT 1980 

ATrmTATA GACTTTATAT rmXXrm'T GATAAAGGGA TGCTGCATAG TAGAGTTGGT 2040 

OTAATTAAAC TATCTCAGCC GrTTCCCTGC TTTCCCTTCT GCTCCATATG CCTCATTGTC 2100 

CTTCCAQGGA GCTCTTTTAA TCTTAAAGTT CTACATITCA TGCTrCTTAGT CMATTCTGT 2160 

TACCTmTA ATAACTCTTC CCACTGCATA TTTCCATCTT GAATTGGTOG TTCTAAATTC 2220 

55 TGAAACTGTA GTTGAGATAC AGCTATTTAA TATTTCTGGG AGATGTGCAT CCCTCTTCTT 2280 

KGTGGTTGCC CAAGGTTCTT TTCCGTAACT GAGACTCCTT GATATGCTTC AGAGAATTTA 2340 

GGCAAACACT QGCCATGGCC GTGGGAGTAC TGGGAGTAAA ATAAAAATAT CGAGGTATAG 2400 
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ACTAGCATCC ACATAGAGCA CTTCAACCTC CTTICTACCT GTTIOC3QGAA AAAGTATAAT 2460 
GACTOTACTA CCAATCTAAC TAAGATTATT ATAGTCTGGT TOTTTCAAAT ACCATlTrTr 2520 
5 TCTCCnTTG TOITITICCC ACTTTCCAAT CSTACTCAAGA AAATTGAACA AATGTAATC3G 2580 
ATCAAirrAA AATATTTTAT nCTTAAAAG CCTTITITGC CTGTICTAAT GTQCAGGACC 2640 
CTTCTCCrrT CATGGGAGAG ACAGGTAGTT ACCTGAATAT AGGTTGAAAA GGTTATGTAA 
AAAGAAATTA TAATAAAAGG GATACTTTGC OTXTCAAATC TTICTTTTCT CTTATTCTAG 
GTAAGGCATA TTAAAAATAA ATATGT 



10 



15 



(2) INFORMATION FOR SEQ ID NO: 235: 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 458 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

25 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 
GQCTGCAGGA ATTCGGCACG AGAGAATGTT TGATTTTCTT TCCTATTTTA AGGATCTTCT 
30 CTCTTOTrGA TOTTGAAAAC TTACCTTAGT GAAGATGTGT TTCAACATGC TGTTGTCCTT 
TACCTGCATA ATCACAGCTA TGCATCTATT CAAAGTGATG ATCTGTQGGA TAGTTTTAAT 
GAGGTCACAA ACCAAACACT AGATGTAAAG AGAATGATGA AAACCTGGAC CCTQCAGAAA 

GGArrrCCTT tagtgactgt tcaaaagaaa ggaaaggaac tttttataca acaagagaga 
rPCmTTAA atatcaagcc tgaaattcag ccttcagata caaqgtacat gccctctttc 
40 TTiTCATCCc ATCrcrrrTG cactctcagg tggaaatatt TrrAAGTGTr ttataatcat 
aagttcttgt gaaacctaac aagattatcc cttcctaa 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 236: 



2700 
2760 
2786 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

AGGATCAAGA GGAAATTATC TCITCGATTC CTCTCCAGGA AATCCTTCTC TATACITTAA 
AAGCTCITCT TCrrrTCTAG GARTCCAATG TGCTGATTGC TGCTAACAGT CAGGGTACAA 

60 



60 
120' 
180 
240 
300 
360 
420 
458 
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TTAAGGTGCT AGAATTGGTA TGAAGGGTTA 
AAATACATCT GCAGCTGACA ATGAGAGARG 
5 CAAAGTCATC ATGGGTTTTG GATrTGTTTT 
TTTATGAGCC TTTGGGACAT TGGGAATACC 
ATGGACATTG CTGCTCTTGG TGGTGTTATC 

10 

T6AATAAAAA TAAATAACWA AACAATAAAA 
TTTTTGTCAA ATGCNGCAAG AGATAACTCT 
15 GTAACAGTGA ATAATTTGTA AAGTTCGTAT 



463 

ACTCAAGTCA AATTGTACTT GATCCTGCTG 180 

AAACAGAAAA TGTCATGTGA TGTCTCTCCC 240 

GAATATTTTT TCTTTTTITC TnCTCCCTCC 300 

CAGCCAACTC TCCACCATCA ATGTAACTCC 360 

TAATTTTTGT GATAGGGAAA CAAATTCTTT 420 

GTTTATPGAG CTACAGTTGA GCTTGGAAAG 480 

TTTTANGAAG TAGCATATGT GAACTATAAT 540 

TTCCCAACCT CTTTGGGAAT T 591 



35 



20 (2) INFORMATION FOR SBQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1286 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

30 TCTTTTTAAG GTACAGCAGG GAAGAACTGG AAACTCAGAG AAAGAAACTG CCCTTCCATC 60 

TACAAAAGCT GAGTTTACTT CTCCTCCTTC TTTGTTCAAG ACTGGGCTTC CACCGAGCAG 120 

GAGATTACCT GQGGCAATTG ATGTTATCGG TCAGACTATA ACTATCAGCC GAGTAGAAGG 180 

CAGGCGACGG GCAAATGAGA ACAGCAACAT ACAGGTCCTT TCTGAAAGAT CTGCTACTGA 240 

AGTAGACAAC AATTTTAGCA AACCACCTCC GTTTTTCCCT CCAGGAGCTC CTCCCACTCA 300 

40 CCTTCCACCT CCTCCATTTC TTCCACCTCC TCCGACTGTC AGCACTGCTC CACCTCTGAT 360 

TCCACCACCG GGTTTTCCTC CTCCACCAGG OGCTCCACCT CCATCTCTTA TACCAACAAT 420 

AGAAAGTGGA CATTCCTCTG GTTATGATAG TSGTTCTGCA CGTGCATTTC CATATGGCAA 480 

TGCGATGAAG AACGATACAG ATACAGGGAA TATGCAGAAA GAGGTTATGA GCGTCACAGA 540 

GCAAGTCGAG AAAANGAAGA ACGACATAGA GAAAGACGAC ACAGGGAGAA AGAGGAAACC 600 

50 AGACATAAGT CTTCTCGAAG TAATAGTAGA CGTCGCCATG AAAGTGAAGA AGGAGATAGT 660 

CACAGGAGAC ACAAACACAA AAAATCTAAA AGAAGCAAAG AAGGAAAAGA AGCGGGCAGT 720 

GAGCCroCCC CTGAACAGGA GAGCACCGAA GCTACACCTG CAGAATAGGC ATGGTTTTGG 780 

CCTTTTOTGT ATATTAGTAC CAGAAGTAGA TACTATAAAT CTTGTTATTT TTCTGGATAA 840 

TGTTTAAGAA ATTTACCTTA AATCTTGTTC TGTTTGTTAG TATGAAAAGT TAACTTTTTT 900 

60 TCCAAAATAA AAGAGTCAAT TTOTCATGTT AAGTTAAAAA TCTTTGTCTT GTACTATTTC 960 



45 
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AAAAATAAAA AGACAGCAAT GACTTTATAT CCAAGAAAGG AATGTGAATG AGTCACTTAA 1020 

CAGGGAATCT AAAGAGCTGT GTTAGCTGTG TACATACACA GATTATCTGA GAAAAGGTCA 1080 

AGGCTTCCAC TTGGGCCACA GTTmTTGT TAATCAAACA CCACTCTCTT AAGRGGCTGC 1140 

ATCACAAARG GCAACCAARG GGCCCCTCTT ARGGCTTTGA GGATTAAAAC TAGTCTTTAT 1200 

10 CCATTACTCC TGTOGACACT CTTGGCrrRG TATWTTTAGG GGGGNTrCCTT ACCTrTTTTT 1260 

GGTTTTCCNC ACCTmTGG TTGGGC ^286 



15 
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40 



50 



(2) INFORMATION FOR SEQ ID NO: 238: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doxible 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

ATCGCAGCGC AGAAGGACCA GCAGAAAGAT GCCGAGGCGG AAGQGCTGAG OQGCACGACC 60 
CTGCroCCGA AGCTCATTCC CTCCGGTGCA GGCCGGGAGT GGCTGGAGCG QCGCCGCGCG 120 
ACCAICCGGC CCTO3AGCAC CITCGTGGAC CAGCAGCGCT TCTCACGGCC CCGCAACCTG 180 
GGAGAGCTCT GCCAGCGCCT CGTACGCAAC GTGGAGTACT ACCAGAGCAA CTATGTGTTC 240 
35 GTCTTCCTGG GCCTCATCCT GTACTGTGTG GTGACGTCCC CTATGTTGCT GGTGGCTCTG 
GCTCTCirrT TCGGCGCCIK3 TTAACATTCT CTATCTGCGC ACCTTGGAGT CCAAGCTTGT 
CCTCTITCGC CGAAAGGTGA GCCCAGCGCA TCATATGCTC TQGCTGGAGG CATCTCCTTC 
CCCTTCTTCT GGCTGGCTOG TGCGGGCTCG GCCGTCTTCT GGGTGCTGGG AGCCACCCTG 
GTOGTCATCG GCTCCCACGC TGCCTTCCAC CAGATTGAGG CTGTGGACGG GGAGGAGCTG 540 
45 CAGATCGAAC CCGTOTGAGG TGICrrCTGG GACCTGCCGG CCTCCCGOGC CAGCTGCCCC 600 
ACCCCroCCC ATCCCTOTCC TCCACGGTCT GCIGCTCGGG CCCACAGCGC CGTCCCATCA 660 
CAAGCCCGGG GAGGGATCCC GCCTTTGAAA ATAAAGCTGT TATGGGTGTC ATTCAAAAAA 720 

734 

AAAAAAAAAA AAAA 



300 
360 
420 
480 



55 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 239: 

5 

CGGGGrrCTTC AGGGTACCGG QCTOGTTACA GCAGCTCTAC CCCTCACGAC GCARACATGG 60 

CAGCGCAGAA GGACCAGCAG AAAGATGCCG AGGCGGAAGG GCTGAGCC3GC ACGACCCTGC 120 

10 'TCCCGAAGCT GATTCCCTCC GGTGCAGGCC GGGAGTGGCT GGAGCGGCGC CGCGCGACCA 180 

TCCGGCCCTC GAGCACCTTC GTGGACCAGC AGCGCTTCTC ACGGCCCCGC AACCTGGGAG 240 

AOCTGTOCCA GCGCCTCGTA CGCAACGTGG AGTACTACCA GAGCAACTAT GTGTTCGTGT 300 

TCCrOSOCCT CATCCTCTAC TGTGTGGTGA CGTCCCCTAT GTTGCTGGTG QCTCTGGCTG 360 

TCTTTTTCGG CGCCTOTTAC ATTCTCTATC TC3CGCACCTT GGAGTCCAAG CTTGTGCTCT 420 

20 TK5GCCGAGA QGTGAGCCCA GCGCATCAGT ATGCTCTC3GC TQGAGGCATC TCCTTCCCCT 480 

TCTTCTOGCT GGCTGGTGCG GGCTCGGCCG TCTTCTGGGT GCTGGGAGCC ACCCTGGTOG 540 

TCATCGGCTC CCACGCTGCC TTCCACCAGA TTGAGGCTGT GGACGGQGAG GAGCTGCAGA 600 

TGGAACCCGT GTCAGGTGTC TTCTGGGACC TGCCGGCCTC CCGGGCCAGC TGCCCCACCC 660 

CIX3CCCATGC CTGTCCTGCA CGGCTCTGCT GCTCGGGCCC ACAGCGCCGT CCCATCACAA 720 

30 GCCCGGGGAG GGATCCCGCC TTTGAAAATA AAGCTGTTAT GGGTGTCATT CAGGAAAAAA 780 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 



25 



35 



50 



(2) INFOHMATION FOR SEQ ID NO: 240: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2201 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 240: 

TOGACCCACG CGTCCGGCAA CATGGCGGCT GCCGTGGTGC AGCGCCCGGG CTGAGCGACA 60 

GCAAGTCCAG CQGQCTCCTA CCCCGGGTGA GGGGTGGCCT CCGCGTGGGA TCGTGCCCTC 120 

TTCAGCCCGC TCCTCTCCCC GACATCACGT GTATTCCGCA CGTCCCCTCC GCGCTGTGTG 180 

TCTACTCAGA CGGQGAGGCG TGACAGGGCC CGGGTCCCTT CTCAGTGGTG CTCTCTGCTT 240 

55 CAGGGCAAGC TCCCCGTCTC CGGGCGCACT TCCCTCGCCT GTGTTCGGTC CATCCTCCTT 300 

TCTCCAGCCT CCTCCCCTCG CAGGCGGATG AMCCGGACGA CGGGCCAGTG CCTGGCACCC 360 

CGGGGTTCCC ARGGTCCAMG GGGAACCCGA AGTCCGAGGA GCCCGARGTC CCGAACCAGG 420 
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ABGGQCTOCA GCXSCATCftMC OGCCTOTCTC CCGQCCXSITC GGCTCTCATA GTC3C3CGGTGC 
TOTOCTACAT CAATCTCCTC AACTACATGG ACCQCTrCAC CXrTOGCTGGC GTCCTICCCG 
ACATCGAGCA GTICTICAAC ATO3C3GGACA GfPAQCICTGG GCTCATCCAG ACCGTOTTCA 
TCTCCAOTTA CATOSrorro GCACCTOTOT TTGGCTACCT GQGTCACaaG TACAATOQGA 
ACTAOCTCAT C?raCGGC5C3GC ATTOCXTPrCT GGTCCCTCCT CaCACTOSGG TCATCCTTCA 



OrcCCACCTC CTCOOTOATC CTOC3GAOCCC CTACCICATT GGCCTGATCT CTGACCGCCT 



480 
540 
600 
660 
720 



TCCCCGGAGA GCATTrCTOG CTCCTCCrcC TCACCCQGC5G CCTOGTGC3QG GfTCOSGGftGG 780 
CCAGTrATTC CACCATCC3CG CCCACTCTCA TTGCCGACCT CrrrGTOSCC GACCAGOGGA 
15 COGGATOCTC AGCAlCrrCT ACTITOCCAT TCCGGTGGGC AGTOGTCTTCG GCTACATTGC 
ftBGCTCCAAA GTOAAGGATA TOQCTGGAGA CTOC3CACTGG GCTCTCAGGG TGACACCGGG 
TCTWSGASre OTOGCOCTTC TGCTOCIGTT CCTOCTA£3TG OGQGRGCCGC CAAGGGGftOC 

90 

COTOGAGCGC CACICAGATT TOCCACCCCT GAACCCCACC TC(?IGC?10GG CAGATCIGAG 
GGCTCTOGCA AGAAATCCTA CTITCGTCCT GTCTTCOCTG GGCTTCACrG CTGTGGCCTT 
25 TOTCACGGGC TCCCIGCSCTC TOKX3GCTCC C3GCATTCCTG CTGCGTTCOC GCGTGC3TCCT 
TCGGGAGACC CCACCCTCCC TICCCQGAGA CTCCTCCTCT TCCTCTGACA GTCICATCTT 
TOGACTCATC ACCTCCCTCA CCQGAGTCCT GGGTOTGCSGC CTCGGTC?K3G AGATCAGCCG 
CCGGCrCCGC CACrcCAACC CCTGGGCIXSA TCCCCTOGTC TCTOCCACTG GCCTCCTOQG 1380 
CTCTGCACCC TTCCICTTCC TOTCCCTTCC CTGCGCCX:GT GGTAOCATCG TCGCCACTTA 

35 TArmWATC ttcattcgag agaccctcct otccatgaac tgggccatcg togccgacat 

TCTOCTOTAC GTCGOXSATCC CTACCCX5ACG CTCCACOGCC GAGGCCTTCC AGATCGrTGCT 



840 

900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



1440 
1500 
1560 
1620 



GCGCCX3GAAC TOGCCCCCCT CCTICrTCTC CGAGTTOCGG GCTCPGCAGT TCTCGCICAT 1680 
GCrCTCCCSCG TITOITOGGG O^CTOQGCGG CGCACTTTCC TOGGCACCGC CATCITCATT 
45 GAGGCXTGACC GCCQGCGC3GC ACAGCTX3CAC GTC3CAGGGCC TCCIXSCACGA AGCAGQGTCC 
ACAGACGACC GGATTCTOCT GCCCCAGCGG OGCCGCTCCA CCCC3CGTG0C CXrTGGCCAGT 
CTSCTCATCr GAGARGCTCC CC3CICACCTA CCTGCACATC TGCCACAGCT GCSCCCIOGGC 
CCACCCCACG AAGGGCCKX; GCCTAACTCC TTO^CCTCGC CXIAGCPICCA GAGGGACTCT 1980 
C3GGCCGTCTG CCAGCTCCCA GACACTACMT GGGTAGCTCA GGGGAGGAGG TGGGGGT€CA 
55 GGAGGGGGAT CCCTCTCCAC AGGGGCAGCC CCAAGGGCIC GGTGCTATrT GTAACGGAAT 
AAAATITCTA GCCAGACCCC AGGTCCCIKSC ^TCGTCITr CTCIGGGTCG CCTCTGATCT 
TCCACCCCGT CITCACCCCA GGGCTCCTGA AGACTGTGGG T 

60 
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(2) INFORMATION FOR SEQ ID NO: 241: 

5 

(1) SEQUEZ^ CHARACTERISTICS: 

<A) LQQGTH: 1661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLXX?Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

GTCCTTCCCG ACATCGAGCA GTTCTTCAAC ATCGGGGACA GTAGCTCTGG GCTCATCCAG 60 

ACCGTOTTCA TCTCCAGTTA CATGGTGTTG GCACCTGTGT TTGGCTACCT GGGTGACAGG 120 

TACAATCGGA AGTATCTCAT GTGCGGGGGC ATTGCCTTCT GGTCCCTGGT GACACTGGGG 180 

20 TCATSCTTCA TCCCCGGAGA GCATTTCTGG CTGCTCCTCC TGACOCGGGG CCTGGTQGGG 240 

G?rcGQGGAGG CCAGTTATTC CACCATCGCG CCCACTCTCA TTGCCGACCT CTTTOTGGCC 300 

GACCAGCGGA SCGGATQCTC AGCATCTTCT ACTTTGCCAT TaXX5TGGGC''^3TGGTCTQG 360 

GCTACATTGC AGGCTCCAAA GTGAAGGATA TGGCTGGAGA CTGGCACTGG GCTCTGAGGG 420 

TGACACCGGG TCTAGGAGTG GTGGCCGTTC TGCTGCTGTT CCTGGTAGTG CGGGAGCCGC 480 

30 CAAGGGGAGC CGTGGAGCGC CACTCAGATT TGCCACCCCT GAACCCCACC TCGTGGTGGG 540 

CAGATYTCAG GGCTCTOGCA AGAAATCCTA GTTTCGTCCT GTCTTCCCTG GGCTTCACTG 600 

CTCTGGCCTT TOTCACGGGC TCCCTGGCTC TGTGGGCTCC GGCATTCCTG CTGCGTTCCC 660 

GCGTOG1CCT TGGGGAGACC CCACCCTGCC TTCCCGGAGA CTCCTGCTCT TCCTCTGACA 720 

GTCTCATCTT TOGACTCATC ACCTGCCTGA CCGGAGTCCT GGGTGTQGGC CTGQGTGTQG 780 

40 AGATCAGCCG CCGGYTCCGC CACTCCAACC CCCGGGCTGA TCCCCTGGTC TGTGCCACTG 840 

GCCTCCTCGG CTCTGCACCC TTCCTCTTCC TGTCCCITGC CTGOGCCCGT GGTAGCATCG 900 

TOGCCACTTA TATTTTCATC TTCATTGGAG AGACCCTCCT GTCCATGAAC TGGGCCATCG 960 

TGGCCGACAT TCTGCTCTAC GTGGTGATCC CTACCCGACG CTCCACCGCC GAGGCCTTCC 1020 

AGATCGTCCT GTCCCACCTG CTGGGTGATG CTGGGAGCCC CTACCTCATT GGCCTGATCT 1080 

50 CTCACCGCCT GCGCCGGAAC TGGCCCCCCT CCTTCTTGTC CGAGTTCCGG GCTCTGCAGTr 1140 

TCTCGCTCAT GCTCTGCGCG TTTGTTGGGG CACTGGGCGG CGCACTTTCC TGGGCACCGN 1200 

CATCTTCATT GAGGCCGACC GCCGGCGGGC ACAGCTQCAC GTGCAGGGCC TGCTGCACGA 1260 

AGCAGGGTCC ACAGACGACC QGATTGTGGT GCCCCAGCGG GGCCGCTCCA CCCGCGTGCC 1320 

CCyiGGCCAGT GTCCTCATCT GAGAGGCTGC CQCTCACCTA CCTQCACATC TOCCACAGCT 1380 

60 KGCCCTGGGC CCACCCCACG AAGGGCCTGG GCCTAACCCC TTGGCCTGGC CCAGCTTCCA 1440 
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GAGGGACCCT GGGCCOTGTC CCAGCTCCCA GACACTACMT GGGTAGCTCA GGGGAGGAGG 
OGGCSGffrcCA GGAG0C3QGAT CCCTCTCCAC AGGC3GNCACC CCAAGGGCTTC GGTGCTAITr 
C?rAACGGAAT AAAATTICTA C3CCAGACCCC AGGTC3CCTCC TCTCGfPCrm CTCTC3C3C3TGG 
CCrCTCATCT TGCACCCCGT CTTCACCTCA QGGCTCCTGA A 

(2) INFORMATION FOR SBQ ID NO: 242: 



40 



1500 
1560 
1620 
1661 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base i>airs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doxible 

(D) TOPOUXSY: linear 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 
NGACAGAAAA GCAGAAGATC AGACTCTOTT CATTCACTIT TCCTAGC5CCC ATCCTC?IGGT 60 
25 CATOTITCCC CCTCCCATCA TACCTCCTCC TTCCTGGAGC CTCTGCCGGC TTGGCTGTAA 120 
TGGTCGCACT TACCTGGATA TTTCAGTGGG AGGATGAAAG GCGAGACTCA CCCTACGCGG 180 
TGGGACAGAT GGGGAGAGGA AAAAGGCAGA GATNGCCAGG AGAGGGGTGC AGGACAAACC 240 

30 

AGftGACSGITC GCTCAGC3GGA AAAGTGTNGG GAGAAAGTGG GGTGCAOSCC CTGCAGGCOG 300 
GTITAGCCAG CAGCTIGCGGC CTCCCCGGC5C CCTTCGCATC CAACTTCGCA GACAGGGTAC 
35 CAGCCTCCTC GTOKSTATCA TAC3GATTO3T TCACATAGTG TTATCCATGA TCTTCGTAAG 

CTTAftEftAGC CGTOSTOfflG CaCCATCACA TCCAACCCOT ATATATAAAG ATAAATATAT 480 
ATATATATCT ATOHVAATTA TAGCACTOAG C3C3CXCTQCTG CCCTCCTGC5A CCAAGCAAAA 
CTAAGCXTPP TOOITTOGCT MTWrarTTC GmTCTTAT TTOITPGTTr CTGrTOGCTTC 
TCTTAICTCG TCATAGCaCA AGTGCCAGTC CXaVTrGCrCT GTATTACAGA ATAGTOmT 
45 TAATTCATCA ATOTTCTACT TAATGTCTAC CTCAGCACCT CCTCTTAGCC TAATPTTAGG 
AGGTTCCCCA ATnTOTTTC TTCAATITrA CTGC3TOACTr nTTOTflCAA ATCAATCTCT 
TICTCTCTrr CTCTCCTCCC CACCTCTCAC CCTTGCCCTC TCCATCTCCC TCTOXXSCCC 

Tccccrccrc cctcigcxttc cccc?rcrcAT itctgtccac TccATrcTCT cTCcxrrcrcr 

ccTCcrrccT GCTCCCCCCT CCCCAGCCCA CTTSCCOGAG TTGTGCTTCC CGCTCCTTAT 960 
55 CTCTTCTACT TCCMAAGCAG TTTCACTCGA AGTTGTGCAG TCCTGC3TTGC PCCTTVCCCC 

ATCTCCcrrc G-mccyrarA gattcacgcg TrrcrnGTA atticw3tgt ttctcacaag 

ATTTAAAAAA AAAAAAAGGA AAAAAAAAAA AAAAAAAAAC TCC»GGGQG6 GCCCGGTACC 
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(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LmSTH: 1350 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDECMESS: double 

(D) TOPOLOGY: linear 



1146 



15 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 243: 

AACCCACGGC TCCTCCQGCA GGGCGTGGAG GGCAGAQGGC CGCGGAGGCG CAGTTGCAAA 60 

CATOGCTCAG AGCAGAGACG GCQGAAACCC GTTCGCOGAG CCCAGCGAGC TTGACAACCC 120 

20 eirrcAGCCA CCACCAGCCT ATGAGCCTCC AGCCOCTCSCC CCATTGCCTC CACCCTCAOC 180 

TCCCTCCTTG CAGCCCTCGA GAAAGCTCAG CCCCACAGAA CCTAAGAACT ATGGCTCATA 240 

CAGCACICAG C3CCICAGCTG CAGCAGCCAC AGCTGAGCTG CTGAAGAAAC AGGAGGAGCT 300 

25 

CAACCGGAAG GCAGAGGAGT TGGACCGAAG GAC24CGAGAG CTGCAGCATG CTGCCCTGGG 360 

RQGCACAGCT ACTCGACAGA ACAATTGGCC CCCTCTACCT TCTTTTTGTC CAGTTCAGCC 420 

30 ciGcrrnTC CAGGACATCT CCATGGAGAT CCCCCAAGAA TTTCAGAAGA CTGTATCCAC 480 

CATOTACTAC CTCTGGATCT GCAGCACGST GGCTCTTCTC CTGAACTTCC TCGOCTGCCT 540 
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780 
840 



GQCCAGCTTC TCTCTGGAAA CCAACAATGG CGCAGGCTTT QGGCTTTCTA TCCTCTGGGT 600 

ccTCcrrrrc actccctgct ccpttgtctg ctggtaccgc cccatgtata aggctttccg 660 

GAOTGACAGT TCATTCAATT TCTTCGTTTT CTTCTTCATT TTCTTCGTCC AGGATGTGCT 720 
40 CTTTCTCCTC CAGGCCATTG GTATCCCAGG TTQGGGATTC AGTGGCTGGA TCTCTGCTCT 
GC?rGC?rGCCG AAGGCAACAC AGCAGTATCC GTGCTCATGC TGCTGGTCGC CCTGCTCTTC 

ACTGGCATTG CTGTGCTAGG AATTGTCATG CTGAAACGGA TCCACTCCTT ATACCGCCGC 900 

ACAGGIGCCA GCTTTCAGAA GGCCCAGCAA GAATTTGCTG CTGGTGTCTT CTCCAACCCT 960 

GCGGTCCGAA CCGCARCTTC CCAATGCAGC CGCTGGGGCT GCTGAAAATG CCTTCCGGGC 1020 

50 CCCGTGACCC CTCACTGGGA TGCCCTGGCC CTGCTACTTG AGQGAGCTGA CTTAGCTCCC 1080 

GTCCCTAAGG TCTCTGGGAC TTGGAGAGAC ATCACTAACT GATQGCTCCT CCGTAGTGCT 1140 

CCCAATCCTA TGGCCATGAC TGCTGAACCT GACAGQCGTG TGGGGAGTTC ACTGTGACCT 1200 

AGTCCCCCCA TCAGGCCACA CTGCTGCCAC CTCTCACACG CCCCAACCCA GCTTCCCTCT 1260 

GCTCTCCCAC GGCTGTTGCT TCGGTTATTT AAATAAAAAG AAAGTGGAAC TGGAAAAAAA 1320 

60 AAAAAAAAAA AAAAAAAAAG GQGGGNCCNC 1350 
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5 (2) INFORMATIC»J FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid 
JQ (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 
15 TCCCAGAGC3C CQGQGGGTTC CAGCTCTGCC TGTAGCAGAG CCCTGAGGAG GAGGAGGAAG 
AGGATGTCCT GAAATACGTC CGGGAGATCT TTTTCAGCTA GQGCATAAAC TGTOCACTGA 

ACTCTCTOCC GAQAGCAGCT GGAGGACAGC TGAQCTTCCA CTGGTGCTGC TGOGCCGMCC 180 

GCCTOTOQCSA ATCGGGCTCT CTCTCCTCCT ACCTrTGTGC CTTCTTGGGC CTQGCAGATT 240 

CACCTCAGGC CAGAAGCCCC TGGACACTCC GGOCCTTGQG GTGCCGTTCT GAGTGTGCQG 300 

25 AAGGCAGGAC TCAAAATGAG ATCCCATTTG ACTCCCTCTG TATGTACTGT GCCCTCTCCT 360 

GGCTCITCAG GCTCTGGAGT CCCAATTCTC TGTGTTAGTC AGTGACCAGG TTCCAGGGAA 420 

AATRATGTCA TC?rGGK3GTC CAACTTACTG GAACCAAAGA GACAGTACTT TGCAAAGAAA 480 

AGGATCACTC CCAGGTGCAC TGGAATTCCT ACAGTTTAGT CCGCATGATC TCTCCTGAAG 540 

GAGGAAGCCT GTITCAAAAA TAGTITCCAT CAIXSAGTCTA TCAATGAGCT CCCACCTCTC 600 

35 CAGCCAGCCT AGAAAGCAAA CGAGCTCCCC ACAGTrCTCT GCCCTGTCTC GGAGGTTGAG 660 

GCCACACJKyr ATAGACTOGT AAGCCAGACA GGCCTCCTCC CGCAAGCTGC TACClTGCrr 720 
TCACCTCTAC CTTCOTCCCC GGGCAGCTAG CTATAAAGCA AGAGGGACAG GAGCCCAGAA 
GAGACACTGA GGACAAGAGA TCACACCAGA GTACATG?rCT CTGCCTCTGT TTTCAGTGTG 
GCTTTOGACA GGAATATATC AATAAATCAC TGCCATACAG GTTTTCCAAT ACACAAGTQC 

45 TAGAAAATAC ACACAATTCC CCAATCCGTA AGTTGTGCTA ATGTCTTTCC AAGrTTCTGGG 960 

TTOGGAAGTG GAGGGTQGCA GCGTTTCTTT GTQCGCAACC GTCCAGTCCT GTTCACAGCG 1020 

AGGATITCGA GTCCTCCAGG GTCTCATCAT QGGAGTGATT TGTCAGCGGA CGCCTCTQCC 1080 

OTCTCrGGCT TCAGGTCCAG GGAAGCTTTG AAGCAGTCAA GCCTTCTCTr TGTACCCCAT 1140 

GICTCCTOTC rrrciTCAGT CACTCAGAGA TCACTCCTGG ACCTCTGGGG TTGGAGTTCC 1200 

55 A(?IX3A'rcGCT TATOGCGGCC CACTCACTAT GGTGGGCTGA GTGGAAGCTC CTTAACCATG 1260 

TCCCCAGAGA CACTCAQGTC CTCGCTCITr TAATGTCCTC GTriOTIGCC GTAAGTTCTT 1320 

TX5CTAGGTTr CATTTTQGCA TTOSGCAAAT CAGCCTGGAA GTCTGGCCCC ATGACAGCAA 1380 

60 



780 
840 
900 
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TCACrcCCTC CCCACCCrcC TGAAGCTAGA GGAAGATTTG CTCAGATCCA TTAATTAAAG 1440 
CAGGAATTCG TGTGACAATG AQCTGCATGG TTTAGGGAGT CTTTGGGAGC CTTGGAAGTC 1500 
CTGAAGGACA AACAATCTTG TACTAAGAA 1529 



25 



10 (2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LaJCra: 1537 base pairs 

(B) TYPE: nucleic acid 
IS (C) STRANDEDNESS: doixble 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 
20 GICCGAGGTC CCCGCCAGCC CCCAGCGGCC TTCCCGGCCC GGGQCGCTCC CAGAGCAAAC 60 

GAGGCCCCTG AGAGCTCCAC CTAGTTCACA GGATAAAATC CCACAGCAGA ACTCGGAGTC 120 

AGCAATOGCT AAGCCCCAQG TQGTTGTAGC TCCTGTATTA ATGTCTAAGC TGTCTGTGAA 180 

TGCCCCTGAA TTTTACCCTT CAGGTTATTC TTCCAGTTAC ACAGAATCCT ATC5AGGATGG 240 

TT<nx3AGGAT TATCCTACTC TATCAGAATA TGTTCAGGAT TTTTTGAATC ATCTTACAGA 300 

30 GCAGCCTGGC AGTTTTGAAA CTGAAATTGA ACAGTTTGCA GAGACCCTGA ATGGTTGTGT 360 

TACAACAGAT GATGCTTTGC AAGAACTTGT GGAACTCATC TATCAACAGG CCACATCTAT 420 

CCCAAATTTC TCTTATATOG GAGCTCGCCT GTGTAATTAC CTGTCCCATC ATCTGACAAT 480 

TAGCCCACAG AGTGGCAACT TCOQCCAATT GCTACTTCAA AGATGTCGGA CTGAATATGA 540 

AGTTAAAGAT CAAGCTCCAA AAQGQGATGA AGTTACTCGA AAACGATTTC ATGCATTTGT 600 

40 ACTCTTTCTG GGAGAACTTT ATCTTAACCT GGAGATCAAG GGAACAAATG GACAGGTTAC 660 

AAGAGCAGAT ATTCTTCAGG TTGGTCTTCG AGAATPGCTG AATGCCCTGT TTTCTAATCC 720 

TATGGATGAC AATPTAATTT GTGCAGTAAA ATTGTTAAAG TTGACAGGAT CAGTTTTGGA 780 

AGATGCTTCG AAGGAAAAAG GAAAGATGGA TATGGAAGAA ATTATTCAGA GAATTGAAAA 840 
CGTTGTCCTA GATGCAAACT GCAGTAGAGA TGTAAAACAG ATGCTCTTGA AGCTTCTAGA 900 
50 ACTCCGGTCA AGTAACTGGG GCAGAGTCCA TGCAACTTCA ACATATAGAG AAGCAACACC 960 

AGAAAATGAT CCTAACTACT TTATGAATGA ACCAACATTT TATACATCTG ATGGTGTTCC 1020 

TTTCACTGCA GCTGATCCAG ATTACCAAGA GAAATACCAA GAATTACTTG AAAGftGAGGA 1080 

CTTTTTTCCA GATTATGAAG AAAATGGAAC AGATTTATCC GGGGCTGGTG ATCCATACTT 1140 

GGATGATATT GATGATGAGA TGGACCCAGA GATAGAAGAA GCTTATGAAA AGTTTTGTTT 1200 

60 GGAATCAGAG CGTAAGCGAA AACAGTAAAG TTAAATTTCA GCATATCAGT TTTATAAAQC 1260 



35 



45 



55 
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ACrrrAGGTA TGGTGATTTA GCAGAACACA AGAGAGCAAG AAAATGTGTC ACATCTATAC 1320 

CAAATTRAGG ATGTTGAGTT A1X3TTACTAA TGTATGCAAC TTTAATTTTG TTTAACACTA 1380 

5 

TCTGCCAAAA TAAACTTTAT TCCCTATAAC TTAAAATGTG TATATATATA TAATAGTTTA 1440 

TTATOTACAG TTAATTCTAC TGTTTTGGCT GCAATAAAAT CX3ATTTTGAA ATAAAWKAAA 1500 

10 AAAAAAAAAA AAGQa«3GCC GCTCTAGAGG ANCCAAG 1537 



15 (2) INFORMATION FOR SEQ ID NO: 246: 

(i) SBC2UENCE CHARACTERISTICS: 

(A) LSNGTH: 506 base pairs 

(B) TYPE: nucleic acid 
20 (C) STRANDEENESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

25 TCCAQGATTT GGCCAGGACC CSCCGCGGTG GCGGTTGCTA TCQCTTCGCA GAACCTACTC 60 

AGGCAGCCAG CTCAGAAGAG TTGAGGGAAA GTGCTQCTGC TGGGTCTGCA GACGCGATGG 120 

ATAACGTCCA GCCGAAAATA AAACATCGCC CCTTCTGCTT CAGTGTGAAA GGCCACGTGA 180 

AGATGCrcCG GCTGGATA1T ATCAACTCAC TGGTAACAAC AGTATTCATG CTCATCGTAT 240 

CTCTGTIGGC ACTGATACCA GAAACCACAA CATTGACAGT TGGTGGAGGG GTGTTTGCAC 300 

35 TTCTCACAGC AGTATCCTGT CTTGCCGACG GGGCCCTTAT TTACCGGAAG CTTCTGTTCA 360 
ATCCCAGCGG TCCTTACCAG AAAAAGCCTG TGCATGAAAA AAAAGAAGTT TTGTAATTTT 



30 



40 



45 



55 



AAAAAAAAAA AAAAAAAATT TOGTGG 



(2) INFORMATION FOR SEQ ID NO: 247: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1348 base pairs 
50 (B) TYPE: nucleic acid 

{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



420 



ATATTACTTT TTAGTTTGAT ACTAAGTATT AAACATATTT CTGKATTATT CCAAAAAAAA 480 



506 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTCTTTCTTT TNCTCTTTTG AGTTGGTGAG TGAGTGAATA GGGTAACATG GGCCTTCAGG 60 

ATGACCCCTT GGAACTGTGC CGAGTTCCTT AAATCTCAGC TGGGATCCTG GACCTGGGAG 120 

60 GCCCCTGTGA GGGCCAGCTC TGGAAAAACC TGGGAGTTGA TGCCGGAGGY TGGGAAGAAC 180 
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TCTQCTCGAG GGCAGGGTGC CCTGC3AACAC TQGTAGTTCT GGGGCTOGGA GG6AGA0G06 240 

CTCaSQCTTT CTCTGAAATG AACACTGCTC TTCAGCAGTT CAAGTACTTG TTCTCAAAAC 300 

5 

ATTTTCTAAT TGATTGGTAG GTTTTCATAA GCATTGTTTC TTTAAGGCAT GGAAAGQGAA 360 

GAATGCTCAA GCAAGTCATG TTTGTTTTCA GTGGGATGGG CCCGCGTTCT CACTGCTGGG 420 

10 GGCTTCCCCT TGCATGTGGC ACCTTTGTGC AGGGCCACCA GGCAGACTCT TCCCACCTTC 480 

TCCCACTCAA GCACCAAQGG GCTTGAACCG TAATTTQGCT AATCAGAGGC ATTTTTTTTG 540 

TCCTAGTATC TTTCACACTT GTCCAACXX3T CTTATTTTTT TAAAAGTTCT GTrGCTTGTA 600 

TTAACACGAA ACTAGAGAGIA AATAGTTTCT GAAGCCAGTT TATTGTGAAG ATCCCCAAQG 660 

GGAGGTTCGG TAGftGAAAAA TAGTAAGCTG GTTTAGAAAC TGAOSAGGGC AAACAGCX»G 720 

20 GACXXATTGG AGAGGAATTT GCCAAAGATC TACCTTGAGA TAACGCCTCT CCAGTGTCTT 780 

CyWXACGTGA ATAACXZAGCG CTCCAAAGTG TTTTTCTGCT TTGAAAAAAA AAATTCCACA 840 

AGCTTTTAAA GGTGCATTTA AGAATCCATG TGACTTTAGA ATGGAACTGC CGGCCCTGGC 900 

25 

AACTGTCACXS TGTGCTAGAA GGTTCGATGC CTCTQGAATG CATGTGATAC TCATCTCCAT 960 

TTTGTTTCCT TGATTGCATT TTTGTTCTTT TAGCAGATCT GTCCXrTGTGG GTGGTGTCTA 1020 

30 AGAAGTCGGA CACCTTGGTT TTTGTGTTAG ATTGAGCTGG GCAGCTGCAA TCAGCTTCTT 1080 

TATATGCAAA TTAC3GCACGA CCCATCTGTC GTTCCCTGGT TGGTGGCTAA TGAAGTGAGG 1140 

GGAGQGAQGG ATGTCACXXC AAAAGTAGGC CCTCCCATTG GCTTTGGCCA GGCCAGACAC 1200 

35 

TTCACATCGT TTACATGGTT CTGTGTAATT TTAAAGTTTA TGTGTATAAA GCGAAGCTGT 1260 

TTCTCTGAAA CTGTATATTT TGTAAATAAA TATATTGCTA CTTTGAGAWR AAAAAAAAAA 1320 

40 AAAAACTCGA GGGGGGCCCX3 GTACXZCAA 1348 



45 (2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 
50 (C) STRANDETJNESS: double 

(D) T0P0UX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 
55 GTGCCGAATC GGCAGAGCGG CACGAGCGGC CACGAGAGCA GGCGGAGTAA AGGGACTTGA 60 
GCGAGCCAGT TGCCGGATTA TTCTATTTCC CCTCCCTCTC TCCCGCCCCG TATCTCTTTT 120 



60 



CACCCTTCTC CCACCCTCGC TCGCGTASCA TGGCGGAGCG TCGGCGGCCA CTCAGTCCCA 



180 



474 



TTCCATCTCC T0GTCX3TCCT TCGGAGCTGA GCCGTCCGCG CCCGGCGGCG GCGGGAGCCC 
AGGAGCCTGC CCCGCCCTGG GGACGAAGAG CTGCAGCTCC TCCTGTGCGG TQCACGATCT 
GATTTTCTGG AGAGATGTGA AGAAGACTGG GTTTGTCTTT QGCACCACGC TGATCATGCT 
GCTTTCCCTG GCAGCTTTCA G?rGTCATCAG TOTGGTTTCT TACCTCATCC TGOCTCTTCT 

ctctgtcacc atcagcttca ggatctacaa gtccgtcatc caagctgtac agaagtcaga 
agaaggccat ccattcaaag cctacctgga cgtagacatt actctgtcct cagaagcttt 
cx:ataattac atgaatgctg ccatggtgca catcaacagg gccctgaaac tcattattcg 
tctctttctg gtagaagatc tggttgactc cttgaagctg gctgtcttca tgtggctgat 
gacctatgtt ggtgctgttt ttaacggaat cacccttcta attcttgctg aactgctcat 
tttcagtctc ccx3attgtct atgagaagta caagacccag attcatcact atgttggcat 
cgcccgagat cagaccaagt caattgttga aaagatccaa gcaaaactcc ctggaatcgc 

CAAAAAAAA6 GCAGAATAAG TACATGGAAA CCAGAAATGC AACAGTTACT AAAACACCAT 
TTAATAGTTA TAACGTCGTT ACTTGTACTA TGAAQGAAAA TACTCAGfTGT CAGCTTGAGC 
CTGCATTCCA AGCTTTTTTT TTAATTTGGT GTTTPCTCCC ATCCTTTCCC TTTAACCCTC 
AGTATCAAGC ACAAAAATTG ATGGACTGAT AAAAGAACTA TCTTAGAACT CAGAAGAAGA 
AAGAATCAAA TTCATAGGAT AAGTCAATAC CTTAATGGTG GTAGAGCCTT TACCTGTAGC 
TTGAAAGGGG AAAGATTGGA GGTAAGAGAG AAAATGAAAG AACACCTCTG GGTCCTTCTG 
TCCAGTTTTC AGCACTAGTC TTACTCAGCT ATCCATTATA GTTTTGCCCT TAAGAAGTCA 
TGATTAACTT ATGAAAAAAT TATTTGGQGA CAGGAGTGTG ATACCTTCCT TQGTTTTTTT 
TTGCAGCCCT CAAATCCTAT CTTCCTGCCC CACAATGTGA GCAGCTACCC CTGATACTCC 
TTTTCTTTAA TGATTTAACT ATCAACTTGA TAAATAACTT ATAQGTGATA GTGATAATTC 
CTGATTCCAA GAATGCXJVTC TGATAAAAAA GAATAGAAAT GGAAAGTGGG ACTGAGAGGG 
AGTCAGCAGG CATGCTGCGG TGQCGGTCAC TCCCTCTGCC ACTATCCCCA GGGAAGGAAA 
RGCTCCGCCA TTrGGGAAAG TGGTTTCTAC GTCACTGGAC ACCQGTTCTG AGCATTAGTT 
TGAGAACTCG TTCCXGAATG TGCTTTCCTC CCTCTCCCCT GCCXACCTCA AGTTTAATAA 
ATAAGG1TGT ACTTTTCTTA CTATAAAATA AAAAAAAAAA AACTCGAGGG GGGCCCGGTA 
CCCAAATCGC CGGATATGAT CGTAAA 



(2) INFORMATION FOR SEQ ID NO: 249: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 249: 

AGTGTCCTCG GAGCAGGCGG AGTAAAGGGA CTTGAGCGAG CCAGTTGCXX3 GATTAITCTA 60 

10 TTTCCCCTCC CTCTCTCCCG CCCXX3TATCT CTTTTCACCC TTCTCCCACC CTCGCTCGCG 120 

TASCA-TCGCG GAGCGTCX5GC GGCCAdCAG TCCCATTCCA TCTCCTCGTC GTCCTTCX3GA 180 

GCCGAGCXX5T CCGCGCCCGG CGGCGGCGGG AGCCCAGGAG CXTPGCCCCGC CCTGGGGACG 240 

15 

AAGAGCTGCA GCTCCTCCTG TGCGGTQCAC GATCTGATTT TCTGGAGAGA TGTGAAGAAG 300 

ACTGGGTTTG TCTTTGGCAC CACGCTGATC ATGCTGCTTT CXXrTGGCAGC TTTCAGTGTC 360 

20 AICAGTGTGG TTTCTTACCT CATCXTTGGCT CTTCTCTCTG TCACCATCAG CTTCAGGATC 420 

TACAAGTCCG TCATCCAAGC TGTACAGAAG TCAGAAGAAG GCCATCCATT CAAAGCCTAC 480 

CTGGACGTAG ACATTACTCT GTCCTCAGAA GCTTTCCATA ATTACATGAA TGCTGCCATG 540 

25 

GTGCACATCA ACAGGGCCCT GAAACTCATT ATTCGTCTCT TTCTGGTAGA AGATCTGGTT 600 

GACTCCTTGA AGCTGGCTGT CTTCATGTGG CTGATGACCT ATGTTGGTGC TGTTTTTAAC 660 

30 GGAATCACCC TTCTAATTCT TGCTGAACTG CTCATTTTCA GTGTCCCGAT TGTCTATGAG 720 

AAGTACAAGA CCCAGATTCA TCACTATGTT QGCATCGCCC GAGATCAGAC CAAGTCAATT 780 

GTTGAAAAGA TCCAAGCAAA ACTCXCTGGA ATCXXX^^AAA AAAAGGCAGA ATAAGTACAT 840 

35 

GGAAACCAGA AATGCAACAG TTACTAAAAC ACCATTTAAT AGTTATAACG TCGTTACTTG 900 

TACTATGAAG GAAAATACTC AGTGTCAGCT TGAGCCTGCA TTCCAAGCTT TTTTTTTAAT 960 

40 TTGGTGTTTT CTCCCATCCT TTCCXmTAA CCCTCAGTAT CAAGCACAAA AATTGATGGA 1020 

CTGATAAAAG AACTATCTTA GAACTCAGAA GAAGAAAGAA TCAAATTCAT AGGATAAGTC 1080 

AATACCTTAA TGGTGGTAGA GCCTTTACCT GTAGCTTGAA AGGQGAAAGA TTGGAGGTAA 1140 

45 

GAGAGAAAAT GAAAGAACAC CTCTGGGTCC TTCTGTCCAG TTTTCAGCAC TAGTCTTACT 1200 

CAGCTATCCA TTATAG1TTT GCXTCTTAAGA AGTCATGATT AACTTATGAA AAAATTATTT 1260 

50 GGGGACAGGA GTGTGATACC TTCCTTGGTT TTTTTTTGCA GCCCTCAAAT CCTATCTTCC 1320 

TGCCCCACAA TGTGAGCAGC TACCCCTGAT ACTCCTTTTC TTTAATGATT TAACTATCAA 1380 

CTTGATAAAT AACTTATAGG TGATAGTGAT AATTCCTGAT TCCAAGAATG CXIATCTGATA 1440 

55 

AAAAAGAATA GAAATGGAAA GTGGGACTGA GAGQGAGTCA GCAGGCATQC TGCGGTGGCG 1500 

GTCACTCCCT CTGCXZACTAT CCCCAGGGAA GGAAARGCTC CGCCATTTGG GAAAGTGGTT 1560 

60 TCTACGTCAC TGGACACCGG TTCTCAGCAT TAGTTTGAGA ACTCX5TTCCC GAATGTGCTT 1620 
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TCCrcCCTCT CCCCTGCCCA CCTCAAGTTT AATAAATAAG GTTGTACTTT TCTTACTATA 1680 

AAATAAATGT CTGTAACTGC TGTGCACTGC TGTAAACTTG TTAGAGAAAA AAATAACCTG 1740 

5 

CATGTQGGCT CCTCAGTTAT TGAGTTTTTG TCATCCTATC TCAGTCTC3GG GGGGAACATT 1800 

CTCAAGAGGT GAAATACAGA AftGCCTTTTT TTCTPGATCT TTTCCCGAGA TTCAAATCTC 1860 

10 CGATTCCCAT TTGGGGGCAA GTTTTTTTCT TCACCTTCAA TATGAGAATT CAQOGAACTT 1920 

GAAAGAAAAA TCATCTGTGA GrTCCTTCAG GTTCTCACTC ATACTCATGA TCCTTCAGAG 1980 

GGAATATX3CA CTGGCGAGTT TAAAGTAAGG GCTATGATAT TTGATQGTCC CAAAGTACGG 2040 

15 

CAGCTGCAAA AAOTAGTrGGA AGGAAATTGT CTACGrTGTCT TGGAAAAATT AGTTAGGAAT 2100 

TTGGATQGGT AAAAGGTACC CTTGCCTTAC TCCATCTTAT TTTCTTAGCC CCCTTTGAGT 2160 

20 GTTTTAACTG GTTTCATG'TC CTAGTAOGAA OIXXOTTCTC CATCCTCATC CTCTGCCCTC 2220 

CCAGGAAGTC AGTGAWGTC TTTTTGGGCT TCXTCCTCCAA AGGACCTTCT GCAGTGGAAG 2280 

TGCCACATCC AGTTCTTTTC TTTTGTTGCT GCTGTGTTTA GATAATTGAA GAGATCTTTG 2340 

25 

TGCCACACAG GATTTTTTTT TTTTTTAAGA AAAACCTATA GATGAAAAAT TACTAATGAA 2400 

ACTCTGTGTA CGTCTCTGTG CGTGCAACAT AAAAATACAG TAGCACCTAA GGAGCTIGAA 2460 

30 TCTTGGTTCC TGTAAAATTT CAAATTGATG TQOTATTAAT AAAAAAAAAA AAAACAMAAA 2520 

AAAAAAAAAA AAAAGGQCGG CCGCTCTAGA GGATCCAAGC TTACGTACGC GTGCATGCGA 2580 
CGTCCATAQC TCTTTCTATA GGQGTCCCCC AAATTCCATT CANCGGGCOG TOSGTTTTAN . .2640 

35 

AAAGGTCGTG AMTGGGGGAA ANCX 2664 



40 

(2) INFORMATIOJ FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 865 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

50 

CGTGGGAGTG AGGTACCAGA TTCAGCCCAT TTGGCCCCGA CGCCTCTKTT CTCGGAATCC 60 
GQGTGCTGCG GATTGAGGTC CCGGTTCCTA ACGGTGGGAT CGGTGTCCTC GGGATGAGAT 120 
55 TTGGCGTTTC CTCQGGGCTT TGGTGGGATC GGTGTCCTCA GGATGAGATT TAGGGTTTCC 180 
TCGGGGCTTT CGGGATCTTC ACCTAATATC CGGACTGCAA GATGGAGGAA GGCGGGAACC 240 
TAGGAGGCCT GATTAARATG GTCCATCTAC TGGTCTTGTC AGGTGCCTGG GGCATGCAAA 300 

60 
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TGTGGGTGAC CTTCGTCTCA GGCTTCCTGC TTTTCCGAAG CCTTCCCCGA CATACCTTCG 360 

GACTAGTGCA GAGCAAACTC TTCCCCTTCT ACTTCCACAT CTCCATGGGC TG1X3CCTTCA 420 

5 TCAACCTCTG CATCTTGGCT TCACAGCATG CTTQGGCTCA GCICACATTC TGGGAGGCCA 480 

GCCAGCTTTA CXrPGCTGTTC CTGAGCCTTA CGCTGGCCAC TGTCAACGCC CGCTGGCTGG 540 

AACCCXX3CAC CACAGCTGCC ATGTGGGCCr TGCAAACCGT GGAGAAGGAG CXSAGGCX^TGG 600 

10 

GTCGQGAGGT ACCAGGCAGC CftCCAGGGTC CCGATCCCTA CCGCXMCTG CGAGAGAAGG 660 

ACXXCAAGTA CAGTOCTCTC CGCCAGAATT TCTTCCGCTA CXMGGGCTG TCCTCTCTIT 720 

15 GCAATCTQGG CTGCGTCCTG AGCAATGGGC TCTGTCTCGC TOGCXTTTGCC CTGGAAATAA 780 

GGAGCCTCTA GCATGGGCCC TGCATGCTAA TAAATGCTTC TTCAGAAAAA AAAAAAAAAA 840 

AAACTCGAGG GGGGCCCGGT ACCCA 865 

20 



(2) INFORMATION FOR SEQ ID NO: 251: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

TCQGGGGGGN AATGGGTGTC TGGCTCANGG ATTGCCNAAT CTGGAAATTC TCCATAACTT 60 

35 

GCTAGCTTGT Tm ' lTmT TTTTTTTACA CCCCCCCGCC CCACCCCCGG ACTTGCACAA 120 

TGTTCAATGA TCTCAGCAGA GTTCTTCATG TGAAACGTTG ATCACCTTTG AAGCCTGCAT 180 

40 CATTCACATA TTTTTTCTTC TTCTTCCCCT TCAGTTCATG AACTOGTGTT CATTTTCTGT 240 

GTGTGTGrrGT GTTTTATTTT GTTTGGATTT TTTTTTTTAA TTTTACTTTT AGAGCTTGCT 300 

GTGTTGCCCA CCTTTTTTCC AACCTCCACC CTCACTCCTT CTCAACCCAT CTCTTCCGAG 360 

45 

ATGAAAGAAA AAAAAAAGCA AAGTTTTTTT TTCTTCTCCT GAGTTCTTCA TGTGAGATTG 420 

AGCTTGCAAA GGAAAAAAAA ATGTGAAATG TTATAGACTT GCAGCGTGCC GAGTTCCATC 480 

50 GGGTTTTTTT TTTAGCATTG TTATGCTAAA ATAGAGAAAA AAATGCTCAT GAACCTTCCA 540 

CAATCAAGCC TGCATCAACC TTCTGGGTGT GACTTGTGAG TTTTGGCCTT GTGATGCCAA 600 

ATCTGAGAGT TTAGTCTGCC ATTAAAAAAA CTCATTCTCA TCTCATGCAT TATTATGCTT 660 

55 

GCTACTTTGT CTTAGCAACA ATGAACTATA ACTGTTTCAA AGACTTTATG GAAAAGAGAC 720 

ATTATATTAA TAAAAAAAAA AAGCCTGCAT GCTGGACATG TATGGTATAA TTATTriTTC 780 

60 C T TT Ti TTTT CCTTTTGGCT TGGAAATGGA CGTTCGAAGA CTTATAGCAT GGCATTCATA 840 
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15 



25 



35 



45 



CTTTTCTTTT ATTGCCTCAT GACTTTTTTG AGTTTAGAAC AAAACAGTGC AACCGTAGAG 900 

CCTTCTTCCC A1GAAATTTT GCATCTGCTC CAAAACTGCT TTGAGTTACT CAGAACTTCA 960 

ACCTCCCAAT GCACTGAAGG CATTCCTTGT QCAAAGATAC CAGAATGGGT TACACATTTA 1020 
ACCTOGCAAA CATTGAAGAA CICTTRATGT TrTCTTTTTA ATAAGAATGA CXSCCCCACTT 



(2) INFORMATION FOR SEQ ID NO: 252: 



1080 



10 TCGGGACTAA AATTCTOCTA TTQCXX3WSAA GCAGTCTAAA ATTTATnTT TAAAAAGAGA 1140 



AACTOCCCCA TTATTTTTGG TTTGrTTTTAT TTTTATTTTA TATTTTTTGG CTTTTGCSTCA 1200 

TTCTCAAATG TCGAA'TOCTC TGGGTTTCTA GTATATAATT TAAITCTAGT TTTTATAATC 1260 

TOTTAGCCCA GTTAAAATGT ATC3CTACAGA TAAAC3GAATG TTATAGATAA ATTTGAAAGA 1320 

GTTAGGTCTG TTTAGCTCTA GATTTTTTAA ACGATTGATG CACTAAATTG TTTACTATTG 1380 

20 TGATGTTAAG GGGGGTAGAG TTTGCAAGGG GACTGITTAA AAAAAGTAGC TTATACAGCA 1440 

TGTGCTTCCA ACTTAAATAT AAGTTGGGTA TGTGTAGTCT TTGCTATACC ACTGACTGTA 1500 

TTGAAAACCA AAGTATTAAG AGGGGAAACG CCCCTGTTTA TATCTGTAGG QGTATTTTAC 1560 

ATTCAAAAAT CTATOTTTTT TTTTCTTTTC AAAATTAAAG TATTTGGGAC TGAATTGCAC 1620 

TAAGATATAA CCTGCAAGCA TATAATACAA AAAAAAATTG CAAAACTGTT TAGAACGCTA 1680 

30 ATAAAATTTA TCCAGTTATA AAAATQGCAT TACTGCACAG TTTTAAGATG ATGCAGATTT 1740 

TTTTACAGTT GTATTGTGGT GCAGAACTGG ATTTTCTGTA ACTTAAAAAA AAATCCACAG 1800 

TTTTAAAGGC AATAATCAGT AAATGTTATT TTCAGGGACT GACATCCTGT CTTTAAAAAG I860 

AAATCAAAAG TAAATCTTAC CACAATAAAT ATAAAAAAAT CTTOTCAGTT ACTTTTCTTT 1920 

TACATATTTT GCTGTCCAAA ATTGTTTTAT ATCPTGAGTT ACTAACTAAC CACGCGTGTT 1980 

40 G1TCCTATGT GCTTTTCTTT CATTTTCAAT TCTQGTTATA TCAAGAAAAG AATAATCTAC 2040 

AATAATAAAC GGCATTTTTT TTTGAAAAAA AAAAAAAAAA AA 2082 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENCTTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTIWJ : SEQ ID NO: 252: 

CAGGCAGGCT GGCCCCGGGG ACTTCTCTCT GGCCCTGCTC CCTCCGAGCG CTCCGCCGTT 60 
GCCCGCCTOG CCCCTACGGA GTCCTTAGCC AGGATGGAGG CTGTTGTGAA CTTGTACCAA 120 

60 
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20 



GAGGTCATGA AGCACGCAGA TCCCCQGATC CAGGGCTACC CTCTGATGGG GTCCCCCTTG 180 

CTAATCACCT CCATTCTCCT GACCTAOGTG TACTTCGTTC TCTCACTTGG GCCTCGCATC 240 

ATGGCTAATC QGAAGCCCTT CXy^GCTCCGT GGCTTCATGA TTGTCTACAA CTTCTCACTG 300 

GTOGCACTCT CCCTCTACAT TGTCTATGAG TTCCTGATGT OGGQCTQGCT GAGCACCTAT 360 

ACCTGGCXXrr GTGACCCTGT QGACTATTCC AACAGCXCTG AGGCACTTAG GATGGTTCOG 420 

GXGGCCTGGC TCTTCCTCTT CTCCAAGTTC ATTGAGCTGA TGGACACAGT GATCTTTATT 480 

CTCCX3AAAGA AAGACGGGCA GGTGACCTTC CTACATGTCT TCCATCACTC TOTGCTTCCX: 540 

15 TGGAGCTGGT GGTGGGGGGT AAAGATTGCC CCX3GGAGGAA TGGQCTCTIT CCATGCCATG 600 

ATAAACTCTT CCGTGCATGT CATAATGTAC CTGTACTACG GATTATCTQC CTTTGGCCCT 660 

GTCGCACAAC CCTACCTTTG GTGGAAAAAG CACATGACAG CCATTCAGCT GATCCAGTIT 720 

GTCCTGGTCT CACTGCACAT CTCCCAGTAC TACTTTATGT CCAGCTGTAA CTACCAGTAC 780 

CCAGTCATTA TTCACCTCAT CTGGATGTAT GGCACCATCT TCTTCATGCT GTTCTCCAAC 840 

25 TTCTQGTATC ACTCTTATAC CAAGGGCAAG CGGCTGCCCC GrTGCACTTCA GCAAAATGGA 900 

GCTCCAGGTA TTGCCAAGGT CAAGGCCAAC TGAGAAGCAT GGCCTAGATA GGCGCCCACC 960 

TAAGTGCCTC AGGACTGCAC CTTAGGGCAG TGTCCGTCAG TGCTCTCTCC ACCTACACCT 1020 

GTCACCAAGG CrTATGTGGT CAGGACTGAG CAGGQGACTG GCCCTOCCCT CCCCACAGCT 1080 

QCTCTACAGG GACCACGGCT TTGGTTCCTC ACCCACTTCC CCCGGGCAGC TCCAGGGATG 1140 

35 TCGCCTCATT GCTOTCTGCC ACTCCAGAGC TGGGOGCTAA AAGGGCTCTA CAGTTATTTC 1200 

CCCCTCCCTG CCTTAAAACT TGGGAGAGGA GCACTCAGGG CTGGCCCCAC AAAGGGTCTC 1260 

GTGGCCTTTT TCCTCACACA GAAGAGGTCA GCAATAATGT CACTGTGGAC CCAGTCTCAC 1320 

TCCTCCACCC CACACACTGA AGCAGTAGCT TCTGGGCCAA AGGTCAGGGT GGGCGGGGGC 1380 

CTGGGAATAC AGCCTGTCGA GGCTGCTTAC TCAACTTGTG TCTTAATTAA AAGTGACAGA 1440 

45 QGAAACCAAA AAAAAAAAAA AAAAACTCGA GGQQQGCCCG TA 1482 



30 



40 



50 (2) INFORMATION FOR SEQ ID NO: 253: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEE»JESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTIC»3 : SEQ ID NO: 253: 



60 GGCACGAGCG CCGTTGCCCG CCTGGCCCCT ACGGAGTCCT TAGCCAGGAT GGAGGCTGTT 



60 



480 



GTGAACTTGT ACCAAGAGGT GATGAAGCAC GCAGATCCCC GGATCCAGGG CTACCCTCTG 
ATGGGGTCCC CCTTGCTAAT GACCTCCATT CTCCTGACCT ACGTGTACTT CGTTCTCTCA 
CTTGGGCCTC GCATCATGGC TAATCGGAAG CXTCTTCCAGC TCCGTGGCTT CATGATTGTC 
TACAACTTCT CACTGGTGGC ACTCTCCCTC TACATTGTCT ATGAGTTCCT GATGTCGGGC 
TGGCTGAGCA CCTATACCTG GCX3CTGTGAC CXTTCAGGACT GCACCTTAGG GCAGTOTCCG 
TCAGTGCCTT CTCCAMCTAC ACCTGTCACC AAGGCTTATG TGGTCAGGAC TGAGCAGGGG 
ACTGQCCCTC CCCTCCCCAC AGCTGCTCTA CAGQGACCAC GGCTTTGGTT CCTCACXXAC 
TTCCCCCGGG CAGCTCCAGG GATGTGGCCT CATTGCTGTC TGCCACTCCA GAGCTGGGGG 
CTAAAAGQGC TGTACAGTTA TTTCCCCCTC CCTGCCTTAA AACTTGGGAG AGGAGCACTC 
AGGGCTGGCC CCACAAAGGG TCTCGTQGCC TTTTTCCTCA CACAGAAGAG GTCAGCAATA 
ATGTCACTGT GGACCCAGTC TCACTCCTCC ACCCCACACA CTGAAGCAGT AGCTTCTGGG 
CCAAAGGTCA GGGTGGGCGG GGGCCTGGGA ATACAGCCTG TGGAGGCTGC TTACTCAACT 
TGTGTCTTAA TTAAAAGTGA CAGAGGAAAC CACGAAAAAA AAAAAAAAAA AAAA 

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1508 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
TTGAACTTTT AAAATTTTAG ATCAGCAAAC TCTAAGATCC TAGAATGGAA GCTGTTCCTC 
ATTTCTCCAT GCTCACCCTC CCAGGTCAGC GAGATGGTGA AGAAGCTGCA CGCGGCAACA 
CCACCAACGT TCGGAGTGGA CCTCATCAAT GAGCTTGTGG AGAACTTTQG CAGATGTCCC 
AAGTGGTCTG GTCGGCAAGC CTTTGTCTTT GTCTGCCAGA GTGTCATTGA QGATGACTGC 
CTTCCCATGG ACCAGTTTGC TGTGCATCTC ATGCCGCATC TQCTAACCTT AGCAAATGAC 
AGGGTTCCTA ACGTGCGAGT GCTGCTTGCA AAGACATTAA GACAAACTCT ACTAGAAAAA 
GACTATTTCT TGGCCTCTGC CAGCTGCCAC CAGGAGGCTG TGGAGCAGAC CATCATGGCT 
CTTCAGATGG ACCGTGACAG CGATGTCAAG TATTTTGCAA GCATCCACCC TGCCAGTACC 
AAAATCTCCG AAGATGCCAT GAGCACAGCG TCCTCAACCT ACTAGAAGGC TTGAATCTCG 
GTGTCTTTCC TGCTTCCATG AGAGCCGAGG TTCAGTGGGC ATTCGCCACG CATGTGACCT 
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GGGATAGCTT TCGGGGGAGG AGAGACCTTC CTCTCCTGCG GACTTCATTG CAGGTGCAAG 660 

TTGCCTACAC CCAATACCAG GGATTTCAAG AGTCAAGAGA AAGTACAGTA AACACTATTA 720 

TCTTATCTTG ACTTTAAGGG GAAATAATTT CTCAGAGGAT TATAATTGIC ACCGAAGCCT 780 

TAAATCCTTC TGTCTTCCTG ACTGAATGAA ACTTGAATTG GCAGAGCATT TTCCTTATGG 840 

AAGGGATGAG ATTCCCAGAG AOCTGCATTG CTTTCTCCTG GTTTTATTTA ACAATCGACA 900 

AATGAAATTC TTACAGCCTG AAGGCAGACG TGTGCCCAGA TGTGAAAGAG ACCTTCAGTA 960 

TCAGCCCTAA CTCTTCTCTC CCAGGAAGGA CTTGCTGGGC TCTGTGGCCA GCTGTICCAGC 1020 

15 CCAGCCCTGT GrTGTGAATCX; TTTGTGACGT GTGCAAATGG GAAAC3GAGGG GTTrTTACAT 1080 

CTCCTAAAGG ACCTGATGCC AACACAAGTA GGATTGACTT AAACTCTTAA GCGCAGCATA 1140 

TTGCTGTACA CATTTACAGA ATGGTTGCTG AGTGTCTGTG TCTGATTTTT TCATGCTGGT 1200 

20 

CATGACCTGA AGGAAATTTA TTAGACGTAT AATGTATGTC TGGTGTTTTT AACTTGATCA 1260 

TGATCAGCTC TGAQGTGCAA CTTCTTCACA TACTGTACAT ACCTGTGACC ACTCTTQGGA 1320 

25 GTGCTGCAGT CTTTAATCAT GCTGTTTAAA CTGTTGTGGC ACAAC3TTCTC TTGTCCAAAT 1380 

AAAATTTATT AATAAGATCT ATAGAGAGAG ATATATACAC TTTTGATTOT TTTCTAGATG 1440 

TCTACCAATA AATGCAATTT GTGACCTQTA TTAAAAAAAA NTAAAAAAAC TCGAGGGGGG 1500 

CCCGGTAC 1508 



30 



35 

(2) INFOBMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2514 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

45 

GAGAGACTCA CACTTCTTTT CCATTATCAC TGACGATGTA. GTGGACATAG CAGGGGAAGA 60 

GCACCTACCT GTGrTGGTGA GGTTTGTTGA TGAATCTCAT AACCTAAGAG AGGAATTTAT 120 

50 AGGCTTCCTG CCTTATGAAG CCGATGCAGA AATTTTGGCT GTGAAATTTC ACACTATGAT 180 

AACTGAGAAG TGGGGATTAA ATATGGAGTA TTGTCGTGGC CAGGCTTACA TTGWCTCTAG 240 

TGGATTTTCT TCCAAAATGA AAGTTGTTGC TTCTAGACTT TTAGAGAAAT ATCCCCAAGC 300 

55 

TATCTACACA CTCTGCTCTT CCTGTGCCTT AAATATGTGG TTGGCAAAAT CAGTACCTGT 360 

TATGGGAGTA TCTGTTGCAT TAGGAACAAT TGAGGAAGTT TGTTCTTTTT TCCATCGATC 420 

60 ACCACAACTG CTTTTAGAAC TTGACAACGT AATTTCTGTT CTTTTTCAGA ACAGTAAAGA 480 
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55 



60 



AAGGGGTAAA GAACTGAAGG AAATCTGCCA rrCICAGTGG ACAGGCAGGC ATGATGCTTT 540 

TGAAATTTTA GTGGAACTCC TGCAAGCACT TGITTTATGT TTAGATGGTA TAAATAGTGA 600 

CACAAATATT AGATGGAATA ACTATATAGC TGGCCGAGCA TTTGTACTCT GCAGTGCAC?r 660 

GTCAGATTTT GATTTCATTG rrACTATTGT TGTTCTTAAA AATGTCCTAT CTTTTACAAG 720 

AGCCTTTGGG AAAAACCTCC AGGGGCAAAC CTCTGATGTC TTCTTTQCGG CCGGTAGCTT 780 

GACTGCAGTA CTQCATTCAC TCAACGAAGT GATTGGAAAA TATTGAAGTT TATCATGAAT 840 

TTTGGTTTGA QGAAGCCACA AATTTGGCAA CXMACTTGA TATTCAAATG AAACTCCXTO 900 

GGAAATTCCG CAGAGCTCAC CAQQGTAACT TGGAATCTCA GCTAACCTCT GftGAGTTACT 960 

ATAAASAAAC CCTAAGTGTC CCAACAGTGG AGCACATTAT TCAQGAACTT AAAGATATAT 1020 

TCTCAGAACA GCACCTCAAA GCTCTTAAAT GCTTATCTCT GGTACCCTCA GTCATOQGAC 1080 

AACTCAAATT CAATACGTCG GAGGAACACC ATGCTGACAT GTATAGAAGT GACTTACCCA 1140 

ATCCTGACAC GCIGTCAGCT GAGCTTCATT GTTGGAGAAT CAAATGGAAA CACAGGGGGA 1200 

AAGATATAGA GCTTCCGTCC ACCATCTATG AAGCCCTCCA CCTGCCTGAC ATCAAGTTTT 1260 

TTCCTAATGT GTATGCATTG CTGAAGGTCC TGTGTATTCT TCCTGTGATG AAGGTTGAGA 1320 

ATGAGCQGTA TGAAAATQGA CGAAAGCGTC TTAAAGCATA TTTGAGGAAC ACTTTGACAG 1380 

ACCAAAOGTC AAGTAACTTG GCTTTGCTTA ACATAAATTT TGATATAAAA CACGACCTGG 1440 

ATTTAATGGT GGACACATAT ATTAAACTCT ATACAAGTAA GTCAGAGCTT CCTACAGATA 1500 

ATTCCGAAAC TGTGGAAAAT ACCTAAGAGA CTTTTAAAAA TAGGCTTTCT TATATTTGAT 1560 

ATTTGGAAGA AAAAGCCGTA AGTGTATGTA GACCACTTAA TCACTAAATA TCTTTQCCTA 1620 

TAGGACTCCA TTGAATACAT TAGCCMTGA TAATCTACCT GTTTAAATGG CXrCXTOTITG 1680 

AACTCTCAAG CTTTGAAGAC CTACCTGTTC TTCCAGAAGA GAACGTTGAA AGTGCCATGT 1740 

TTCCTTTTGC GTCATCTCTG TTGATGGCAC TCTGGAATTG TTTCAGTTAA GTCATTTTAG 1800 

ACATAGCATT TATTATCACT GTGGATCTCT ACTTGTTGGG TGTTATGAAT TCTTTGAAGA 1860 

AATATATTTT GAAGAGGTGT GGGAGGAAGG AATACATTTT ATAAAATGTT GTAGTGAAGC 1920 

CCACAATTGA CCTTTGACTA ATAGGAGTTT TAAGTATGTT AAAAATCTAT ACTGGACAGT 1980 

TACAAGAAAT TACCGGAGAA AAGCTTGTCA GCTCACCAAA CAAGGATTTC AGTGTAGATT 2040 
TTGTCTTTCT TGAACTTAAA GAAACAAATG ACAAAGTTTG AATGGAAAAG CCTGCTGTTG 2100 
TTCXACATCT CGTTGCTGTT TACATTCCTT TGTQGAGCCT ACATCTTCCT AAGCTTTTTA 2160 
GCAGGTATAT GTTGAACACT TCTGTTTCAT GGTTGAGACA GAATCAGAGG CCATGGATAC 2220 
TGACAACTGA TTTGTCTGTT TTTTTTCTCT GTCTTTTTCX: ATGACTCTTA TATACTGCCT 2280 
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aVTCTTGATT TATAAGCAAA ACCTGGAAAA CCTACAAAAT AAGTOTTGTC CSTTTATCTAG 2340 

AAAAATATGG AAAAJTATTGC TGTTATTTTT GGTGAAGAAA ATCAATTTTG TATAGTTTAT 2400 

ITCAATCTAA ATAAAATOTO AATTTTGTTT AAAQCTTAGG CACATTATTT TTTGTCGGGT 2460 

CAAAACATTC TTGTGTAAAT TCTCTTAAAC ATTTGATAAA CAQCTTCACA ATTC 2514 

(2) INFORMATION FOR SEQ ID NO: 256: 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: do\able 

(D) TOPOLOGY: linear 

20 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

CrcCCTTATG AAGCCGATGC AGAAATTTTG GCTGTGAAAT TTCACACTAT GATAACTGAG 60 

25 AAGTQGQGAT TAAATATQGA GTATTGTCGT GGCCAGGCTT ACAITGTCTC TAGTGGATTT 120 

TCTTCCAAAA TGAAAGTTGT TGCTTCTAGA CmTAGAGA AATATCCCCA AGCTATCTAC 180 

ACACTCTGCT CTTCCTGTGC CTTAAATATG TGGTTGGCAA AATCAGTACC TGTTATGGGA 240 

GTATCTGTTC CATTAGGAAC AATTGAGGAA GTTTGTTCTT TTTTCCATCG ATCACCACAA 300 

CTGCTTTTAG AACTTGACAA CGTAATTYCT GTTCTTTTTC AGAACAGTAA AGAAAGGGGT 360 

35 AAAGAACTGA AGGAAATCTG CCATTCTCAG TGGACAGGCA GQCATGATGC TTTTGAAATT 420 

TTAGTOSAAC TCCTGCAAGC ACTTGTTTTA TGTTTAGATG GTATAAATAG TGACACAAAT 480 

ATTAGATX5GA ATAACTATAT AGCTGGCCGA GCATTTGTAC TCTGCAGTGC AGTGTCAGAT 540 

TTTCATTTCA TTGTTACTAT TOTTGTTCTT AAAAATGTCC TATCTTTTAC AAGAGCCTTT 600 

GGGAAAAACC TCCAGGGGCA AACCTCTGAT GTCTTCTTTG CGGCCQGTAG CTTGACTGCA 660 

45 GTACTGCATT CACTCAACGA AGTGANTQGA AAATATTGAA GTTTATCATG AATTTTGGTT 720 

TCAGGAAGCC ACAAATTTGG CAACCAAACT TGATATTCAA ATGAAACTCC CTGGGAAATT 780 

CCGCAGAGCT CACCAGGGTA ACTTOGAATC TCAGCTAACC TCTGAGAGTT ACTATAAAGA 840 

AACCCTAAGT GTCCCAACAG TOGAGCACAT TATTCAGGAA CTTAAAGATA TATTCTCAGA 900 

ACAGCACCTC AAAGCTCTTA AATGCTTATC TCTGGTACCC TCAGTCATGG GACAACTCAA 960 

55 ATTCAATACG TCGGAGGAAC ACCATCCTGA CATGTATAGA AGTGACTTAC CCAATCCTGA 1020 

CACGCTGTCA GCTGAGCTTC ATTGTTGGAG AATCAAATGG AAACACAGGG GGAAAGATAT 1080 

AGAGCTTCCG TCCACCATCT ATCAAGCCCT CCACCTGCCT GACATCAAGT TTTTTCCTAA 1140 

60 
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TGTGTATGCA TTGCTGAAGG TCCTGTGTAT TCTTCXrrGTG ATGAAGGTTG AGAATC5AGCG 1200 

GTATGAAAAT GGACGAAAGC GTCTTAAAGC ATATTTGAGG AACACTTTGA CAGACCAAAG 1260 

5 GTCAAGTAAC TTQGCTTTGC TTAACATAAA TTTTGATATA AAACACGACX: TGGATTTAAT 1320 

GGTGGACACA TATATTAAAC TCTATACAAG TAAGTCAGAG CTTCCTACAG ATAATTCCGA 1380 

AACTGTGGAA AATACCTAAG AGACTTTTAA AAATAGGCTT TCTTATATTT GATATTTGGA 1440 

10 

AGAAAAAGCC GTAAGTGTAT GTAGACCACT TAATCACTAA ATATCTTTGC CTATAGGACT 1500 

CCATTGAATA CATTAGCCAT TGATAATCTA CCTGTTTAAA TGGCCCCTGT TTGAACTCTC 1560 

15 AAGCTTTGAA GACCTACCTG TTCTTCCAGA AGAGAACGTT GAAAGTGCCA TGTTTCCTTT 1620 

TGCGTGATCT CTGTTGATGG CACTCTQGAA TTGITTCAGT TAAGTCATTT TAGACATAGC 1680 

ATTTATTATC ACTGTGGATC TCTACTTGTT GQGTGTTATG AATTCTTTGA ASAAATATAT 1740 

20 

TTTGAAGAGG TGTGGGAGGA AQGAATACAT TTTATAAAAT GTTGTACSTGA AGCOCACAAT 1800 

TGACCTTTGA CTAATAGGAG TTTTAAGTAT GTTAAAAATC TATACTQGAC AGTTACAAGA 1860 

25 AATTACCX3GA GAAAAGCTTG TGAGCTCACC AAACAAGGAT TrCAGTGTAG ATTTTGTCTT 1920 

TCTTGAACTT AAASAAACAA ATGACAAAGT TTGAATQGAA AAGCCTGCTG TTGTTCCACA 1980 

TCTCGTTGCT GTTTACATTC CTTTGTGGAG CCTACATCTT CCTAAGCTTT TTAGCAGGTA 2040 

30 

TATGTTCAAC ACTTCTGTTT CATGGTTGAG ACAGAATCAG AGGCCATGGA TACTGACAAC 2100 

TGATTTGTCT GTTTrTTTTC TCTGTCTTTT TCCATGACTC 1TATATACTG CCTCATCTTG 2160 

35 ATTTATAAGC AAAACCTGGA AAACX:TACAA AATAAGTGTT GTGGTTTATC TAGAAAAATA 2220 

TGGAAAATAT TGCTGTTATT TTTGGTGAAG AAAATCAATT TTGTATAGTT TATTTCAATC 2280 

TAAATAAAAT GTGAATTTTG TTTAAAGCTT AGGCACATTA TTTTTTGTGG GGTCAAAACA 2340 

40 

TTCTTGTGTA AATTCTC 2357 



45 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 689 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEiasiESS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

55 

ACTTTCTCGT GCAAAAAGAT GTTCAAGCCT TATTTTATAC TTGCCTGCCC CTTTCTCTTT 60 
CATTTATTGG AGTGAGCTGC AGCTCTAAGA AGACCTGTTC TTTTGAATGG AGAGTAGCAT 120 
60 CAGGAACCAG GATGTGGGTG CGAGGCGTGC TCCTGGCTGT TGCAGATTGC TGCACCCGGG 180 
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AGCTCTTAGT GGACAGAGCT AGAGGATATG TGCACGTACT TCCATCTCTC TCTCTGTCTC 240 

CX3ATTTTAGC CCAGCACCAC AGGGTACGTT CXMTTTTTC TCTCTTTCCA TAGCTGTAAG 300 

5 

GCCCTTTCTG GC3AATGGTTC TCATTCTCCT TAATCTATTA TTGGGTCAGT TTTCCTGCAT 360 

GTCCCCAGCC TCCCATCACT GCCACCCACT CCCCACAGAG ATGCCCTGCT CATCCGACTG 420 

10 GGGCTTTGAC TCCCACACTG TCTACCCCTC TTCTGTGGAC GCCCTGCTGC CAAAACCTTC 480 

AGCAAACAGC TTTCCAAATG GAAGriTGrrCA CTGTCARGGS CTTTACAATC AGCAACAGCA 540 

AAATCTACAT GCTGCTGAGG G7CCTGCCTC ATTAAGATGC AATAAATATG TAAGTACATA 600 

AAAACAGCAA TAGAAGAAAC GTAATGCTTT ATTCTCAAAT ATGNATGTCT ACATAGAAAA 660 
GCCAAAATTA TTAAGAATAG TAAGGAATT 
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(2) INFORMATION FOR SBQ ID NO: 258: 



689 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 
TCGACCCACG CGTCCGCCGA TGTGATGATT CCTGCGTATT CCAAGAACCG OGCCTATGCC 60 

35 ATCrrcrrcA tagtcttcac tgtcataggg gacgcccccg gcgctgtgct atcctgtgcc 120 

GGCCACCCTT GCGTTGGTTT TGCTGCTGTA CTGGTGGCGC CCCTGACCGT GGCTGTCTCC 180 

TCTTGAAGGA AGCCTGTTTC TGATGAACCT GCTGACAGCC ATCATCTACA GTCAGTTCCG 240 

GGGCTACCTG ATGAAATCTC TCCAGACCTC GCTGTTTCGG AGGCGGGTGG GAACCCGGCT 300 

GCCTTTCAAG TCCTATCCTC CATQGTGGGG GAGGGAGGAG CCTTCCCTCA GGCAGTTGGG 360 

45 GTGAAGCCCC AGAACTTGCT GCAGGTGCTT CAGAAGGTCC AGCTGGACAG CTCCCACAGA 420 

CAGGCCATGA TGGAGAAGGT GCGTTCCTAT GQCAGTGTTC TGCTCTCAGC TGAGGAGTTT 480 

CAGAAGCTCT TCAACGAGCT TGACAGAAGT GTQGTTAAAG AGCACCCGCC GAGGCCCGAG 540 

TACCAGTCTC CGrTTTCTGCA GAGCOTCCCA GTTCCTCTTC GGCCACTNAC TACTTTGACT 600 

ACCTGGGGAA CCTCATOGCC CTGGCAAACC TGGTGTCCAT TTGCGTGTTC CTGGTQCTGG 660 

55 ATCCAGATGT TGCTGCCTGC TGAGCGTGAT GACTTCATCC TGGGGGGTCT CAACTGCGTC 720 

TTCATTGTGT ACTACCTGTT GGAGATGCTG GCTCAAGGTC TTTTGCCCTG GGGCCTGCGA 780 

RGGTACYKKT CCTAACCCCA RCAAMGTGTT TTGAACGGGC TCCTCAMCGT TTGTCCTGGC 840 

60 
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TGGWWKKGSM GATCTCAACT CTCC3CTC?rGT ACCGATTGCC ACACCCAGGC TGGAGGCCGG 900 

ANATOC3TGGG CCTOCTGTCG CTGTCGGACA TGACCCGCAT ACTGAACATG CTCATCCnOT 960 

TCCGCTTCCT GCGTATCATC CTCAGCATGA AGCCGATGC3C CGTGGTGGCC AGTACCGTCC 1020 

TCGGCCTCGT CXMAACATG COTGCGTTTG GCGGGATCCT GGTQGTC3GTC TACTTACXSTAT 1080 

TTGCCATCAT TGGGATCAAC TICTTTAGAG GCGTCATTGT GGCICTTCCT GGAAACAGCA 1140 

GCCTCGCCCC TCCCAATAGG TCGGCGCCCT GTGGGAGCTT CGAGCAGCTG GAGTACTGGG 1200 

CCAACAACTT CGATCACTTT GCX3GCTGCCC TGGTCACTCT GTGGAACTTG ATGGTGGTGA 1260 

15 ACAACTCGCA GGTOTTTCTG GATGCATATC GGCGCTACTA AGGCCCGTOG TCCAAGMCT 1320 

ATTTTCTATT GTCGTQGCTG GTOTCGTCTG TCATCTGGGT CAACCTGTTT CTGGCCCTGA 1380 

TTCTGGAGAA CTTCCTTCAC AAGTGGGACC CCCGCAGCCA CCTGCAGCCC CTTGCTGGGA 1440 

CCCCAGAGGC CACCTACCAG AIGACTGTGG AGCTCCTGTT CAGGGATATT CTGGftGGAGC 1500 

CCGGGGAGGA TGAGCTCACA GAGAGGCTGA GCCAGCACCC GCACCTGTGG CTGTGCAGGrT 1560 

25 GACGTCCGGG TCTGCCATCC CAGCAGGGGC QGCAGGAGAG AGAGGCTGGC ATAACACAGG 1620 

TGCCCATCAT GGAAGAGGCG GCCA1GCTGT GGCCAGCCAG GCAGGAAGAG ACCTTTCCTC 1680 

TGACGGACGA CTAAGCTCGG GACAGGAACC AAGTCCTTTG CGTGTGGCCC AACAACCATT 1740. 

TACAGAACAG CTGCTGGTGC TTCAGGGAGG CGCCGTGCCC TCCGCTTTCT TTTATAGCTG 1800 

CTTCAGTGAG AATTCCXTTTG TCXSACTCCAC AGGGACCTTT CAGACAAAAA TGCAAGAAGC I860 

35 AGCGGCCTCC CCTOTCCCTT GCAGCTTCCG TGGTOCCTTT GCTGCCGGCA GCCCTTGGGG 1920 

ACCACAGGCC TCACCAGGGC CTGCACAGGT TAACCGTCAG ACTTCCGGGG CATTCAGCTG 1980 

GGAATCATAC TAATACCTCC GATTTTAGCC CAGCACCACA GGGTACGTTC CAGTTTrTAT 2040 

TTCrrrcCAT AGCTGTAAGG CCXOTTCTGG GAATGGTTAT CATTCTCCTT AATCTATTAT 2100 

TGGOTCAGrr TTCCTGCATG TCrCCAGCCT CCCATCACTG CCACCCACTC CCCACAGftGA 2160 

45 TCCCCTGCTC ATCCGACTGG GGCTTTGACT CTCACACTGT GTACCCXTTCT TGTGTGGACG 2220 

CCCTGCTGCC AAAACCTTCA GCAAACAGCT TTCCAAATGG AAGTTGTCAC TGTCAGQGCC 2280 

TTTACAATCA GCAACAGCAA AATCTACATG CTGCTGAGGG TCCTGCCTCA TTAAGATGCA 2340 



50 

ATAAATATGT AAGTACATAA AAAAAAAAAA AAAAAAA 2377 



55 

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOUJCy: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 259: 

5 

TCIX3irCGCC GTCGCCCCGC CCCTGGCCTT TGCCCGGTCG GGCQOGACTT CXrrGTGTCCT 60 

ATTTCCAAGG ACTCCAAAGC GAGGCCGGGG ACTGAAGGTG TGGGTGTCGA GCCCTCTGGC 120 

10 AGAGGGTTAA CCTGGGTCAA ATGCACGGAT TCTCACCTCG TACAGTTACX; CTCTCCXX3CG 180 

GCAOGTCCGC GAGGMVTTGA AGTCCTGAGC GCTCAAGTTT GTCCX3TAGTC GAGAGAAGGC 240 

CATQGAGGTG CCX3CCACCX3G CACCGCGGAG CTTTCTCTOT AGAGCATTGT GCCTATTTCC 300 

CCGAGTCTTT GCTGCCGAAG CTGTGACTGC CGATTCGGAA GTCCTTGAGG AGCGTCAGAA 360 

GCX3GCTTCCX: TACGTCCXIAG AGCCCTATTA CCCGGAATCT GGATGGGACC GCCTCCGGGA 420 

20 GCTGTTTGGC AAAGACACAG TGAACACTAG TCTGAATGTA TACCGAAATA AAGATQCCTT 480 

AAGCCATTTT GTAATTGCAG GAGCTGTCAC GGGAAGTCTT TTTAGGATAA ACGTAGGCCT 540 

GCGTGGCTGG TGGCTGGTGG CATAATTGGA GCCTTGCTGG GCACTCCTGT AGGAGGCCTG 600 

CTGATCGCAT TTCAGAAGTA CTCTGGTGAG ACTGTTCAGG AAAGAAAACA GAAGGATCGA 660 

AAGGCACTCC ATGAGCTAAA ACTGGAAGAG TGGAAAGGCA GACTACAAGT TACTGAGCAC 720 

30 CICCTTGftGA AAATTGAAAG TAGTTTACAG GAAGATGAAC CTGAGAATGA TGCTAAGAAA 780 

ATTGAAGCAC TCCTAAACCT TCCTAGAAAC CCTTCAGTAA TAGATAAACA AGACAAGGAC 840 

TGAAAGTCCT CTGAACTTGA AACTCACTGG AGAGCTGAAG GGAGCTGCCA TCTCCGATGA 900 

ATGCCAACAG ACAGGCCACT CTTTGGTCAG CCTGCTGACA AATITAAGTG CTGGTACCTG 960 

TGGTCGCAGT GGCTTOCTCT TGrCTTTTTC rmcmT r AACTAAGAAT GGGQCTGTTG 1020 

40 TACTCTCACT TTACTTATCC TTAAATTTAA ATACATACTT ATGTTTGTAT TAATCTATCA 1080 

ATATATGCAT ACATGAATAT ATCCACCCAC CTAGATTTTA AGCAGTAAAT AAAACATTTC 1140 

GCAAAAGATT AAAGTTGAAT TTTACAGTTA AAAAAAAAAA AAAAAAAAAA AAA 1193 

45 
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50 



(2) INFORMATION FOR SEQ ID NO: 260: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 

(xi) SEQtJENCE DESCRIPTION: SEQ ID NO: 260: 

GAAAAACCCA AAGATGCAGA CAATCTCTTT GAACATGAAT TGGGGGCTCT CAATATGGCT 60 
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GCATTACTAC C5AAAAGAAGA AAGAGCAAGT CXTCTTASTA ATCTTGCSCCC ATOXTOTAAG 

GocnrorocT tcagacggga ncrocAATO cgaaagcagc ttcttaaaaa -tgagaagggc 

5 ACCATAAAAC AAGCTTACAC GAOTSCTCCA ATGGTAGACA ATGAATTACT TCGATO3AGT 
CnCGGTrAT TTAAGCGGAA GACTACTO3C CATGCTCCAG GACATGAAAA GACTCAAGAT 
AATAAACTTT CACAGTCCAG TATCCAACAG GAACTGTGTG TGTCTTAAGA CCGAAGTTCA 
ATATOGTATT TTTOGTACTG TCTTCCTTCA GCAGTGCATA TTCTTTTGCA AAGTTCrTTG 
GTTPGACAAG CATTAGTCAC AAAGOCAGAA AAGATTTATC AGCCATGCTA AAAGAOTGAA 

15 GAATmGAT dTTAGAGAC ACTACOTTTC GCCAACTTAA GATTTTACGT TAATTTTTAC 
ATAGTATTTC ACACTCATCC AAAATAATGT GAAAACATCT AGATTTAGTA GTTrATTCIG 
CGCCmTGT TAAAACTCAA GATTTTCGAA AATGGTTGTC ACTGCTCTTC CAGCCTATGA 

20 

ATATTmCT GAAATGGAAC CATOGATTTA TGTCTGGATC ATCCATACAG AACCAACAAT 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 

TTTATTCAAA AACAATGTGT TCATCAAAGT AATTGCTCAC ATTGTGCAGT ACTATGTTGT 780 
25 ACAGACCACG K5AAAGGGAA IGCTOGTCTA GCTGGCGTGG TATGITTATA GGCGAATTrC 840 
AGCAGAAGGA AGCCAAAATA GmTTTCCT -PrTGAAAGTT TTTTAAAAAT TATTTCATGG 

cn^rmrrr taattaatat gtctccattg ttacaatgta tgttggatgt cttttgaccc 



30 

TAAATCCrrr TTITCTrATC AGAGATTCTG TACTATTTTT ATrrTTAATA AATGTATCrr 



900 
960 
1020 

cccirrccrrr GnrrAGATT tactitcctc ttcgttaatc TTArrccrGA tgatctagaa io80 

35 CATTAGTCAT CAACATTACA TCTITCATGC ITCAGATATT TTACrGCTTG TCTCCITATT 
GTTOGACAGC ITTAAACAGA C?ITCATCGTA CTTCAAATAT AGCTCATTGA TACTTAAGGG 
CANCTTCCTT GQGATGTCGG CTTTTTCGAA GGAAAAAAAT TNCXXXMAG GCAAATCCCA 



40 

GT 



1140 
1200 
1260 
1262 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 261: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1179 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOUXSy: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 
GGCAAACirr CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 
GGCrrcCGNC GGCGCCCTGG CCCGAAGAAG CGCAATTGGC GTTCCGCGAA CGTTGGCCCT 
60 CAACGGCTCG GCAGCCAGCC ATOTCCTCCA CCCAGGACAG CGGCCCTGGG CTACAAGGAC 



60 
120 
180 
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CTCGACCTCA TCTTCCTOCG CCGACCTC3CG CGQGGAAGGG GftGTTTCAGA CTGTGAAQGA 240 

CGTCGTCCTC GACTGCXriCT TCGACTTCTT ACCCX5AGGGG GTCAACAAAG AGAAGATCAC 300 

5 

ACCACTCACG CTCAAGGAAG CTTATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 360 

CCGA1X3GAGT CTTATATCCC TGTCAAACAA CAGTC3GCAAA AATGTGGAAC TGAAATTTGT 420 

10 GGATTCCXrrC CGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 480 

TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCXy^CAAT 540 

AATCGGGGAG AGCGTCTATG GCGATTTCCA GGAAGCCTTT GATCACCTTT GTAACAAGAT 600 

CATTOCCACC AGGAACCCAG AGGAAATCCG AGGGGGAGGC CTGCTTAAGT ACTGCAACCT 660 

CT TC GTCAGG GGCTTTAQGC CCGCCTCTGA TGAAATCAAG ACCCTTCAAA GGTATATGTG 720 

20 TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TQGAGTCCTA 780 

TTTCCAGAAC CACTTTGTGG GATTOGAAGA CCGCAAGTAT GAGTATCTCA TGACCCTTCA 840 

TGGACTGGTA AATGAGAGCA CAGTGTGCCT GATGGGACAT GAAAGAAGAC AGACTTTAAA 900 

CCTTA-TCACC ATX3CTGGCTA TCCGGGTGTT AGCTGACCAA AATGTCATTC CTAATGTGGC 960 

TAATGTCACT TGCTATTACC AGCCAGCCCC CTATGTAGCA GATGCCAACT TTAGCAATTA 1020 

30 CTACATTQCA CAGGTTCAGC CA6TATTCAC GTGCCAGCAA CAGACCTACT CCACTTGGCT 1080 

ACCCIGCAAT TAAGAATCAT TTAAAAATGT CCTGTGQGGA AGCCATTTCA GACAAGACAG 1140 

GAGAGAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAGAGC 1^79 

35 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 262: 



(i) SEQU^CE CKARACIERISTICS : 

(A) LENGTH: 1162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
45 (D) TOPOLOGY: linear 



50 



(xi) SEQUENCE DESCRIPTIOJ : SEQ ID NO: 262: 

GGCAAACTTT CCCCCAANGC TTCGAAACTT GCAAGCCGAA ACCTTGAATC GTTAAAAGTT 60 

GGCTTCCX3NC GGCGCCCTGG CCCGAAGAAG CGCAATTQGC GTTCXX3CGAA CGTTGGCCCT 120 

CAACGGCTCG GCAGCCAGCC ATGTCCTGCA CCCAGGACAG CGGCCCTCGG CTACAAQGAC 180 

55 CK3GACCTCA TCTTCCTGCG CCGACCTGCG CGGGGAAGGG GAGTTTCAGA CTGTGAAGGA 240 

CGTCX5TGCTG GACTC3CCTGT TGGACTTCTT ACCCGAGGGG GTGAACAAAG AGAAGATCAC 300 

ACCACTCACG CTCA7M3GAAG CTTATGTGCA GAAAATGGTT AAAGTGTGCA ATGACTCTGA 360 



490 



CCGATQGAGT CTTATATCCC TGTCAAACAA CAGTGQCAAA AATGTGGAAC TGAAATTTGT 
GGATTCCCTC OGGAGGCAGT TTGAATTCAG TGTAGATTCT TTTCAAATCA AATTAGACTC 
TCTTCTGCTC TTTTATGAAT GTTCAGAGAA CCCAATGACT GAGACATTTC ACCCCACAAT 
AATCGGGGAG AGCGTCTATG GCGATTTCCA GGAAGCCTTT GATCACCTTT GTAACAAGAT 
CATTGCCACC AGGAACCCftG AGGAAATCCG ACSGGGGftGGC CTGCTTAAGT ACTQCAACCT 
CTTGGTGAGG GGCTTTAGGC CCX3CCTCTGA TGAAATCAAG ACCCTTCAAA GCTATATGTG 
TTCCAGGTTT TTCATCGACT TCTCAGACAT TGGAGAGCAG CAGAGAAAAC TGGAGTCCTA 
TTTCCAGAAC CACTTTGTQG GATTGGAAC3A CCGCAAGTAT GAGTATCTCA TGACCCTTCA 
TGGAGTGGTA AATGAGAGCA CAGTGTC3CCT GATGGGACAT GAAAGAAGAC AGACTTTAAA 
CCTTATCACC ATGCTGGCTA TCCGOGTGTT AGCTGACCAA AATGTCATTC CTAATGTGGC 
TAATGTCACT TGCTATTACC AGCTAGCCCC CTATGTAGCA GATGCCAACT TTAGCAATTA 
CTACATTGCA CAGGTTCAGC CAGTATTCAC GTQCCAGCAA CAGACCTACT CCACTTQGCT 
ACCCTGCAAT TAAGAATCAT TTAAAAATGT CCTGTGGGGA AGCCATTTCA GACAAGACAG 
GAGAGAAAAA NAANQAAAAG AG 

(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doiible 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 
CGGGCTGGGT ATTTGCCTCG CACCATGGCG CCCAAGGGCA AAGTGGGCAC GAGAGGGAAG 
AAGCAQATAT TTGAAGAGAA CAGAGAGACT CTGAAGTTCT ACCTGCGGAT CATACTGGGG 
GCCAATCCCA TTTACTGCCT TGTGACGTTG GTCTTCTITT ACTCATCTGC CTCATTTTGG 
GCCTQGTTGG CCTTGGGCTT TAGTCTGGCA GTGTATGGGG CCAGCTACCA CTCTATGAGC 
TCGATGGCAC GAGCAGCGTT CTTCTGAGGA TGGGGCCCTG ATGGATGGTG GCACGAGCTC 
AACATGGAGC AGGGCATQGC AGAGCACCTT AAGGATGTGA TCCTACTGAC AGCCATCGTG 
CAGGTGCTCA GCTGCTTCTC TCTCTATGTC TGGTCCTTCT GGCTTCTGGC TCCAGGCCQG 
GCCCTTTACC TCCTGTGGGT GAATGTGCTG GGCCCCTGGT TCACTGCAGA CAGTGGCACC 
CCAGCACCAG AGCACAATGA GAAACGGCAG CGCCGACAGG AGCGGCGGCA GATGAAGCGG 
TTATAGCCAT TGACATTGTG GCCACAGGCC ACTGGCCCTG GGTGGCTCTG TCAGGGTGCA 
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CAGCCCCTCA TCCXTTCGAGC AATGAGGGTC TAGTCCAGGG GCCAAAAGCA GTCPGAGGTA 
TTGGGTATAC TTATACTCTA TAGGGTCGTT GAATAAATGG CTTAGAATGT GAAAAAAAAA 

5 

AAAAAAAAAA ATTTT 



10 



20 



30 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 264: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 264: 

AAGTCCATGA GCTGCCGATG TXSGTGCTTAG TGATTGCGGT TTCGGTCGCT CTCCCGTGTT 60 

TCCCGGGCTG GGTATTTGCC TCGCACCATG GCGCCCAAGG GCAAAGTGGG CACGAGAGGG 120 

25 AAGAAGCAGA TATTTGAAGA GAACAGAGAG ACTCTGAAGT TCTACCTGOG GATCATACTG 180 

GGGGCCAATC CCATTTACTG CCTTGTGACG TTGGTCTTCT TTTACTCATC TGCCTCATTT 240 

TGGGCCTOGT TCGCCTGGGC TTTAGTCTGG CAGTGTATGG GGCCAGCTAC CACTCTATGA 300 

GCTCGATOGC ACGAGCAGCG TTCTCTGAGG ATGGGGCCCT GATGGATGGT GGCATGGACC 360 

TCAACATOGA GCAGGGCATG GCAGAGTGA6 TGTCCCCCAC CGCCAGCCCA GGCACCTTAA 420 

35 GGATCTGATC CTACTGACAG CCATCGTGCA GGTGCTCAGC TGCTTCTCTC TCTATGTCTG 480 

GTCCTTCTCG CTTCTCGCTC CAGGCCGGGC CCTTTACCTC CTGTGGGTGA ATGTGCTGQG 540 

CCCCraSTTC ACTGCAGACA GTGGCACCCC AGCACCAGAG CACAATGAGA AACGGCAGCG 600 

CCGACAGGAG CGGCGGCAGA TGAAGCGGTT ATAGCCATTG ACGATTnCGC SACNRGCCAC 660 

TGGCCCTCGG TGGCTCTGTC AGGGTGCACA GCCCCTCATG CCTGGAGCAA TGAGGGTCTA 720 

45 GTCCAGGGGC CAAAAGCAGT CTGAGGTATT GGGTATACTT ATACTCTATA QGGTCGTTGA 780 

ATA "^^3 



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LEWGTH: 1638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEI»5ESS : double 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 
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GGCACGAGGC GGCGGCAGCX3 GTGGCGGCGG CGCCCCCCX3G CGGGAGCCGT NCCCTTTCCC 60 

GTCGGGGAGC GCGGGGYCGG GGYCCAGGGG ANCCCGGGMC ACGGAGAGCG GGAAGAGGAT 120 

5 

GGATTGCCCG GCCCTCCCCC CCGGATQGAA GAAGGAGGAA GTGATCCGAA AATCTGGGCT 180 

AAGTQCTQGC AASAGCXamS TCTACTACTT CAGTCCAAGT GGTAAGAAGT TCAGAAGCAA 240 

10 GCXrrCAGTTG GCAAGGTACC TQGGAAATAC TGTTGA.TCTC AGCAGTTTTG ACTTCAGAAC 300 

TGGAAAGATG ATGCCTAGTA AATTACAGAA GAACAAACAG AGACTGCGAA ACGATCCTCT 360 

CAATCAAAAT AAGGGTAAAC CAGACTTGAA TACAACATTG CCAATTAGAC AAACAGCATC 420 

AATTTTCAAA CAACCGGTAA CCAAAGTCAC AAATCATCCT AGTAATAAAG TGAAATCAGA 480 

CrCACAACGA ATGAATGAAC AGCCACGTCA GCTTTTCTGG GAGAAGAGGC TACAAGGACT 540 

20 TAGTGCATCA GATGTAACAG AACAAATTAT AAAAACCATG GAACTACCCA AAGGTCTTCA 600 

AGGAGTTCGT CCAGGTAGCA ATGATGAGAC CCTTTTATCT GCTGTTGCCA GTGCTTTGCA 660 

CACAAGCTCT GCGCCAATCA CAGGGCAAGT CTCCGCTGCT GTGGAAAAGA ACCCTGCTGT 720 

1TOGCTTAAC ACATCTCAAC CCCTCTGCAA AGCTTTTATT GTCACAGATG AAGACATCAG 780 

GAAACAGGAA GAGCGACTAC AGCAAGTACG CAAGAAATTG GAAGAAGCAC TGATGGCAGA 840 

30 avrCTTGTCG CGAGCTGCTG ATACfiGAAGA GATOGATATT GAAATGGACA GTGGAGATGA 900 

AGCXTTAAGAA TATGATCAGG TAACTTTCGA CCGACTTTCC CCAAGAGAAA ATTCCTAGAA 960 

ATTCAACAAA AATGTTTCCA CTGGCTTTTG CCTGTAAGAA AAAAAATOTA CCCGAGCACA 1020 

TAGAGCTTTT TAATAGCACT AACCAATGCC TTTTTAGATG TATTTTTGAT GTATATATCT 1080 

ATTATTCAAA AAATCATGTT TATTTTGAGT CCTASGACTT AAAATTAGTC TTTTGTAATA 1140 

40 TCAAGCAGGA CCCTAAGATG AAGCTGAGCT TTTGATGCCA GGTGCAATCT ACTGGAAATG 1200 

TAGCACTTAC GTAAAACATT TGTTTCCCCC ACAGTTTTAA TAAGAACAGA TCAGGAATTC 1260 

TAAATAAATT TCCCAGTTAA AGATTATTGT GACTTCACTG TATATAAACA TATTTTTATA 1320 

CTTTATTCAA AGGGGACACC TGTACATTCT TCCATCRTCA CTGTAAAGAC AAATAAATGA 1380 

TTATATTCAC AGACTGATTG GAATTCTTTC TGTTGAAAAG CACACACAAT AAAGAACCXX: 1440 

50 TOGTTAGCCT TCCTCTGATT TACATTCAAC TCTGATCCCG GGGCTTTAGG TTTGACATGG 1500 

GAGGTGGGAG GAAGATAGCG CATATATTTG CAGTATGAAC TATTGCCTCT GQGACGTTGT 1560 

GAGGAATTGT GCTTTCACCA GAATTTCTAA GGATTTCTGG CTTAAATATC ACCTAGCCTG 1620 
TGGTAATTTT TTTTCCCT 



35 



45 



55 



1638 



60 
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(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14S5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

CGTCCGTACT GCCATGCAGG TACCGGGTCC GGAATTCCCA GGGTCGACCC ACGCGTCCGC 60 

TCA£?ITQGCA AGGTACCTGG GAAATACTGT TGATCTCAGC AGTTTTGACT TCAGAACTGG 120 

15 AAAGATGATG CCTAGTAAAT TACAGAAGAA CAAACAGAGA CTGCGAAACG ATCCTCTCAA 180 

TCAAAATAAG GGTAAACCAG ACTTGAATAC AACATTGCCA ATTAGACAAA CAGCATCAAT 240 

TTICAAACAA CCGGTAACCA AAGTCACAAA TCATCCTAGT AATAAAGTGA AATCAGACCC 300 

ACAACGAATG AATGAACAGC CACGTCAGCT TTTCTQGGAG AAGAGGCTAC AAGGACTTAG 360 

TX3CATCAGAT GTAACAGAAC AAATTATAAA AACCATGGAA CTACCCAAAG GTCTTCAAGG 420 

25 AC?nX3GTCCA GGTAGCAATG ATGAGACCCT TTTATCTGCT GTTGCCAGTG CTTTGCfiCflC 480 

AAGCTCTCCG CCAATCACAG GOCAAGTCTC CGCTGCTGTG GAAAAGAACC CTGCTGTTTG 540 

GCTTAACACA TCTCAACCCC TCTGCAAAGC TTTTATTGTC ACAGATGAAG ACATCAQGAA 600 

ACAGGAAGAG CGAGTACAGC AflGTACGCAA GAAATTGGAA GAAGCACTGA TGGCAGfiCAT 660 

CTTOTCGCGA GCTGCT6ATA CAGAAGAGAT GGATATTGAA ATGGACAGTG GAGATGAAGC 720 

35 CTAAGAATAT GATCAGGTAA CTTTCGACCG ACTTTCCCCA AGAGAAAATT CCTAGAAATT 780 

GAACAAAAAT GTTTCCACTC GCTTTTGCCT GTAAGAAAAA AAATGTACCC GAGCACATAG 840 

AGCTTTTTAA TAGCACTAAC CAATGCCTTT TTAGATGTAT TTTTGATGTA TATATCTATT 900 

ATTCAAAAAA TCATGTTTAT TTTGAGTCCT AOGACTTAAA ATTAGTCTTT TGTAATATCA 960 

AGCAGGACCC TAAGATGAAG CTGAGCTTTT GATGCCflGGTr GCAATCTACT GGAAATGTAG 1020 

45 CACTTACGTA AAACATTTGT TTCCCCCACA C3TTTTAATAA GAACAGATCA GGAATTCTAA 1080 

ATAAATTTCC CAGTTAAAGA TTATTGTGAC TTCACTGTAT ATAAACATAT nTTATACTT 1140 

TATTCAAAGG GGACACCTGT ACATTCTTCC ATCRTCACTG TAAAGACAAA TAAATGATTA 1200 

TATTCACAGA CTGATTGGAA TTCTTTCTGT TGAAAAGCAC ACACAATAAA GAACCCCTCG 1260 

TTAGCCTTCC TCTGATTTAC ATTCAACTCT GATCCCGGGG CCTTAGGTTT GACATGGGAG 1320 

55 GTGGGAGGAA GATAGCGCAT ATATTTGCAG TATGAACTAT TGCCTCTGGG ACGTTGTGAG 1380 

GAATTCTCCT TTCACCAGAA TTTCTAAGGA TTTCTGGCTT AAATATCACC TAGCCTGTGG 1440 

TAATTTTTTT TCCCT 1455 



30 



40 



50 
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(2) INFOKMATION FOR SEQ ID NO: 267: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: double 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 267: 

CGCCrcCAGT ACCGGTCCGG AATTCCCGGG TCGACCCACG CGTCGCTGAC CCAGGAGAAG 60 

CTCCCTGTCT ACATCAGCCT GGGCTGCAGC GCGCTGCCGC CGCGGGGCCG GCAGCTGAAC 120 

TATGTOCTCT TCAGGGCGGG CACCGTOTTG CATTCATCTT TGTACCCCCA GCATCTAGCA 180 

20 GTOrrOGCAT GTAGTAGGCA CTCAAGAAAT GTGTGTTGAA TGAACGATGC CTGTGACAAG 240 

CAAGCGGACT TTATTCTTTC CTCACCCTTG CTCCTATGAC ACACCTCCTC CTGACTGCCA 300 

CTCTCACTCC TTCAGAGCAG AACTCCTCTA GGGAACCTGG ATGQGAAACA GCCATGGCCA 360 

AGGACATCCT GGCTPGAAGCA GGGCTACACT TTGATGAACT GAACAAQCTG AGGGTGTTQG 420 

ACCCAGAGC?r TACCCAGCAG ACCATAGAGC TGAAGGAAGA GTGCAAAGAC TTrGTGGACA 



25 



35 



45 



480 



30 AAATTOGCCA GTTTCAGAAA ATAGTTGGTG GTTTAATTGA GCTTGTTGAT CAACTTGCAA 540 



AAGAAGCAGA AAATCAAAAG ATCAAGGCCA TCGGTGCTCG GAACTTGCTC AAATCTATAG 600 

CAAAGCAGAG AGAAGCTCAA CAGCAGCAAC TTCAAGCCCT AATAGCAGAA AAGAAAATGC 660 

AGCTAGAAAG GTATCGGGTT GAATATGAAG CTTTGTGTAA AGTAGAAGCA GAACAAAATG 720 

AATTTATTGA CCAATTTATT TTTCAGAAAT GAACTGAAAA TTTCGCTTTT ATAGTAGGAA 780 
40 GQCAAAACAA AAAAAAGCCT CTCAAAACCA AAAAAACCTC TGTAGCATTC CAGCGGCTTG 

accaatgacc tatotcacaa gaggtqgcgt gtaaggaatg cagccccctg aagacagcac 

TACAAOTCTG GGQGAGCCAG TTTTAACATC AGTGCACAGC TGCTGCTGGT GGCCCTGCAG 960 

topacgttct CACCTCTTAT GCTTAGTTGG AACTAAGCAG TITOTAAACT TTCATCCTTT 1020 

TTTTTCTAAA TTCACAAAGC TTTQGAAGGA GARGCAATAA ATTTTTGKTT TCNAAATGGC 1080 

50 ttgatg 



840 
900 



1086 



55 (2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEENESS: double 
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<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 268: 
GGCACGGGAG CAGCCX3GGCT GGTCCTGCTG CGAGCCQGCG GCCOGGAGTG QGGCGGOGGA 
GCAAACATGA ACGTTGGAGT TGCCCACAGT GAAGTGAATC CAAATAOCOG TGICATGAAC 
AGCCGOGGTA TGTQGCIGAC ATATGCATTG GGAGTIGQCT TOCTTCATAT TGTCTTACTC 
AGCATTCCCT TCTTCAGTGT TCCTGTTGCT TGGACTTTAA CAAATATTAT ACATAATCTG 
GGGATGTACG TATTTTTGCA TGCAGTGAAA GGAACACCTT TCGAAACTCC TGACCAGGGT 
AAAAGCAAGG CTCCTAACTC ATTGGGAACA ACTQGACTAT QGAGTACAGT TTACATCTTC 
ACGGAAGTTT TTCACAATTT CTCCAATAAT TCTATATTTT CTGGCAAGTT TCTATACGAA 
GTATGATCCA ACTCACTTCA TCCTAAACAC AGCTTCTCTC CTGAGTGTAC TAATTCCCAA 
AATGCCACAA CTACATGGTG TTCGGATCTT TGGAATTAAT AAGTATTGAA ATGTTTTGAA 
ACTCAAAAAA AATTTTACAG CTACTGAATT TCTTATAAGG AAQGAGTGGT TAGTAAACTG 
CACTGTTTCT CTGATAATGT GAAATGAGAA GTATTTACAT TGGAGGGCCA ATGGCTQGTC 
CTTCAAGTOC TOTTTTGAAG TGCAGATTTC CATTAAATGA TGCCTCTGTT TAATACACCT 
GQTACATTTC TGAAGAGGGG CTTTATAAGC AGGCTGGGCA GGCCCAGCTT ATAAGTTAAA 
GGGCATCACA GTGAGGGTGT AGTAGATAAA TTCAAQGAAA TAAGAGATTT GTAAGAAACT 
AQGACCAGCT TAACTTATAA TGAATGOGCA TTGTGTTAAG AAAAGAACAT TTCCAGTCAT 
TCAGCTGTGG TTATTTAAAG CAGACTTACA TGTAAACCX3G AATCCTCTCT ATACAAGTTT 
ATTAAAGATT ATTTTTATTA CCGTAAAAAA AAAAAAAAAA AAA 

(2) INFORMATIOI FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEraOESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
ATCAGCATCT ACAAGTAGCA TATTTTGGAT QGTGTTTGTG TGCTACTTCA AAGTAACTAG 
GAAAAAATAA TCCTCGCAAC ACAQGTACCT TGTCATQTCA GAATTGGGGG TGTrAQGTTG 
CCAGTTGTAT CAGTGTTGAT TCATTTCATT ACTTCCTACA GAGCAAACAT GAACGTTGGA 
GTTGCCCACA GTGAAGTGAA TCCAAATACC OGTGTCATGA ACAGCCGGGG TATGTGGCTG 
ACATATGCAT TGGGAGTTOG CITOCTTCAT ATTGTCrTAC TGAGCATTCC CTTCTTCAGT 
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(2) INFORMATION FOR SEQ ID NO: 270: 



50 



600 
660 



CTICCTOITC CITOGACTTT AACAAATATT ATACATAATC TGGGGATGTA CGTATTTTTG 360 

CATCCAGTGA AAGGAACACC TTTCGAAACT CXTTCSACCAGG CTAAAGCAAG GCTCCTAACT 420 

^ CATTOC3GAAC AACTCGACTA TGGAGTACAG TTTACATCTT CACGCSAAGTr rrTCACAATT 480 

TCTCCAATAA TTCTATATTr TCTOGCAAGT TTCTATACGA AGTATGATCC AACTCACTrC 540 
10 ATCCTAAACA CAGCTTCTCT CCTGAGTGTA CTAATTCCCA AAATGCCACA ACTACATGGT 
GTTCC3QATCT TTGGAATTAA TAAGTATIGA AATC3TTTTGA AACTC3AAAAA AAATTTTACA 

C3CTACTC5AAT TTCTTATAAG GAAGGAGTGG TTAGTAAACT GCACTGTTTC TSTGATAATG 720 

TGAAATGAGA AOTATTTACA TTGGAGGCSCC AATGGCTGGT CCTTCAACjrG CTGTOITCAA 780 

GTCCAGATTT CCATTAAATG ATGCCTCTGT TTAATACACC TGGTACATTT CTGAAGAGGG 840 
20 GCTTTATAAG CARGCTOGGC AGGCCCAGCT TATAAGTTAA AGGGCATCAC AGTGAGGGTG 900 
TAGTAGATAA ATTCAAGGAA ATAAGAGATT TGTAAGAAAC TAGGACCAGC TTAACTTATA 960 

ATCAATCGGC ATTOICTPAA GAAAAGAACA TTTOCAGTCA TTCAGCTGTG GTTATTTAAA 1020 

GCAGACTTAC ATCTAAACCG GAATCCTCTC TATACAAGTT TATTAAAGAT TATTTTTATT 1080 

AOCRTACATA TTTCKCnGT TTTATGTAAG YGGATGTATA TCCrClTOTT TTATACAAGC 1140 

30 CAGTTCCCAC TTATGMXGr ACTTTTTTGG TTITOCTGGG CTTAATATTG TGTATTGGTC 1200 
AATGAGGCCA TTTTTACANT TATTAACGTT ACAG 



1234 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTIC»I : SEQ ID NO: 270: 

NGAGGTGCGT TCTGAGCCGT CTCTCCTGCG CCAAGATGCT TCAAAGTATT ATTAAAAACA 60 

TAIXSGATCCC CATGAAGCCC TACTACACCA AAGTTTACCA GGAGATTTGG ATAGGAATGG 120 

GQCTCATGGG CTTCATCGTT TATAAAATCC GGGCTGCTGA TAAAAGAAGT AAGGCTTTGA 180 

AAGCTTCAGC GCCTGCTCCT GGICATCACT AACCAGATTT ACITGGAGTA CATGTGAAAG 240 

55 AAAACGICAG TCTGCCTGTA AATTTCAGCA AGCCGTGTTA GATGGGGAGC GTQGAACGTC 300 

ACTGTACACT TOTATAAGTA CCGTTTACTT CATGGCATGA ATAAATGGAT CTGTGAGATG 360 

CACTGCTACC K3OTACTCCT TICAGTGTOr TCCCCCTCAG CCCTCOGGCG TGTCAGGCAT 420 

60 
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ACTCTGAGTA GATAATTTGT CATGCAGCGC ATGCAATCAG AATCTCACTG AGCXACCCAT 480 

CATTGTGAAA TAATTACCTC AGTTGTACAG GACTTGGTGA TCAGC3ATCCA GGCACTCACT 540 

TGTATTCTAC TGCTCAATAA Aa3TTTATTA AACT 574 



25 



35 



45 



55 



10 (2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1731 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: double 

(D) T0P0IXX3Y: linear 

(Xi) SEQUENCE DESCRIPTIC^I: SEQ ID NO: 271: 

20 GCTGCAAGGT GCGCCTCGTG CCGCTGCAGA TCCAGCTCAC TACCCTGGGA AATCTTACAC 60 

CTTCAAGCAC TGTGTTTTTC TGCTGTGATA TGCAGGAAAG GTTCAGACCA GCCATCAAGT 120 

ATTTIX3GGGA TATTATTAGC GTGGGACAGA GATTGTTGCA AGGGGCCCGG ATTTTAGGAA 180 

TTCCTGTTAT TGTAACAGAA CAATACCCTA AAGGTCTTGG GAGCACGGTT CAAGAAATTG 240 

ATTTAACAGG TGTAAAACTG GTACTTCCAA AGACCAAGTT TTCAATGGTA TTACCAGAAG 300 

30 TAGAAGCGGC ATTAGCAGAG ATTCCCGGAG TCAGGAGTGT TCTATTATTT GGAGTAGAAA 360 

CTCATGTGTG CATCCAACAA ACTGCCCTGG AGCTAGTTGG CCGAQGAGTC GAGGTTCACA 420 

TTGTTOCTGA TGCCACCTCA TCAAGAAGCA TGATGGACAG GATGTTTGCC CTCGAGCGTC 480 

TCGCTCRARC CNGGGATCAT AGTGACCACG AGTGNAQGCT GTTCTGCTTC AGCTGGTAGC 540 

TGATAAGGAC CATCCAAAAT TCAAGGAAAT TCAGAATCTA ATTAAGGCGA GTGCTCCAGA 600 

40 GTCGGGTCTG CTTTCCAAAG TATAOGACAT TTGAAGAACT GGTATGCTAC TCACTGGTGA 660 

AQGACAGTCA GGTGAAGGAC TGTAAGCCCA CACAAGCTCT TCTTATCTCT ACTAGAATTA 720 

AAATGTTAAG TCAAAAACGG CTCCTTTTTT GCGCCTCCTA GTGAACTTAA CCAGCTAGAC 780 

CATTTGAGTA CCAGCATTTA GTTACAAACG TCAAAGGCTT CCGGTGCTGC TTACCTTCCT 840 

TTTTTX3TTAA TGTGCTTTTA TTTATTAAAA AAAATTACAA TGAAGATGCC TGTTTTGTCT 900 

50 CTACTGTGTA CTCTGATCGT ATCTTTCCAA AGTGCAGACT CTTGTGAAGT TTTCTTAAAT 960 

TCTTCACTTT AAAGAAAATG ACGTACCAAC AATGATTTGG CTTTTATATT ACTGTAAGAT 1020 

GTTATAATGT TAATGTGGAT GTAGTGCTTT TACTTTACAG ATTGATTGGA ATAAGATTAT 1080 

TGCATATGAA TTTACCCACA GGACTCTGAA TCATGTTACC CACTCCCCTC ACAATGTTGT 1140 

CCACTTAGTC AGTTGCATTG ATCTATCCGT ACCAAATGAT GTTGAATAAT TACATATCTT 1200 

60 TCTKGACTAT ACTGATTTCT TATTTTGGTC ACTATTACTA AATCTCTGTT AATATTCTCT 1260 
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CrrrTAACTC AAAAGQCSATC GCSATAGAftCG GTTTGCAATG CCATATTATT GGTGGAGGGC 

rormAACA TcirreAflOT ATOGcrrocr gaatatcttt accaacatct tgaatatata 

^ TTCTACHOTC CACAAGATTT AGCAAAAAGA TAAAGCTTGG GTC3GAATATC ATTTTAAAAT 
CTTCATOTK: TOTTCTATAT TTTCITCACC TACTCTCCAA ATATTOTAAT GCAAAAftGTC 
10 TCACTAATCA TITOGTACTA TTAATTriCT GGTCATTCTT TCTCTTCGAT AAATTTATTT 

TCATTAAATA CTTRTTAGAG GOTPTTSAAA TGimTCAA ATATCTGAAA TOPGAAACTC 1620 



CTG' 



HCTTTTA TATTAAAGTA ATTAAAGAAA ATOTATTOTG ATTGAAATTA TTTTGNCCTC 



1320 
1380 
1440 
1500 
1560 



1680 



CACAAGATGG CTCTArcAGT ATTCTTCCAG GGATTCTAAT ATTTATTTAA G 1731 



20 



(2) INFORMATION FOR SEQ ID NO: 272: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOIjOGY: linear 



30 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
CroCTTAGGA AGAGAAGOTC AGAOTTCGCG GGGGCAGAGG CATTCTTGCC GCTQGCCCAG 
TCACTATCTA GTOGAGGGGC AGACACCCTC CCGCAAATTC TQGAAGGTTC TTAGTCTCGA 
35 CTAGGGCAOT AGCCCAGGAC OOTTAGTCGC CGGCTTCAGG TCACTGCCGG CTCAACGGAG 

CTCCCGTCGC CATOmXSGC TOCTTOGTGG CGGGGAGGCT GGTOCAAACA GCTOCACAGC 240 
AAGTOGCAGA GGATAAATTT CTITITOACT TACCTGATTA TGAAAGTATC AACCATGTIG 

40 

TOcnrrrrAT gctoggaaca atcccattic ctcagggaat gggaggatct gtctactttt 

CTTATCCTCA TTCAAATCGA ATOCCAGTAT GGMAACTCCT AGGATTTGTC ACGAATQQGA 
45 AGCCAAGTCC CATCrrCAAA ATITCAGGTC rTAAATCTGG AGAAGGAAGC CAACATCCTT 
ITOGAGCCAT GAATATTCTC CGAACTCCAT CTGTTGCTCA GATTOGAATT TCAGTGGAAT 
TATTAGACAG TATOGCTCAG CAGACTCCTC TAGGTAATGC TGCTGTATCC TCAGCTGACT 
CATTCACTCA GTTCACACAA AAGATOTTCG ACAATTTCTA CAAnTTGCT TCATCATTTG 
CrorCTCICA QQCCCAGATC ACACCAAGCC CATCTGAAAT GTTCATrCCG GCAAATOTGG 
55 TTCTCCAAAT GGTATCAGGC ATOTrCTGTC TCCAATATTA AGGCnTTTA TAACTGAATA 
TCTATTITCT CTATCAATAT ATTCCrmT TGACATTTAA ACATATTCTT TTATTGTGAA 
CATCAGCACT GCATCCCATT AAAGTATCTA CTATAGAGAT CTGATGAGAA ACAGrTCTTA 

60 



60 
120 
180 



300 
360 
420 
480 
540 
600 
650 
720 
780 
840 
900 
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35 



CCXTTAAATAT TTTGTTATAT TOTCGCXATT ATGAATTTAT AAAGACAOSA AAATATAGTT 960 

GCrTATOTTT TAGGGACCAC TATTAAAGCT TATAAATATT TGTOrATTrT CATTTAGAAG 1020 

TACCATCTAT GAGAC3TAGTT TATACTGCAC TGTGfTACATG AATQGCTAAT GAATCTATTT 1080 

TCXAACTTK: CCCTGrrrTA TAGATATTTC TTTTCACrrT GAGTATCCTA GAGATGGGAG 1140 

GATOCCTAGG AASAGTTTGT TGAGAAGTGG TACCATG6T6 TAGCATOQGA GAQCATTGGG 1200 

AATOCACTAG OTTTCAATTT GGCATAATGG TAGCTATGTG ACCCTGAGCA AATTTCTCTC 1260 

ATCIGCTCAT CTCANGAATG AGGAAATAGG AGTGAATTTG ATNTTTCCTA GGTCCWTCTA 1320 

(2) INFORMATION FOR SEQ ID NO: 273: 



20 (i) SEQUENCE CHARACTERISTICS: 

(A) LQK3TH: 515 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
CCCTOGAGAG GGGCTGCTCT GCCAGCTTGG GGAGGGTCTG GGATGGGGCT GCCCCTGATG 60 
30 GCCCIGATGT GGAOTACCTT GCCAGCATCT GCTGGGGTGA ACTTTATITr AGCCCTTCCC 120 
TTOITCYTCT TATGAAGAAC AGAGGAGGGG TGGGCAGGTC AGTGATGTCA GCAGTGAGTA 180 
TTCCCAGCAC AGCGGCTCTC GAAGAGGCAT GAGGCATTTC TTTCAGGAAA TGRTCATTAT 240 
TCAGCCAGAA GGCATTCATT AAGTAAGTCC TGACTTTGTG CCCAGCTCTG TGTTATAGGC 300 
CCTTOGCGAG ACTCAGGAGG QGCARAQGAC GCTAGKTTKT AGWTAACACG GAACCTCARA 360 
40 GGWTATATGG TCCAAGAAGA CCCGGGQGCG GTGAAAACCC TGTGGACTAA TGCTCACGGG 420 
AGCCOGAGGT CACACTTTGA CTTTQCTACC ATGGGCTGTG TCTANGNACG TATATATGCT 480 
GCGTAATTAT TACAGAGGCA GTCCATGTGC ATTGT 515 

45 



50 



(2) INFORMATION FOR SEQ ID NO: 274: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2995 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

TGACACCCAT AAGGAATTCA TGAAGAAAGT AGAAGAAAAG CGAGTGGACG TTAACTCAGC 

60 



60 



PCT/US98/04493 

WO 98/39448 

soo 



ACTflGCCATO GCSftGAAGTCA TCCTCGCTCT CTOOCACCCC CaTTGCKTCA CAACCATCAA 
ACACTOCaTC ACCATCAICC GfleCTOGCTT CXaa3*C3GTC CTGftCATCQG CTAK5CK3CA 
CCAGC^GCOT CTWAAACGG CCriOTC^ 

ACnCTGGCA TCGATCCAC?r GGGCTCAGAC CaCCCTCATT CflGOQGGATC AGCSfiGOCAKT 
COCGCAGAAC ATTOACCGAG TTAAAGCCCT TATCCXTGAG CATCAGACAT WATGGAGGA 



120 
180 
240 
300 
360 



GATCACICGC AAACAGCCTC ACfflGGACCG QGTCACCAAG ACATACAAAA GGAAAAACAT 420 

480 

540 

600 

660 



AGftGCCTACr CACGCGCCPT TCATAGAGAA ATCCXXXaOC QGAGGCAGGA AATCCCTAAG 
15 TCAGCCAACC CCTOCTCCCA TCCCAATCCT TICACftGTCT GAAGCAAAAA ACCCACGGAT 
CAACCAGCrr TCTOCCCGCT GGCAGCAQGT GrrGGCTCTTA GCACTQGAGC OGCAAfiGGAA 
ACK3AATCAT GCXOTQGATC GQCTOGM5GA GITGAAAGAA TPTGCCAACT TreACTTTOA 
TOICreGAGG AAAAACTATA TOCCTIGGAT GAATCACAAA AAGTCTCGAG TGATGGKTTr 720 

780 

840 
900 
960 



CITCCQOCGC ATTOATAAGG ACCAGGATGG GAAGATAACA OGTCAGGACT TTATCGATGG 
25 CArmAGCA TCCAAOTTCC CCACCACCAA GCTAGBGATG ACTQCTGrrGG CrGACATTTT 
CGACCX5AGAT GGGGATOGTT ACATTCATTA TTATGAATIT OTOQCTQCTC TTCATCCCAA 
CAAGGA-reCG TATCGACCAA CAACXXyvTCC AGATAAAATC GAfiGATGAGG TTACAAGACA 
ASIGGCTCAG TOCAAAT^ CAAAAAGCTT ICAGOTGGAG CAGATCGGAG AGAATAAATA 1020 

1080 

1140 

1200 
1260 



CCQCTIdTC CTCGGCAATC AOTTIGGGGA TTCTCAGCAG TTGCGGCTGG TCCGTATICT 
35 GCQCAACCOT 6ATOCTIC0C CTTOGTOGAG GATGGATGGC CTTGGATGAA rrrTTAGTCA 
AAAATCAOCC CTCCCGSGCA CGAOCTAGAA CPAACATIGA ACTTAGAGAG AAATICATCC 
TACCfiGAQGG AGCATCCCAG GGAATCAOCC CCTTCCGCTC ACGGGGflCGA AGGTCCAAAC 
^ CATCnCCCG GGCAGCrrcC CCTACrcGTT CCAGCICCAG TGCTAGTCAG AGTAACCACA 1320 

1380 

1440 
1500 
1560 



GCICTACAIC CATOCXATOr TCICCAGCCA CCXXAGCCAG TGGAACCAAG OTTATCCCAT 
45 CAICAGCTAG CAAOITCAAA CGACXSACAC CAACTTTTCA TTCTAGTCGG ACATCCCTIG 
CTCGTCATAC CAGCAATTAG TlCrrCXXXXS GCCTCCaCAG GTCOCAAAAC TAATCGGGCA 
GACCCTAAAA AGICTCCCAG ■TCGCCCTGOG AGTCGGGCTG GGAGTCGAGC CGGGAGTCGA 
^ GCCAGCAGCC GQCGAGGAAG TOACGCTTCT GACTTPGACC TCTTAGAGAC GCATTGCTTG 1620 
ncCGACACT TCAGAAAGCA GCGCTOCAGG GGGCCAAGGC AACICCAGGA GAGGGCTAAA 1680 
55 CAAACCrrcC AAAATCCCAA CCAOCTCTAA GAAGACCACC ACTGCCTCCC CCAGGACrcC 1740 
MCTCCCAAG CGATAACACT CTCTAAGCAC CCCCAAGCCA CTATCCACTT TGAATCCTCC 1800 
TCCATACAW GGGTOTATAT rTATTCTCAA OQGGAGAAGT TATATTGTTA AAAGIGTAAA I860 

60 
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10 



15 



20 



25 



30 



35 



AGAATAATTG TGTTATGAAG CTGCCTTATT rX ' m ' XTiT TTGTAAGTTA CTATTTTCAT 1920 

GTOAATATTT ATCTAGATAA AATTTGCXnC CTGGTAACCC TOTAATGGAT GGC3GCCCAGA 1980 

AATGAAATAT TTGAGAAAAA CAAGTGAAAA GGTCAAGATA CAAATGTCTA TTAAAAAAAA 2040 

AAAAGCCTAT TAATAQGGTT TCTCCGCXX3T GCAQGGTTGT AAACCTGCTT TATCTTTTAG 2100 

GATTATTCCT AAATGCATCT TCTTTATAAA CTTGACTTGC TATCTCAGCA AGATAAATTA 2160 

TATTAAAAAA ATAAGAATCC TGCAGTGTTT AAGGAACTCT TTTTTTOTAA ATCACX3GACA 2220 

CCTCAATTAG CAAGAACTGA GGGGAGGGCT TTTTCCATTG TTTAATGTTT TGTCATTrTT 2280 

AGCTAAAGAG AGGGAACCTC ATCTAAGTAA CATTTQCACA TGGATACAGC AAAAGGAGTT 2340 

CATTGCAATA CTGTCTTTGG ATATTGTTTC AGTACTQGGT GTTTAAAGGA CAAATAGCTG 2400 

CTAGAATTCA GQQGTAAATG TAAGTGTTCA GAAAACGTCA GAACATTTGG GGTTTTAAAC 2460 

TGATTTOTTC CTCCXTTATCC AGCCTAGACA CXMTAACTC TTCICTTCAC CAGGACCCAG 2520 

ACCCTTGGCA AGGGATAGGC TOOTTGGTGA CATTGTGAAT TTCAGATTTG TTTTATCCAC 2580 

TTTTTTTGCT ATTTATTTAA ATGGTCGATC AACTTCCCAC AAACTGW3GA ATGAATICCA 2640 

CGAGCCnCTT CTGAAAATGT GGACXyTAAGA CAAACACGTG CTCGTCCTTT AATGGAGTTC 2700 

ACCAC3CACAC TTGTTAACCA GTCCTGTTTG CTTTCGTCTT TTTTTGTGCG TAATAAAGTC 2760 

AACTGACCAA GTGACCATGA AAAGGGGCTG TCTGGGGCTC CTGTTTTTTA GCTGCTGTTC 2820 

TTCAGCTCCG ACCATGTTGC TGTOrGATTA TCTCAATTGG TTTTAATTGA GGCAGAAACT 2880 

GAAGCTCTAC CAATGAACTG TTTAGAAACA AGACACACTT TTGTATTAAA ATTGCTTGCA 2940 

GTAACAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAACTCG AGGGGGGCCC GGTAC 2995 



40 



45 



50 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEIKESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 
GGGACCCGCG CGSCTCCCGG GGATGGTGAG CAAGGCGCTG CTGCNWOGTG TCTGCCGTCA 
ACCGCAGAGG ATGAAGCTGC TGCTQGGCAT CGCCTTGCTG GCCTACGTCG CCTCTGTTTG 
GGGCAACTTC GTTAATATGA GGTCTATOCA QGAAAATGGT GAACTAAAAA TTGAAAGCAA 
GATTCAAGAG AIGGTTGAAC CACTAAGAGA GAAAATCAGA GATTTAGAAA AAAGCTTTAC 
CCAGAAATAC CCACCAGTAA AGTTTTTATC AGAAAAQGAT CGGAAAAGAA TTTTGAWTAA 



60 
120 
180 
240 
300 
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CAGGAGCSCGC AfiKGrTCaiXS GGCTCCCATC TOAACTGACA AACTCATGAT GGACGGCCAC 360 
GAGCyrcaCCG TGOTCCSACAA TTTCrTCACG GQCAGGAAGA GAAACGTGGA GCACTGGATC 420 
^ GGACATCAGA ACPTCGAOTT GATTAACCAC GACXJTGTOGG AGCCCCTCTA CATCGAGGTT 480 
GACCAGATAT ACCATCTCGC ATCTCCAGCC TCCXXTTCCAA ACTACATGTA TAATCCTATC 540 
10 AAGACATTAA AGACCAATAC GATTQGGACA TTAAACATGT TGGGGCTQQC AAAACGAGTC 
CSGTOCCCGTC TQCTCCTGGC CTCCACATCG GAGGTGTATG GAGATCCTGA AGTCCACCCT 
CAAACTGAGG ATTACTCGGG CCACGTCAAT CCAATAGGAC CTOGGGOCTO CTACGATGAA 
GGCAAACGTG TTCCAGAGAC CATCTQCTAT GCCTACATGA AGCAGGAAGG CGTGGAAGTG 
CGAGTCGCCA GAA'TCTTCAA CACCTITCGO CCACGCATGC ACATGAACGA TGGGCGAGTA 840 
20 <?ICAGCAACT TCAICCrcCA GGCGCTCCAG GGGGAGCCAC TCACGGTATA CGGATCCGQG 
TCrCAGACAA GGGCGTrCCA GTACGTCAGC GATCTAGTGA ATGGCCTCGT GGCTCTCATG 
AACAGCAACG TCAGCAGCCC GGTCAACCTC OGGAACCCAG AAGAACACAC AATCCTAGAA 



25 

TOTOCTCAGT TAATTAAAAA CCrrGTTGGT AGCGGAAGTO AAATTCAGTT TCTCTCCGAA 



35 

ACITCATCGG ATOTATITPr GGCTmrTT TGTTGTCGIT TAAAGAAAGA CTTTAACAGG 



600 
660 
720 
780 



900 
960 
1020 
1080 

GCCCAGGATG ACCCACAGAA AAGAAAACCA GACATCAAAA AAGCAAAGCT GATGCTGGGG 1140 
30 TOGGAGCCCG TQGTCCCGCT GGAGGAAGGT TTAAACAAAG CAATTCACTA CTTCCGTAAA 
GAACTCGAGT ACCAGGCAAA TAATCAGTAC ATCCCCAAAC CAAAGCCTGC CAGAATAAAG 
AAAGGACGGA CTCGCCACAG CTCAACTCCT CACTTPrAGG ACACAAGACT ACCAITOTAC 



1200 
1260 
1320 
1380 

IXSTCA'raUVG AACAAACTCG AATrTCATTC TGAAGCTTGC TITAATGAAA TGGATGTGCC 1440 
40 TAAAAGCTCC CCTCAAAAAA CKSCAGATTT TGCCTTGCAC TTTTTGAATC TCTCTTTTTA 
TOTAAAATAG CGTAGATCCA TCTCTGCGTA VmCM^VT TrTTTATCTT GCTGTGAGAG 
CATATOriGT GACTCTCGTT GACAGTTTTA TTTACTGGTT TCITICTGAA GCTGAAAAGG 
AACAITAAGC GGGACAAAAA ATOCCGATTT TATTTATAAA AGTGGGTACT TAATAAATGA 1680 
GTCGTrATAC TATGCATAAA GAAAAAYCCT AGCAGTAITC TCAGffrGGTC GTCCGCCGGC 1740 
50 ATTCAirTTA GQGCAGATAA AAGAATTCTG TGTGAGAGCT TTATGTTrCT CTmAATTC 1800 

AGAGmrrc caaogtctac TrnxsAOirc caaacttcac tttgaaatat tccicttggt i860 

CATGATCAAG GATATTTCAA ATCACTACTG TGTTrTGCTG CGTATCTGQG GCGGGGGCAG 
GTTCGGGGGC ACAAAOTTAA CATATTCTTC GrTAACCATG GTTAAATATG CTATTTTAAT 



1500 
1560 
1620 



AAAATATTGA 

60 



1920 
1980 
1990 
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(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGV: linear 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 276: 

AACTTCGCTT AGCTCTCCAG GGTNAAACGG GTC5AGNCCTT AAAAACAGAA GAGAACAAGA 60 

15 TTTAAAGTCC GTTGCATTCA AAATAACAAA CAATATCAAT GTTTTAATCA AGGATCTCTT 120 

CCACATTCCT CCTTCTTATA AGAGCACAGT AACACTATCC TGGAAACCTG TACAAAAGGT 180 

TCAGATTOGG CAAAAGAGAG CCAGTGAAGA TACAACTTCA GGTTCACCAC CCAAGAAATC 240 

TTCAGCAQGA CCAAAAAGAG ATOCCAGGCA GATTTATAAC CCTCCCAGTG GGAAATATAG 300 
CAGCAATTTG GGCAACTTTA ATTATGAGCA GAGAGGAGCC TTCAGGGGAA GTAGAGGTGG 



GCATTCTTCA GCATTCTCAT GAGCTTAATA TACTTAAATT CTACTACTCA TTGGATTGCC 



360 



25 CCGAQGTIGG GGCACACGAG GAAATCGTAG TC0GG6AAGA CTCTACTGAA TAAGACATCA 420 



480 



GGGGATOTCC CTTTAAACAG ACTGCTGCCT TCAQCTAAAA ACTTAATGIT CTTTATACCT 540 



600 
660 



TTGTATOTAT GACCTACTTT TGTAACAGAC CATGGTTGTG TCCAAGGTAA AACCACAGTG 
ATATTTITOG ATOCTTTGTC TGCAATCTTG ACTTGTTTTT QCAGTATCAT TATTCAGACT 

35 TCAAATTOTG AATCTTTTAA ACATCTTGAT AATTTGTTGT TGAGAGCTGT TCATTCTAAA 720 

ATCTAATGAA ATTCAGTCTA GrTCTGCTGA TAAAGATCAT CAGTTTTGAA AGGTTACTGA 780 

TTrrCCTCTT CCCTCTTAGT TTTTTACCCA ATATATGGAG AAGAGTAATG GTCAATCTTA 840 

ACATTrreTT TTAATTCTTT AATAAAGCTG CTGGGCAGTG GTGCAGCATT CCTACCTAGT 900 

GTCATAAAAG CAAAATACTT ACATAGCTTT CITAAAATAT AGGAATGACA TTACATTTTT 960 

45 AGGAGAAACT AAQTTCCTTT GCACCGCCTA CTTAATTCTT TTCCATATAT TGTGATACAA 1020 

ACTTTTCAAT ATCGAATCTT ACTATTTGAA TAGAAATCTTG TATGTATAAT ATACATACAT 1080 

ACATAAGCAT ATATCTGTGT GTGTGTGTGT ATATATATAT ATATGCATGC TGTGAAACTT 1140 

GACTACACAA CATAAATCAC TTTTTAAATT CCAGGAACGG GTAGTCTGAC ACGGTGATTA 1200 

TCCTTITCAG GCTCAATCCG TTATTAACTT GTTATTTAGG TTTTTACTCC CAGTAGCAAG 1260 

55 GGATTCTAAG TTAGTTGCAC TTACATK^TT ATTGTTATTT AAAACTAAGA ATAAAGGCTG 1320 

CATTTTCAAA GATAAATTGG AATTGCTGTT GGTGAAATAA CAACCAAAAT ACTGAATCTG 1380 

ATCTACATAC AGGTTTCTAC AGGAAGAGAT GGTATAATTT ACAATTTGGA GATTTAATAA 1440 

60 



30 



35 



(2) INFOKMATION FOR SEQ ID NO: 277: 



W098/39448 PCr/US98«)441« 

504 

CCAGGGCTAC CCAC5AAAAAG TCRCTTCATA ACATGCjraCC AATAAGTAAG GGATCCTCTC 1500 
TCQOTITCCr TITOCCACrr TCAftGATrrr AACTTCTCAG GTTATTAATC AARMTATTG 1560 
5 TATAAGOTAG CCAATAGAAT TITTAGGTTA AAACftftCftGA TOGGGGOm GTGGAGIGTT 1620 
TAATOTCATC GCSCAnTITA CTAGCATAGA CCCTrPGTTC TGCATTTCAA TGnnTCGTAT 1680 

ATmrorrr cacagttaat cttccctccc caastttgct attcaaatca actocctgaa i740 

TCRCATTTCT ACTAffWrTOA TOTATTrrTC TCAC3GAATAG TTTGTGATTC CAATGCfiGGT 1800 

CTCTTCMTA CCATTACCTC TACACTOCAG AAGAAGCAAA ACTCCTTTAT TAGAATTACT 1860 

15 GCACATGIOT ATGCSGGAAAA TACTTCTCAA AC3QCTAGAAT GATACAAGTO AGCAAAMTT 1920 
GGTCAGCTK; GCTATOCSWyr gctqgcaata atctctaaac attccaaaag accatgsgct 

GAACCTAAAC TCCCTTOGAA TCTQAACAAA QGAATATAAA ATTGCCATTT GAAAACTGAC 

20 

CAGCTAATCT GGACCTCASA GATAGAOCAG CCAGTGQCCC AAAGCCATTT CAftGfTACftGA 2100 
AATTATAGAG ACTACAGCTA AATAAATTTG AACATPAAAT ATAATTTTAC CACTrTTTGT 2160 
25 CTTTATAAGC ATATITOrAA ACTCAGAACT GAGCAGAAGT GACTTTACTr TCTCAAGTTT 2220 
GATACTCACT TCACTOTTCC CTTATCCCTC ACCXTTOCCC TTCCCnTCC TAftOSCAATA 
GTCCACAACT TAGGTrATTT TTCCTTCCGA ATTTGAATGA AAAACTTAAT GCCATGGATT 

■ mrivm -T gcaagacacc tctttatcat cttgtttaaa tgtaaatgtc cccttatgct 

TTTGAAATAA ATTTCCTTTT GTAATTTTAA AAAAAA 



1980 
2040 



2280 
2340 
2400 
2436 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs 

(B) TYPE: nucleic acid 

(C) STI»MDBDNBSS: double 

(D) TOPOUXW: linear 

45 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 277: 
GCCACIGACT TCTCCCACCC TTCTGTCTCC CCCATAATAG TTTAnTOGT TGGTCTGGAC 
50 TCACTTOTOG CCTTrRATTA AATTCCTAA6 GQGCCTGAAG AAGACATTTC TACTQCAC3AG 
GCTTAGAGGC ACTTCAGCAA GGCCCCCACA TCCCAACTCT QGGAGTTGTG GTGGGAGGAG 
GCACTTCTGG GG6ATAGGAC CAGACAAGAT AACAQGAGCT CACATGGNAA GCAGAAGCTG 
TCACAAOTTT ACyTACTCCCA AAATCQGTTA TATCCCTTCC CCCTITACAT CAGAATCTTO 
TGAAAIGGGA AAACAACAGA AGGAGGGGAT CAAAGATAGC TGATCTCACA TGCTTCCCAG 360 
60 GCAGGGCARA GCrTOGGAGTC AAACCCGGGT GACAGGTGGG TGGAGAGCCC TGTTTGAGGT 420 



55 
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TOTGQCTCAT CCCTCTCTGG TATTAGinT TCCCCTC3GGA GCAGC3AAGCC CTAGGAAGAG 480 
GGGACTCCAG GC?rCCCCRCX5 C3GATCTTTCC TCCCTCCCCT GCATGAGGCA GAGGCAACCT 540 
GCCTCCCAAC (XCCTCCCTC AAGGAATGGC CTTGCCCAGG AATGCXXIACC ACACATACCC 

TCTTCTrrrr ttctagtcaa AcrcrrGTTT ATTccrrGGc ttgcctccct ccttcctccc 

10 CTCTCAACCT TTACTTCTGA TTTCTATTTC ATGGAATTTG GGATTGAAGT TAAACTACAA 720 
CAGTCCCGCC AACACCAAGT CTK3CAGGAA AAAAATACAA AGAAATTTAA CAAAAAAAAA 780 
AA 



15 



20 



30 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
25 (D) TOPOLOGY: linear 



600 
660 



782 



300 

360 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

GAGTTCCGC3C TCSGftGACCCG TGCTCTGGGC CGGCGCCTTC ACCATQGCCT CGGCAGAGCT 60 

GGACTACACC ATCGAGATCC CGGATCAGCC CTGCTOGAGC CAGAAGAACA GCCCCAGCCC 120 

AGGTGGGAAG GAGGCAGAAA CTCQGCAGCC TGTC5C3TGATT CTYTTGGGCT GGC3GTGQCTG 180 

35 CAAGGACAAG AACCTTGCCA AGTACAGTGC CATCTACCAC AAAAGGGGCT GCATCCTTAAT 240 
CCGATACACA GCCCCGTGGC ACATGGTCTT CTTCTCCGAG TCACTGGGTA TCCCTTCACT 
ITOTCTTTTG GCCCAGAAGC TGCTCGAGCT QCTCTTTGAT TATGAGATTG AGAAGGAGCC 

CCTCCTCTTC CATGTCTTCA GCAACGGTGG CGTCATGCTG TACCGCTACG TGCTCGAGCT 420 

CCTCCAGACC CCTCGCTTCT GCCGCCTGCG TGTGGTGGGC ACCATCTTTG ACAGCGCTCC 480 

45 TCGTCACAGC AACCTGGTAG GGGCTCTGCG GGCCCTQGCA GCCATCCTGG AGCGCCGGGC 540 

CGCCATCCTG CGCCTOTTCC TGCTGGTGGC CTTTGCCCTG GTGGTCGTCC TGTTCCACGT 600 

CCTGCITGCT CCCATCACAG CCCTCTTCCA CACCCACTTC TATGACAGGC TACAQGACGC 660 

GGGCTCTCGC TGGCCCGAGC TCTACCTCTA YTCGAGGGCT GACGAAGTAG TCCTGGCCAG 720 

AGACATAGAA CGCATGGTCG AGGCACGCCT GGCACGCCGG GTCCTGGCGC GTTCTGTGGA 780 

55 TTTCGTCTCA TCTGCACACG TCAGCCACCT CCGTGACTAC CCTACTTACT ACACAAGCCT 840 
CTCTOTCGAC TTCATGCGCA ACTGCGTCCG CTGCTGAGGC CATTGCTCCA TCTCAMCTCT 
GCTCCAGAAA TAAATGCCTC ACAMCTCCCC ACAAAAAAAA AAAAAAAAAA ACTCGAGGGG 

60 



900 
960 
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(2) INFORMATION FOR SEQ ID NO: 279: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1228 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANIXI3NESS : double 

(D) TOPOLOGY: linear 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 
CGCGCTTTGC AGTTCGOTCT CCTCSGTOTAC GGCCAACGCC AAGTAGGQGA TTGCGTTCCC 60 
TCCAC?rCGCA GCCCTA'TCAG ATTTGGATAT GTCCTTCATA TrTGATTGGA TTTACAGTGG 120 
20 TTICAGCAGT GTCCTACAGT TTTTAGGATT ATATAAGAAA ACTGGTAAAC TGGTATTTCT 180 
TC3GATTGGAT AATGCAGGAA AAACAACATT GCTACACATG CTAAAAGATG ACAGACTTGG 240 
ACAACATOTC CCAACATTAC ATCCCACTTC CGAAGAACTG ACCATTGCTG QCATGACGTT 300 

25 

TACAACnrr GATCTGGC?rc GACATOTTCA AGCTCGAAGA GTGTGGAAAA ACTACCTTCC 360 
TGCTATCAAT GQCATTCTAT TTCTGGTGGA TTGTGCAGAC CACGAAAQQC TOTTAGAGTC 420 
30 AAAAGAAGAA CTTCATTCAC TAATGACAGA TGAAACCATT GCTAATGTQC CTATACTGAT 480 
TCTTOQGAAT AAGATCGACA GACCTCAAGC CATCAGTGAA GAGAGGTPQC GAGAGATGTT 540 
TCGTTTATAT GOTCAGACAA CAQGAAAGGG GAGTATATCT CTGAAAGAAC TGAATGCCCG 

35 

ACCCTTAGAA GTTTrCATGT GTAGTGTGCT CAAAAGACAA GGTTACGGAG AAGGCTTCCG 
CTGGATGGCA CAGTACATTG ATTAACACAA ACTCACATTG GTTCCAGGTC TCAACGTTCA 720 
40 GGCTTACTCA GAGATTTGAT TGCTCAACAT GCATAACTTG AATTCAATAG ACTTTTGCTG 780 
GTTATAAAAC AGATCTTTTT TAGATTATTA ATATTAAATC AACTTAATTT GAATGAGAAT 
TCAAAACTCA TTCAAGTAAG TTTGAGTATC ACAATGTTAG CirrCTAATT CCATAAAAGT 

45 

ACTTOCTTTT TACAOTTTAT AATCTGACAT CACCCCAGCG CCATTTGTAA AGAGCAACTT 
TCCAGCAGTA CATTTGAAGC ACTTTTTAAC AACATGAAAC TATAAACCAT ATTTAAAAGC 
50 TCATCATOTT AAATTTTrTA TOTACTTTTC TGGAACTAGT TTTTAAATTT TAGATTATAT 1080 
GTCCACCTAT CKTAAGTCTA CAGTTAATAA TTAGCTTATT CAAIGATTGC ATGATGCCTT 1140 
ACAGTTTrCA ATAACTTTTT TTCTTAOGCA AACGTCATGC AATAAAACAA ACTCTAATGT 1200 
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(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1327 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEZ3NESS: double 

(D) TOPOLOGY: linear 



10 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 
TCTCGGGTCT CGGGACAGGT GAGCACCCTG ATGAAGGCCA CGGTCCTGAT GCQGCACCTG 60 
GGCGGGTCCA GGAGATCGTG GGCGCCCTCC GCAAGGGCGS CGGAGACCGG TTACAGC3TGA 120 
15 TTTCTCATTT TRACATGACC TTGAGCAGGT TTGCATATAA TGGAAAGCGA TGCCCTTCTT 180 
CTTACAATAT TCTGGATAAT AGCAAGATCA TCAGTGAGGA GTGTCGGAAA GAGCTCACAG 240 
CGCTCCTTCA CCACTATTAC CCAATTGAGA TCGACCCACA CCGGACCGTC AAGGAGAAGC 300 
TACCTCATAT GGTOGAATGG TQGACCAAAG CGCACAATCT CCTATC?rCAG CAGAAGATTC 360 
AGAAGnrCA GATAGCCCAG GTGGTTAGAG AGTCCAATGC AATGCTCAGG GAGQGATATA 420 
25 AGACCTTCTT CAACACACTC TACCATAACA ACATTCCCCT TTTCATCTTT TCTGCGQGCA 480 
TTCGTCATAT CCTCGAAGAA ATTATCCGAC AGATGAAAGT GTTCCACCCC AACATCCACA 540 
TCGTC3TCTAA CTACATGGAT TTTAATSAAG ATGGTTTTCT CCAGGGATTT AAGQGCCAGC 
TGATACACAC ATACAACAAG AACAGCTCTG TGTGTGAGAA CTGTC3GTTAC TTCCAGCAAC 
TTCAGGGCAA AACCAATCTC ATCCTGCTGG GAGACTCTAT CGGGGACCTC ACCATGGCCG 720 
35 ATCGGGTTCC TGGOCTGCAG AACATTCTCA AAATTGGCTT CCTGAATGAC AAGGTGGAGG 780 
AQCGGCGGGA NCGCTAACAT GGACTCCTAT GACATCGTGC TGGAGAAGGA CGAGACTCTG 
GATCTGOTCA ACGQGCTACT QCAGCACATC CTGTGCCNAG GGGGTCCAGC TGGAGATGCA 
AGGOCCCTOA AGGCGCAGGC TCCNAAGKCC SCTGCAGGCC GTGC3TGAGGA QGGGCGCCTC 



30 



40 



50 



600 
660 



840 
900 
960 



CCCAGACJrCT GCTCCCCCGT GAACACAGAG CAGAGCCAGG GTGGCCAQCA GTGGCTGGGT 1020 



45 CCTTCCQCGC CCCTCCGTCC TCCTTTCCCT GAGCACCTTC ATCACCAGAG GCTTGAAGGA 1080 

ACCCCGCCAT GTOQCAGGGC ACAGGCACTG TTCCTGGTGA ACCTTGGACC ACAQCATGTC 1140 

AGTGCTCTAG GGATTOTCTA CTCCAGGGAT TTTCTTCAAA ATTTTTAAAC ATGGGAACTT 1200 

CAAACAAATA TAATCTGTGA AACAGATCAA AATTTTTAAA ATGAAAAAAA AGCTGCTCTG 1260 

ATTCAGGGGA TGTGGGTCGG GGTAGAACCT QGACCTCTTG GCCTGGGGGC ACATGGGATG 1320 

55 CTTCTAG ^'^^'^ 



60 (2) INFORMATION FOR SEQ ID NO: 281: 



wo 98/39448 
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(i) sEQUE^x:n3 characteristics: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

5 (C) STOANDEDNESS: double 

(D) TOPOLCXY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

10 TCACCCTGCC TACAC3CGIX3G AGCTCAGATG ACTGCGCCCT CCACGGTCAC TGTGAGCAGG 60 

TOOTATTCAC AGCCTGCATC ACCCTCACGG CCAGCCCTGG GGTGTTCCCC GTCACTC3TGT 120 

GGcrrroGCT gaagcctaat tccacagctc cnmrm T gagagagact gagagaacca i80 

TAATCCTTGC CTCCTGAACC CAGCCTGGGC CTGGATGCTC TGTGAATACA TTATCTTCCG 240 

ATCTITCGCyrr ATTCCAGCCA AAGACATTTC AAGTGCCTGT AACTGATTTG TACATATTTA 300 

20 TAAAAA-TCTA TTCAGAAATT GGTCCAATAA TGCACGTGCT TTGCCCTGGG TACAGCCAGA 360 
GCCCTTCAAC CCCflCCTTOG ACTTGAGGAC CTACCTGATG GGACGTTTCC ACGTGTCTCT 
AGAGAAGGAT TCCTGGATCT AGCTGGTCAC GACGATCrTTT TCACCAAGGT CACAGGAGCA 

TOGCGrrCGCT GATOGGGITG AACTTTGGTr TGGnTCTTGT TTCAGCCCAA TATGTTAGAGA 540 

ACATTTGAAA CAGTCTGCAC CTTTCATACG GTATTGCATT TCCAAAQCCA CCAATCCATT 600 

30 TTCTOGATTT TATCTOTCTC TGGCTTAATA ATCATAGTAA CAACAATAAT ACCTTITrCT 660 

CCATTTTCCT TCCAGGAAAC ATACCTTAAG TmTTITCT TITOITrrTG TITmTGTr 720 

TTlTOITrrC CTTTATCAAG AAAAAATAAA ATAGTCACAT TTTTAATACY AAAAAATGGA 780 
CAAAAAAAGT CGAGGGGGG 
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(2) INFORMATION FOR SBQ ID NO: 282: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2196 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



50 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

AAAGACTCTA ACATCCATGA GCITCAACAT GAGCAAGAGC CTACTTGTGC CKSCCAGATG 60 

GCIGAGCCCT TCCGTACCTT CCGAGATQGA TGGC3TCTCCT ACTACAACCA GCCTGTGTTT 120 

55 CTGGCTGGCA TGGCyrCTTCC TTTCCTTTAT ATGACTGTCC TGGGCTTTGA CTGCATCACC 180 

ACAGGOTACG CCTACACTCA GGGACTGAGT GGTTCCATCC TCAGTATTTT GATGGGAGCA 240 

TCAGCTATAA CIGGAATAAT GGGAACTCTA QCTTTTACTT OGCTACGTCG AAAATGTGGT 300 

60 



509 



•nOGTTCGGA CAQCTCTCAT CTCAGGATTG GCACAGCTTT CCIGITTGJiT CTTGTGTGTG 
ATCTCTGTAT TCATGCCTGG AAGCCCXXTG GACTTC5TCCG TTTCTCCTrr TGAAGATATC 
CX3ATCAAGGT TCATTCAAGG AGAGTCAATT ACAOCTACCA AGATACCTGA AATTACAACT 
GAAATATACA TOTCTAATGG GTCTAATTCT GCTAATATTG TCCCGGAGAC AAGTCCTGAA 
TCTGTCCCCA TAATCTCTCT CAGTCTCCTG ITPQCAGGCG TCATTGCTGC TAGAAT0GC3T 
CnTGGTCCT TTCATTTAAC TGTGACACAG TTGCTGCAAG AAAATGTAAT TGAATCTGAA 
AGAGGCATTA TAAATGGTGT ACAGAACTCC ATGAACTATC TTCTTCATCT TCTQCATTTC 
ATCATOSTCA TCCTCGCTCC AAATCCTGAA GCTTTTGGCT TGCTCGTATT GATTTCAGTC 
TCCTTTGTGG CAAIXSGGCCA CATTATGTAT TTCCGATTTG CCCAAAATAC TCTGGGAAAC 
AAGCTCTTTG CTTCCXSGTCC TGATGCAAAA GAAGTTAGGA AGGAAAATCA AGCAAATACA 
TCrGTTOTTT GAGACAGTTT AACTGTTGCT ATCCTGTTAC TAGATTATAT AGAGCACATG 
TGCTTATTTT GTACTGCAGA ATTCCAATAA ATGGCTGGGT GTrrTGCTCT OmTTACCA 
CAGCTGTCCC TTCAGAACTA AAAGCTGTTT AGGAAACXTTA AGTCAGCAGA AATTAACTGA 
TTAATTTCCC TTATGTTCAG GCATGQAAAA AAAATTGGAA AAGAAAAACT CAGTTTAAAT 
ACGGAGACTA TAAOXSATAAC ACTGAATTCC CCTATTTCTC ATQAGTAGAT ACAATCTTAC 
GTAAAAGAGT GOTTAGTCAC GTGAATTCAG TTATCATTPG ACAGATTCTT ATCTGTACTA 
GAATTCAGAT ATOTCAGTTT TCTGCAAAAC TCACTCTICT TCAAGACTAG CTAATTTATT 
TTTTTTCCATC ITAGTTATTT TTAAAAACAA ATTCTTCAAG TATGAAGACT AAATTTTGAT 
AACTAATATT ATCCTTATTG ATCCTATTGA TCTTAAGGTA TTTACATGTA TGTGGAAAAA 
CAAAACACTT AACTAGAATT CTCTAATAAG GTTTATGGTT TAGCTTAAAG AGCACCTTTG 
TATTTTTATT ATCAGATCQG GCAACATATT GTATGAAGCA TATGTAQCAC TTCACAGCAT 
GOTTATCATG TAAGCTGCAG GTAGAAGCAA AGCTGTAAAG TAGATTTATC ACACAATGAC 
TGCATACAGA CTTCAAATAT GTCAATAGTT TGGTCATAGA ACCTAGAAGC CAAAAGCCAC 
ACAGAAGGGC AAGAATCCCA ATTTAACTCA TGTTATCATC ATTAGTGATC TGTGrTTGTAG 
AACATGAGGG TOTAAGCCTT CAGCCTCGCA AGTTACATGT AGAAAGCCCA CACITOTGAA 
GOTTTGTTT TACAAATCAC TTGATTTAAC ACACTCAGGT AGAATATTTT TATTTTTACT 
GnriATACC CAGAAGTTAT TTCTACATTG TTCTACAGCA AGAATATTCA TAAAAGTATC 
CCTTTCAAAT GCCTTTGAGA AGAATAGAAG AAAAAAAGTT TGTATATATT TTAAAAAATT 
GTTTTAAAAG TCAGTTTCCA ACATGTCTGT ACCAAGATGG TACTTTGCCT TAACCGTTTA 
TATGCACirr CATCGAGACT GCAATACGTT GCTATGAGCA CTTTCTTTAT CCTTGGAGTT 
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510 

TAATCCTTTG CTTCATCTTT CTACAGTATG ACATAATGAT TTGCTATGTT GTAAAATCTT 2160 

TOTAAAAAAT TTCTATATAA AATATTTGAA ACTTAA 2196 

5 

(2) INFORMATION FOR SEQ ID NO: 283: 

10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

GCAGTTAAGG CTTCTGATAA GGAAAGAGAG TCTGAACAGA GCACACACAT CTGGAGCTCC 60 

20 AGGAGTGGGG GATGCAGCAT CAGATTCCAT CTTGAATTTC TGCTAAAATA CTTTGTACTC 120 

ATAATQGATC TCAACAAAGA TCTGTATTTC ATCTGTGGCT CCATCTTCCC TCTGGGTCAA 180 

GTAGATCTTA AGCTGGACCT TQGCACQCCT CTTAACATGA AGAGATCTAG CTAGACAGAC 240 

AGACTCCCCC ATTTATGQGA ACAAGAATTC AATTTATTCT CTATTTATAA AACATTTTTT 300 

TAAAQTGCCT TGQQTATAAA AATCTAAATG TCTGCGGTGT GATCAGTCAG GAGCACGTAA 360 

30 CTATCACTCT TCGCATCCTT 'roGTCACrGG GAGATCCTTT GGGGGCTGGG AGGTCCTTCT 420 

GTCCCAGGCT AAAGGAAAAG CTTCACAAGG GTAAGAGCCA CAGAACCCTC GGCAAGAAAG 480 

GCCGGTCAGG GAGAATGAAT GGTACAGAGA GGAAAGGAAG GAAAQGGGGT GGftACAGAGG 540 

TAGAAGQCAA GGAAGGGATC CCGCACTGGA GACCGATGGG GACACTCTAA TTGTGCAAGA 600 

GGGAGGATCT TCCTTCTTCA ATGCTGAACA CAGCTAGTCT GAACCTTCCT TGGAAAGTCC 660 

40 AGCTGTTTGC CCATGCATAG GGCCAACTCT CCCTQCAAAG CAGCAAATGT GGCTTCTATC 720 

AGGAAGGAAA AGTATCCATC AGTGTGACAA GAGGTCACCT TCGAACTTGC ATGAACTCCT 780 

TOCGCAQCCA CAAAGAGTCC TCGTAGAAGT GAGGATCGCC TAGTCTTACG GCTGTCCGTT 840 

TATAGAAGTA GCAGTACAAC ACTGCTQCTA GTCTCTQGAA TACAAACAGC ATTTGAAGTC 900 

CATCTCTCCA TATCAAGCTG TTQGAGTTTT TCCAGCGTAA GTTCATGACC CAGACATGAA 960 

50 GGGAGATGCT GAGQGCAAAG TACACAGCTG TCAGGATGAT QGTCCCTrTG AACTTATQGA 1020 

ATAGGAGGTT GACCAQGCCA GCCTGGAAGA CGAAGGTGTT GAAGAACATG AGGAAAATGA 1080 

IGATGATGTT GAAGAGGACT GCAATATCCT GGATGCACTG AGQGAGAGGY TTCTAGTTCC 1140 

TTTGAATCAG AGCTGTTTCC CTTGCTCTAA GGCAAGCACC TCCAA 1185 
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(2) INFORMATIC»I FOR SEQ ID NO: 284: 

(1) SEQUETSCE CHARACTERISTICS: 

(A) LENGTH: 1634 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: doilble 

(D) T0P0LCX3Y: linear 

(xi) SEQUENCE DESCRIPTICN: SEQ ID NO: 284: 

AGGGAAAGGG GAGGGTAGCG GGAGQGTAGC AGGTGAGTTC CTAa3GCTGG AAGGTTTAGC 60 

AGCAGCCTGG TGCAGTGCCC TC3TCATCAAG ACAAACCCAC GGTCCTMCTG GGTGCCTACC 120 

AAGCTTGGTT TGTACAAAAG CAAGGTGGGA GTCTATTTTT GTACATGAGA TACATCACAC 180 

TTACCTGTGG GCCAGTATTG TGAAGTGAGT CTGAGTTGTT TACACTGATG CCTTCCCTGC 240 

CCACCACAAA TTGTGTACAT AGTCTTCAGA TGATACCACC CCTTTCCCCA GCTCCCAACC 300 

AAGAGCTGGT TCTAGGCCTG TGTTATATGT CATATTTAGC STTTTTATAT ATGACCTTTG 360 

ATTTCTOTTC TTTGTATTTT AGCACAGTGT ATGCACCTTC ATTTAAATAC ATCTGTGTGC 420 

ATACAGATAC GCATATATGT GTGTGCGTAT GCATATATCT CTCATCTGTA GTTTCCAAGA 480 

GTTCAGCTGA AGCAGATGGA GTCCTOCAGC CCAGGAGACA OCCTGCATCC CTGCTAATAG 540 

TOTTTCCCAC AAGTATTAGT GAGTCTTCCT TATTAATATT TTCATTTCAG AAGACTGAAG 600 

CAAAGCTCAT AGTGTTTGCT GTTTCTTTGG CAGCTAAGTG AGGGTCTTG6 GATGACTTGC 660 

TGTCTTCCTC AAGCTGCACT TTGGGGCCAT CTCTGCASTA TTAQCCCCCT TTTTGCTTGG 720 

TGGTACTCTC TCTGTGCCTG TGTGTGTGTG TGATAGTCAC TCTTGCATC3G CTTCCATGTC 780 

TGGTTTOTGG CATTTGGGGA TAAGGTGCTG AAGCCAGAGC ATTTGCAGTT TGTTTGAGGC 840 

CTCGTrCCCA A1X3ATAGATC ACTCCTGTTG ACCTGGTATG TCTGCTTCCT TGCTGCTTTT 900 

CCTTGCTTTC TCTTGGAAGA GGAAAGGACT CTGGTCAGGC CCAGGCTGAG TGAGATGAGC 960 

TCCAGCTOGC TCATOGCCTT CTTAGAGCAG AGAGAGGAGT ATGTCATTTT ACTAAGTTCC 1020 

TAAACAAACA TTTATGCAGG CAACACTCCT TGCAGATCCA GAAACTGAGG CACAATAGQG 1080 

TTATGACITC CTCAAGAATA TGTAGCTGCT AGGGGGTAAA TCAAGGCATC ACAATTTCTG 1140 

TTCAGCQGGC AQGAATAGGC TGTGAATTGC TAGCACTTTT TTTTTTTAAG CAATTACTTT 1200 

TTGACTTCTT CCTCTGAAAG TGCAAGAGGC GTACACCTTT CCCAAATGTA GACTAGAATC 1260 

TGCAGGATGC CACCCACTGT ATAGTTCTGC TTTCCCAGAG AGGAAGAACT TTTAGAAACC 1320 

AAATGATCTT AATTGTTATT GCCCACCCCT GGCTTTTCCG GGTAGAAAAT TCACAGTAGG 1380 

AATGATTGTT AAGAGAGAGT GCTTGGAACC ATGGGTTAAC AGGAAAGGCT ACCTAACTTC 1440 

ACATATCrcC AACCAGAGCA GCCACCAAGC ATTACTTAGC AGCAGGAAAA TGA1TGTATT 1500 
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CTOaacaCOG TCGCK:ACAC CTOTAA'TCCC WSCAYnrGG CWAGQCCSAG QQCQQGGOGG 1620 

1634 

5 KTTCACXX3C3G GGTC 



30 



10 12) INFORMATKSJ FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LaJGTH: 1795 base pairs 

(B) TYPE: nucleic acid 
J5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 
20 TKCCCCCAG CHTOBCTTCC rrCGATTC^ 

caaccccck: nrcrcAACc tcaataatca GcrcAcrcyre gaggagcagc t<x3ggcacag 120 

cnx:«XGrYAT«^TnOTTACCCCCCAAGACC«:^ 

«a«:CCICAGCCCAfiATCCTGCAGCAGCTTffl«n«^ ^40 
TrAGAGAAGCA(«CAT«5ATCC0C«GQGACCTQG«^ 

cccn«GACA riTAOGAcer octgaitok: cromccrc gccatatccc gcggcaccgc 360 

«»CTnnOG ACICGCa«5Y TOCCiccm 

TCATCccicAT«x:ccnxxrrG«nw.TGATan«^ 
a«,Gccmo GGGA^ ccmrcrrc 

OICC^CIX^AGCCCACCAGCTTCCAGCCGCAGCCCnCAAG^^ ^00 

SMGACriTC CWIGCKGG imAGGCCTC GTGCAGACTC ..3GAGGCCCG AAGTOAGAGG 720 
TOGACTOTOT GATCCCADCT CTOGAGCAAG CICTAGACGG ACAGCAGGAC ArKSGACCTC 780 
TAGAGCAAGA -rorCACTAGG A-^ACCKXIA CCCTCCn^G ACATGAA^ TCCATGGAGG 840 
GCCTGCriGGC TCAACAIGCT GAATOVTOIC CAACAAAACC CAGCCCCAAC TrrCTCTCT.5 
50 AOGCICCAOC ATIGGGGCAG GGGCA-TOGTO GCCCATCTAG TCTOCTOGGC CTCACCATCC 
CAGAAGAGGA G-IXX^AGCCA GCrCAGAGAA GGAACT^ 

TTAGCCCreG GCCTOGACCT COCTGOSATr TCCCACTCCT TTCTTAGTCT TCTICCAGAA 
55 _w-»>»iw. rrrp-r-TWTW CAGGACCTGT 1140 



40 



ACAGAGAAGG GGATOKSreC CTOGGAGAGG CTCTGTCrCC TTCCTGCTGC CAGGACCTGT 
GCCTAGACTT AGCATCCCCT TCACTCCAGT GTCAGGCCIT TAGATCGGAC CCAGCGAAAA 

60 T^^ccrr cigac^icaca tcaccgacac igagcagtog aaaggggcta ta-hjigtatg 
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AATAGACCAC ATTCAAGGAG CACAATGCCC TCCTGTGTTG ATGCCACTTC CCAGGGTGGA 1320 

GACAGTGGAA AAGAACCGAG GACAGGAAAG GATTGGGTAG GTGAAGQGGT CAGGGGACTG 1380 

^ GTAGTCACCC AATCTTGGAG AGGTGCAAAA AGCACTGGGG GCTACCCGTT AGCTGCATCT 1440 

GCCCTCGCTC ITTGCCCGTr CATGTCACAA ACTGCCACTA CTATGTACCT GCAGTGGGGT 1500 

10 TGCAGAGATC GGGGAGACTC AAGTCTTACT CCCCftGGAGC TCCCAGGGCC CAAGGAGGAG 1560 

AATGCTQCCT CCTTTCAGTC TGGTCTACAC CXrACTTTCTG GTAGCCTCTC TGCTTCCTGT 1620 

AATTCTGGCT OimTCCAG ACTCAGCTCA AATAGTGCCC CTCCTTAAGC CCATCCCTCG 1680 

CCCCCAGCCT GAGOIGATCT TTCCCTCCTC TGAACTATTA GAGCAGTTAC TGTCTGTTCA 1740 

CTTCCnrrGG CAGGCACACA CAGTGGCATA AAITCTATTG TTTTGAACTC TGATT 1795 
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(2) INFORMATION FOR SEQ ID NO: 286: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

30 

(xi) SEQUENCE DESCRIPTICM^: SEQ ID NO: 286: 
TCTGCTTTCG GTCCTGCGTC TACTGCTGGG CGGCTTCTTC GCGCTCGTGG GGTTOGCCAA 60 
35 GCTCTCGGAG GAGATCTCGG CTCCAGTTTC GGAGCGGATG AATGCCCTGT TCGTGCAGTT 120 
TGCTGAGGTG TTCCCGCTCA AGGTATTTGG CTACCAGCCA GATCCCCTGA ACTACCAAAT 180 
AGCTGTOQGC TTTCTGGAAC TGCTGGCTGG GrTGCTGCTG GTCATGGGCC CACCGATGCT 240 
GCAAGAGATC AGTAACTrGT TCTTGATTCT GCTCATGATG GGGGCTATCT TCACCTTGGC 300 
AGCTCTGAAA GAGTCACTAA GCACCTGTAT CCCAGCCATT GTCTGCCTOG GGTTCCTGCT 360 
45 GCTGCTCAAT GTCGGCCAGC TCTTAGCCCA GACTAAGAAG GTGGTCAGAC CCACTAGGAA 420 
GAAGACTCTA AGTACATTCA AGGAATCCTG GAAGTAGAGC ATCTCTGTCT CTTTATGCCA 480 
•K3CAGCTOTC ACAGCAGGAA CATOGTAGAA CACAGAOrCT ATCATCTTGT TACCAGTATA 540 
ATATCCAGGG TCAGCCAGTG TTGAAAGAGA CATTTTGTCT ACCTGGCACT GCTTTCTCTT 600 
riTAGCrrTA ctactctttt gtgaggagta CATGTTATGC ATATTAACAT TCCTCATGTC 660 
55 ATATGAAAAT ACAAAATAAG CAGAAAAGAA ATTTAAATCA ACCAAAATTC TGATGCCCCA 720 
AATAACCACT TTTAATCCCT TGGTGTAAGT ATACCTCTGA ACTTTTTTCT GTGCCITTAA 
ACAGATATAT ATTTTTTTTT AATGAAAATA AAACCATATA TCCTATTTTA TTTCCTCCTT 

60 
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TTAAAACCTT ATAAACTA 
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(2) INFORMATION FOR SEQ ID NO: 287: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 915 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: dotible 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 
GAATTCGGCA CGAOCGCGGC CATCGCGCTC CTGCTTTCGG TGCTGCGTC3T ACTGCTGGGC 60 

GGCTTCTTCG CGCTCGTCGG GTTGGCCAAG CTCTCGGAGG AGATCTCGGC TCCAGTTTCG 120 

20 GAGCGGATGA A'TCCCCTOTT CGTGCAGTTT GCTGAGGTGT TCCCGCTGAA GGTATTTGGC ISO 

TACCAGCCAG ATCCCCTGAA CTACCAAATA GCTGTGGGCT TTCTGGAACT GCTGGCTGGG 240 

TTGCTCCTGG TCATQGGCCC ACCGATGCTG CAAGAGATCA GTAACTTGTr CTTGATTCTG 300 

CTCATGATGG GGGCTATCTT CACCTTGGCA GCTCTCSAAAG AGTCACTAAG CACCTGTATC 360 

CCAGCCATTG TCTCCCTGQG GTTCCTGCTG CTGCTGAATG TCGC3CCAGCT CTTAGCCCAG 420 

30 ACTAAGAAGG TGGTCAGACC CACTAGGAAG AAGACTCTAA GTACATTCAA GGAATCCTGG 480 

AAGTAGAGCA TCTCTGTCTC TTTATGCCAT GCAGCTGTCA CAGCAGGAAC ATGGTAGAAC 540 

ACAGAGTCTA TCATCTTGTT ACCAGTATAA TATCCAGGGT CAGCCAGTGT TGAAAGAGAC 600 

35 

ATTTTCTCTA CCTOGCACTG CTTTCTCTTT TTAGCTTTAC TACTCTTTTG TGAGGAGTAC 660 

ATOTTATCCA TATTAACATT CCTCATGTCA TATGAAAATA CAAAATAAGC AGAAAAGAAA 720 

40 TTTAAATCAA CCAAAATTCT GATGCCCCAA ATAACCACTT TTAATGCCTT GGTGTAAGTA 780 
TACCTCTCAA CTTTTTTCTG TGCCTTTAAA CAGATATATA TTTTTTTTWA ATGAAAATAA 840 
AACCATATAT CCTATTTTAT TTCCTCCTTT TAAAACCTTA TAAACTATAA MAAAAAAAAA 900 
AAAAAAAAAA CTCGA 



45 



50 

(2) INFORMATION FOR SEQ ID NO: 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LH«3TH: 1517 base pairs 
55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 
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CCrTGTGGCA ACTAGTOGGT CCCCCGGGCT GCAGNAATTC GGGCAGTGGT TCTOJGTCTG 
AAGATACTCT GAGTTCCTCT GAGAGATCCA AAGGCTCCGG GAGCAGACCC CCAACCCCCA 
AAAGCAGCCC TCAGAAGACC AGGAAGAGCC CTCAGGTCSAC CAGGGGTAGC CX^TCAGAAGA 
CXAGCTOTAG COCTCAGAAG ACCAGGCAGA GCCXTICAGAC GCTGAAGCGG AGCCGAGTGA 
CCACCTCACT TCAAGCTTTG CCCAlCAGGAC AGTOCTTGACA GACAAGAGTG GGCX5ACAGTC 
GAAGCTCAAG TCCTTCCAGA CCAGGGACAA CXAGGGCATT CTCTATGAAG CTGCACCCAC 
CTCCACCCTC ACCTGTGACT CAGGACCACA GAAGCAAAAG TTCTCACTCA AACTGGATGC 
CAAGGAIGGG CGCTTGTTCA ATGAGCAGAA CTTCTTCCAG CGGGCCGCCA AGCCTCTGCA 
AGTCAACAAG TGGAAGAAGC TGTACTCGAC CCCACTGCTG GCCATCCCTA CCTGCATGGG 
TTrCGC?roTT CACCAGGACA AATACAGGTT CTTGGTGTTA CCCAGCCTGG GGAGGAGCCT 
TCAGTCGGCC CTO6ATGTCA GCCCAAAGCA TGTGCTGTGC AGAGAGGTCT GTGCTGCAGG 
TQGCCIGCCG GCTCCTGGAT GCXXTGGAGT TCCTCCATGA GAATGAGTAT GTTCATGGAA 
ATOIGACAGC TGAAAATATC TITOrGGATC CAGAGGACCA GAGTCAGQTG ACTTTGQCAG 
GCTATOGCTT CGCNTTCCGC TATTGCCCAA GTQGCAAACA CGTGGCCTAC GTQGAAGGCA 
GCAGGAGCCY TCAOGAQGQG GACCTTGAGT TTCATTAGCA TGGACCTGCA CAAGGGATQC 
GGGCCCTCCX: QCCGCRGYGA CCTCCAGAGC CTGGGyTAMT GCATGCTGAA GTGGYTCTAM 
GQGTTrCTGC CATCGACAAA TTGCCTTCCA AMAMTGAGGA CATCATGAAG CAAAAACAGA 
AGTTGCCTTG GGATTCATTT TAATGTAAGC TKGACTTTGT CATGCCAGAA ACAAGGCTCG 
GTCACCCjrCA GCAGTTTGCA GTTTTCCACC TCCWCCCAGT TCCTCCGTGT GGTTGACCCA 
GATATCTCCG TTAIGCAGCC GCCTCCGGGG GACCACCTCC CTCCCTTTGA GTCAGCCACA 
GACAGtXTAC TTCACGGCCC CQCTGGCCCC CACATTCCAC TGAACTGTGC GGATGCCACA 
GTCACXXCCT CTCAGGCACA GCATCACCTC CTGAAGTCGA GCCTGCTTGC TTTGAACCTA 
CCAGTTAAAA TCTCCICAAA ATGTTTGGAT ACCGCCCATT GQCCCCTCAC AGCCACGAGC 
TCCCTCACCA GIGTCCXTTGr GTOK3TGTGT GTGTGTCTGT GTGTGTQCTT GGGACGGGTG 
QQGAGGTCAC CTTTOQCnXST GCGGTCTGCC CCCAGGACCT GTAAGTAATA AAATCTTTAT 
TTCCAAAAAA AAAAAAA 
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{2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3865 base pairs 
60 (B) TYPE: nucleic acid 
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(C) STRANIMWESS: double 
<D) TOPOIiOGV: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 289: 
TCGRtSGGGGG GAGCTTCCrr GAGCAGKCK CCCMGOCTG GCCCTCCACA CrrCATTCTC 
TOACCmCr CTCICCTCAT TTCOGTCCAT GICCmCTG CflGCTGCCTT TCAGCACftGG 
TOCTTCCACT GGGGC3CMCT AftCCJCTCAGT GftCAAOGATG GCSAflGCCACA GCSTGCATnT 
j^CTCMBtCt TCtCrAGICA ATGftfiGQGCA CCCAGTOCTT CTAG0C3CAGG CTGC5SK»IO 
GTCCCCTAGG TATCKSCCTC ICTTACTCTA CTCTCCGCSGA ATCTTAftCCT TTCIATITTC 
AGCCTCTCCC ACCroiCTAG GCAAGCTC3GC ■PTCOCCATTG GCCCCTOTCG GTCCACfiGCA 
GCXJTOGCrSC CCCCCAQGGC CACCGCTTCT TICrrGATCC TCTTTCCTTA ACAGTCACTT 
C3C3GCrrGA£?r CTCCSCAflGGA ACCTTOCTTT TAGCTTCACC ACCAAGGAGA GAGGTTGACA 
TGACCTCCCC GCCCCCTCAC CAAGGCTCGG AACAGAGGGG ATGTGGTGAG AGCCAOSTTC 
CiCTOGCCCT CTCCAGG(?IG nTrcCACTA GTCACTACTG TCrrCTCCTT GTAGCTAATC 
AATCAATATT CTTCCCTTQC CTSTOGGCAG TOQGAGRGTG CTCCTQQGTG TACGCTOCAC 
CnkoCACTG AOTK3GQC5AA AGBGGKTAAT CAGTGAGCAC TCTTCTGCTC AGAGCTCCTG 
ATCTACCOCA CCCCCTAGGA TCCAGGACTG GCyiCAAAGCT GCATGAAAOC AOCSOCCTGGC 
AGCAACCCro GC3AATOQCTC GAGOTOQGAG AGAACCTGAC TTCTCrPTOC CTCTCCCTCC 
TCCAACATTA CTOGAACTCT ATCCTOTTAG GATCTTCTGA GCTTGOTrCC CTGCTGQGTC 
GGACAGAGGA CAAAGGAGAA GQGAGC3GTCT AGAAGAGGCA GCCCrrOTr OICCTCTOQG 
GTAAATGAGC rTCACCTAGA GTAAATOGAG AGACCAAAAG CCTCTCATTT TrAMTICCA 
TAAAATCTTA GAACTATATA TATACATATA TATATTTCTT TAAATPmO AGflCTTTOAT 
ATtaCTAAAA ATCCATTCCC TCTCCCCTGA AGCCTGAGTG AGACACATGA AGAAAACI^ 

CTrrCATTTA AAGATOTTAA TTAAATCA-PT GAAACTTGGC TGTQGCTACT GCTTCTTAAT 
CTIGQGGGGA CAGGGCAfflG GICTOGGOCC ACATTTAGAA GQGAAAATGT TITCCCTGCT 
GCACACATIG GACCCAACTA TCGGCCKTrr CTGCCTAGTA CTOCCAAAGG GACTGTTAAG 

GKjrcirorc catcttctac cccccacccc ccattacggg taaaggraac cccagactag 

<yiGAGGC3GCC AGCAGCTCCC TCACATTCTG TTCTCTCCTC AGATGGTCCA GCTCACATCC 
MftCACCTTG TTCAGACATT TTATTTCAAT TTATGACAGT GA-raGGGATT TGACTGAGAT 
GCCTTATOGA GAAGTACCCC ACCCTCTATG AAGACAGAAT CACTCTCTGC CATTCATTCT 
GCCTCA-TCCT AACAACACGC AGCTCATTTA GGGAGTGTCC CAGCCTAGCT GGATCAAGOG 
60 AAATICCAGG AGCCCTCGGG CAGGCCCTCG NCCOCAGTGC CAAGCCICAG AGTAAGCAGA 
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CATTOCXSAAA CTTCCCAACC ACITOCTAGA CCACTAGGTT CTCTC?ITrTC CXTTTCOCITr 

ccrrrrcAAA tcccacagtt tcltct t ggg gagaagctgt aattagccta gptccaggtac 

CAGAICCCAG CTAGGGGCGC AGCTCNCITC GATAACTCCA AGAAAACCTG GQCACCAGTA 
TrmCCAAT TATAAGGACT GTOGCATAAA TrTTTAAATG AGTTATATIG AAACCAGATT 
TCTCCAGCTC CCAAGGGAAG AAGGTAGGGC TOGACTCCXTT GCTGTGGCCC AGCCCITOTT 
AGGGOTTGGT CICTCACIGC AGCCAGACAG GATGATCCTG GGTTCTGGGG AGGGTAAGCT 
GCCCCPTGCC GAGTTCTCCA CCGAATAAAG AGTCCAAACC CGCTGCTTCC GTGTCCTGAG 
AGATOGCTAA ATCGCSTGATC GATGGAGCAG ACTC5AAGAGA CAGCAGATGA CTCAGTGC3TG 
GAAGAAGGGG GGAAGATGCT GGGCTGGCTA GCTAATGTTC CCCCCTTTCA GCGATTTACA 
GGAAATOSAG CCX^GCITCG TCATGAAGTT GGTTTGCTTC CACTGTGCXSA TGCACTCCTC 
AGAAATTITC AAGTCAGCCT GCAACTTCTC GAAGACTITC TTCTTGGGCT TGAGCTCCTC 
ATCTGGTTCG CCCTTPTCAT AGCCCTTCAC AAACACGTGC TCACCAGGAG CAGAGCCTGC 
CGGAGGCTTCC AGAGGTTCAA CTCGCGGrrTT ATCCCTTCTA TAGAAGCACA CAGAAGCATG 
CCTOX3GACT CGACTCCTCT CATCTTCTGG GGTTTCAGGT TGCACAGCAC CACTACCAGC 
CTOTCCIGCA Gn 'iXXri l XJT T GQGCACGAAC TGTACCAGGC CGCTCACCAC AGTCCGTGGT 
TCAGCTOXC CCACGTCAAT CTTCTCTACA TACAGGCTGT CTGCATCTGG GTlWrTCTCC 
ACAGTCATGA rmCCCXIAC ACGGATATCC AGCCGGGATG QGATGACCTC CTCTGGTTCT 
GAATTCTTGG CAGGCCmO GCCATTOQCT TCTGCTTTGA GGGATCTGGG TAGGCAGCGC 
TOGCXAGnr ITTCAGGGCA GQGGTATTAA ACTTTTCCCG GATTGGATCX: AGCAACTTGT 
TCAGTOCXSAC TTCAACAGAA TTCTTCAGGT CTCCAGGATG TACAACCTCA GCAGCAAAGT 

ccrrrrccAG gtccacgtaa gctgtotagg rmviTr cc accccatttc tcatctcgta 

GGA'TCACAAA CTCCGACrTA AGGGGAAAAA GGACATGCTT GATCAAGGAC AGAACCCCAT 

TGrrcTCCAc atttcctcgc tcacagaagg ccttcttcag Tmrrcrrc acatcctcct 

TCCGATCAAG GAGATCAATC itcgactcct cttctgaaga GCTCATTTTG CTGCCTGTTA 
ATCCTOGAAC CATAGGATTC ATCAGATGGA CCCGnTTGA ATAGCCAAGT GCAGGGAGGT 

AcrrcTCrec aaaggixsaaa ATCrmrrcr gatcaatgcc tccaaattgg GCATCTAcrr 

TTAAATACTC TTCATCCAAA GCCTCCAGTC CGGGGTATAA GAGQCCACTC AGCAAAGQGT 
GCTCCACCTC CTTTACCACC TCAGCTCCAG OTTCTTGGA ATCGTGCTGT GTGACCACGG 
AGGAGAC?rCT GTACACATCT AOTGTCTACT OTTTGCTGAG CTGGTAATCA GTGCCrrTGA 
•K5AACITCAG CTTCTCCAAG GGCACACCAA TGCTCTCCAG CATTGCITrG ATCACATTCT 
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CATMSTAACT GACICOGACT ICTfOMSn CCCATGC3C3GC TTTCATGrrA TCCAGGTATG 
CeroGAOOTC CGCAAACAGA ATTOrTACCT CACACCCTGC CPITAAGAAG TCTQCAATCT 
^ TTGACATOQG CACAAACTAA GCCACATXyTO OrTTGCCCGT GGTTGCCGCT CCCCACTAAA 

rrrrAAorrc coGcrccrrc agpatctcct tcagcttctc ttcccccaga acctccpgca 

10 GOITCCGCSOT GATAAOercC ACTTICTCTT CAGOGCTGGG AGCGICCCCC ATGSTCOGCT 

Accccrocrr cccccgctca GCcrascACC agagcooctt cctgqgtcac ogtcgcoqoc 

GCGTGCCGGG AACTGTCACG CGaCT 
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(2) INFORMATION FC» SEQ ID NO: 290: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: doxible 
25 (D) TOPOIiOGy: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 290: 

«3GGfiGAGGA GGAGAGC3QQG TCreCC3CC303 GCCGCTACCC AGAAGCCAGC GGACGGCAQC 

ACQC3AOIGGG CICTCCCCCSA GCCCAGCCCC GAGCGAGCCC CCCCCCCC3CC CCCGMAQGAC 

CSOGCCTVCCA GCCAGCCCGA dCCTAGGAG GAGGGCaGGC GGGARAGCAG CTCAAGCCTC 

35 ACCCACCGCC CroCCCCCAG CCCCGCCACT CCCAGGCTCC TCGGGACICG GCGGGTCCTC 

CTCGGACTCT CQC3AGC3GC3AC OGNCIGTGCA GACGCCATGG AfiTTCCJIGCT OGTCITCCTC 

TOCACCCreC TOGCCCCCAT QSIOTQGCC AGTC3CAGCTG AAAAGGAGAA GGAAATOSAC 

CCmro^TT ATOATTACCA GACCCI«AC» ATIOX^ 

rrcTCQOTPG GGATCCrcCT TATCCTAAC3T CGCAGGTOCA AGTQCA(3m CAATCAGAAG 
45 CCCCGGGCCC CAGGAGATCA GGAAGCCCAG GTOGAC5AACC TCATCACCGC CAATC3CAACA 
GAGCCCCAGA AAGCAGAGAA CTCAAGTGCA GCCATCAGGT GGAAGCCTCT GGAACCTC3AG 
GCCSGCroCTr GAACCTTTCG ATCCAAA-IXJT CGATGCTTAA C5AAAACCGGC CACTTCAGCA 
^ ACAGCCCTTT CCCCAGGAGA AGCCAAGAAC TrGTGTGtCC CCCACCCTAT CCCCTCTAAC 
ACCATTOCTC CACCTOATCA TCCAACTAAC ACITCSCCrCC CCACTGCAGC CTGCGGTCCT 
55 C3CCCACCTCC CGTOA-KJICT GIGTC^tOK^ GTGTGTGTCT GACTGTC3TC3T GTTTGCTAAC 

TOTCcyrcm GroocTAcrr gtitctcgat ggtattcjict TrcrrACTGA actgtggact 
cxscrrrcccA ggcagqggct gajsccacatc gccatctgct cctccctqcc cccgtcgccc 
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TCCATCACCT TCTCCTCCTA GGAGC3CTGCT TGTTGCCCGA GACXIAGCCCC CTCCCCTGAT 
TTAGGGATGC OTAOSGTAAG AGCACGGGCA GTOGTCTTCA <?rCGTCTTGG GACCTQC3GAA 
GOTPrcCAGC ACTTTCTCAT CATTCTTCAT GGACTCCTTT CACTCCTTTA ACAAAAACCT 
TGCTTCCTTA TCCCACCTGA TCCCAGTCTG AAGGTCTCTT AGCAACTGGA GATACAAAGC 
AAGGAGCTOG TGAGCCCAGC GTTCACGTCA GGCAGGCTAT GCCCTTCCCT GCHTAATITC 
TICCCAGGC3G CTTCCACGAG GAGTCCCCAT CTGCCCCGCC CCTTCACAGA GCGCCCGQQG 
ATTCCAGGCC CAGGGCTTCT ACTCTOCXXrC TC30GGAATC3T GTCCXTTGCA TATCTTCTCA 
15 GCAATAACTC CA'TOGGCTCT GGGACCCTAC CCCTTCCAAC CTTOXTTCSCT TCTGAGACTT 
CAATCTACAG CCCAGCTCAT CCAGATGCAG ACTACAGTCC CTGCAATTGG GTCTCTGGCA 
GGCAATAOTT GAAGGACTCC TOTTCCGTrG GGGCCAGCAC ACCQGGATGG ATGGAGGGAG 
AGCAGAGC3CC TXTOCTTCTC TCCCTACGTC CCCTTAGATG GGCAGCAGAG GCAACTCCCG 
CATCCITTGC TCTGCCTCTC GC3TCGTCAGA GCXX3TGAGCG AGGTGGC3TTG CSAGACTCAGC 
25 AGGCTCCCmS CAGCCCTTOG GAACAGTGAG AGGTTGAAOG TCATAACGAG AGTGGGAACT 
CAACOCAGAT CCCGCCCCTC CTCTCCTCTG TGTTCCCGCG GAAACCAACC AAACCGTGCX5 
CTOTCACCCA TTGCTOITCT CTGTATCGTG ATCTATCCTC AACAACAACA GAAAAAAGGA 
ATAAAATATC CTTTGTTTCM TAAAAAAAAA AAAAAAAAAA AGGQGGC3GGG 



20 



30 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1910 



35 



(2) INFORMATION FOR SEQ ID NO: 291: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3276 base pairs 
40 (B) TYPE: nucleic acid 

(C) STOANDEDNESS: double 

(D) TOPOLOGY: linear 



45 



50 



55 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 
GCGACCCTTCG TITCAGTCGT CGC1GCCGCT GCCGCTGCCA CTGCCACTGC CACCTCGCGG 
AlCAGGAGCC AGCOTJOTTC GCCCGACGCC TCGCTGCCGG TGGGAGGAAG CGAGAGGGAA 
GCCGCTTOCG QCTTrGTCGC CGCTGCTCGC CCACCGCCTG GAAGAGCCGA GCCCCGGCCC 
AGTCQCTCGC TTGCCACCGC TCGTAGCCGT TACCCGCGOG CCGCCACAGC CGCCGGCCGG 
GAGAGGCGCG CGCCATGGCT TCTOGAGCCG ATTCAAAAGG TGATGACCTA TCAACAGCCA 
TTCTCAAACA GAAGAACCGT CCCAATCGGT TAATTGTTGA TGAAGCCATC AATGAGGACA 
ACAGTCTCOT GTCCTTOTCC CAGCCCAAGA TQGATGAATT GCAGTTGTTC CGAGGTGACA 
CACnOTTCCT GAAAGGAAAG AAGAGACGAG AAGCTGTTTG CATCGTCCTT TCTGATGATA 
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CTTOTTCTGA TGAGAAGMT CX3GATGAATA GAGTPGTTCG GAATAACCTT CGTGTACGCC 
TAGGGC3ATOT CATCAGCATC CAGCCATGCC CTGATGTCAA GTACGGCAAA CGTATCCATG 
TCCTOCCCAT TCATCACACA (?raGAAGGCA TTACTOGTAA TCTCTTCGAG GTATACCTTA 
AGCCC?rACTT CCTOC3AAGCG TATCGACCCA TCCGGAAAGG AGACATTTTT CTTCTCCGTG 
GTOGGArrcCG TOCTOTOGAG TTCAAAGTGG TGGAAACAGA TCCTAGCCCT TATTGCAriG 
TIGCTCCAGA CACACTGATC CACTCCGAAG GGGftGCCTAT CAAACX3AGAG GATGAGGAAG 
AGTCCriGAA TCAAGTAQGG TATCATGACA TTGGTGGCTG CAGGAAGCAG CTAGCTCAGA 
TAAAGGAGAT GOrGGAACTG CCCCIGAGAC ATCCTGCCCT CTTTAAGGCA ATrGGICTGA 
AGCCrcCTAG AGGAATCCTC CTTTACGGAC CTCCTGGAAC AGGAAAGACC CTGATTGCTC 
GAGCTCTAGC AAATCAGACT GGAGCCTTCT TCTTCTTGAT CAATGGTCCT GAGATCATGA 
GCAAATTOGC TGC?IGAGTCT GAGAGCAACC TTCGTAAAGC CTTTGAGGAG GCTGAGAAGA 
ATCCTCCTOC CATCATCTTC ATIGATCAGC TAGATGCCAT CGCTCCCAAA AGAGAGAAAA 
CTCATCGCGA GGTGGAGCGG CGCATTOTAT CACAGTTGTT GACCCTCATG GATGGCCTAA 
AGCAGAGGGC ACA-KTreATT GTTATOGCAG CAACCAACAG ACCCAACAGC ATTGACCCAG 
CTCTACGGCG ATrTCC?rCGC TTTOACAGGG AGGTAGATAT TGGAATTCCT GATGCTACAG 
CSACGCTTAGA GAnXTTTCAG ATCCATACCA AGAACATGAA GCTGGCftGAT GATGTOGACC 
TGGAACAOTA GCCAATCftGA CTCACGGGCA TCTGGGTGCT GACTTAGCAG CCCTGTGCTC 
AGAGGCTGCT C1GCAAGCCA TCCGCAAGAA GATCGATCTC ATTGACCTAG AGGATCAGAC 
CAITCA-PSCC GAGGTCATCA ACTCTCTAGC AGTTACTATG GATGACTTCC GGTQGGCCTT 
GAGCCAGAGT AACCCATCAG CACTCCGGGA AACCGTGGTA GAGGTCCCAC AGGTAACCTG 
GGAAGACATC GGGGGCCTAG AGGATGTCAA ACGTGAGCTA CAGGAGCTGG TCCAGTATCC 
TGTGGAGCAC CCAGACAAAT TCCTCAAGTT TGGCATGACA CCTTCCAAGG GAGTrCTGTT 
CTATOGACCT CCTGGCTCTC GGAAAACTTT GTTGGCCAAA GCCATTGCTA ATGAATGCCA 
GGCCAACTTC ATCTCCATCA AGGGTCCTGA GCTGCTCACC ATGTGGTTTG GGGAGTCTGA 
GGCCAATGTC AGAGAAATCT TIGACAAGGC CCGCCAAGCT GCCCCCTGTG TGCTATTCTT 
TCATGAGCTG GATTCGATTG CCAAGGCTCG TGGAGGTAAC ATTGGAGATG GTGGTGGGGC 
TCCTCACCGA GTCATCAACC AGATCCIGAC AGAAATGGAT GGCATGTCCA CAAAAAAAAA 
TOIGTTCATC ATIGGCGCTA CCAACCGGCC TGACATCATT GATCCTGCCA TCCTCAGACC 
TCGCCGTCTT GATCAGCTCA ■TCTACATCCC ACITCCTGAT GAGAAGTCCC GTGTTGCCAT 
60 CCTCAAGGCT AACCTGCGCA AGTCCCCAGT TGCCAAGGAT GTGGACTTGG AGTTCCTGGC 
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TAAAA-TCACT AATOGOTTCT COGGAGCTCA CCTGACAGAG ATITCCCAGC CnXSCTTGCAA 2340 

GCTCGCCATC CGTCAATCCA ICGAGAGTCA GATTAGGCXSA GAACX3AGAGA GGCAGACAAA 2400 

^ CCCATCAGCC ATCGAGGTAG AAGAGGATCA TCCAGTGCCT CSAGATCOSTC GAGATCACTT 2460 

TCAAGAAGCC A'TCCGCTTTG CGCGCCGTTC TGTCAGTGAC AATGACATTC GGAAC3TATGA 2520 

10 GATOTTTCCC CAGACCCTTC AC5CAGAGTCG GGGCTTTGGC AGCTTCAGAT TCCCTTCAGG 2580 

GAACCAGGGT GGAGCK3GCC CrACyiCAGGG CAGTGGAGGC GGCACAGGTG GCAGTGTATA 2640 

CACACiAAGAC AATCATGATG ACCTCTATGG CTAAGTQGTG GTCCCO^ TGCAGTGAGC 2700 

TCGCCTOCCT GGACCTTOTT CCCTOGGGGT GGGGGCGCTT GCCCAGGAGA GGGACCAGGG 2760 
CTGCGCCCAC AGCCTGCTCC ATTCTCCAGT CTGAACAGTT CAGCTACAGT CTGACTCTGG 
20 ACAGGGGG1T TCIGTTGCAA AAATACAAAA CAAAAGC3GAT AAAATAAAAG CGATTTTCAT 

TTOGTAGGCG GAGAGTGAAT TACCAACAGG GAATTGGGCC TTGGGCTATG CCATTTCTGT 2940 

TOTAOrrrGG GGCAGTGCAG GGGACCTGTG TGGGGTGrrGA ACCAAGGCAC TACTQCCACC 3000 

25 

TGCCACAGTA AAGCATCTGC ACTTCACTCA ATGCTGCCCG AGCCCTCCCT TCCCCCTATC 3060 

CAACCTGGGT AGGTCGGTAG GGGCCACAGT TGCTGGATGT TTATATAGAG AGTAGGTTGA 3120 

30 TTTATnTAC ATCCTTTTGA GTTAATGTTG GAAAACTAAT CACAAGCAGT TTCTAAACCA 3180 

AAAAATGACA TOTK3TAAAA GGACAATAAA CGTTGGGTCN AAATGGGWRA AAAAAAAAAA 3240 
AAAAAAQGGG GGCXXCTCTA AAGNNCCANN CTTCGT 
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(2) INFORMATION FOR SEQ ID NO: 292: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
45 (D) TOPOIXX3Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

TTCCAATGGT TGAATTCCCC TCCTCACGCC AGCCTAGGAG AAGAAGTTCG TAGTCCCAGA 60 

GOTGAGGCAG GAGGCGGCAG TTTCTGGCGG GTGAGGGCGG AGCTGAAGTG ACAGCGGAGG 120 

CQGAAGCAAC GGTCGGTCGG GCGGAGAAGG GGGCTGGCCC CAGGAGGAGG AGGAAACCCT 180 

55 TCCGAGAAAA CAGCAACAAG CTCAGCTGCT GTGACAGAGG GGAACAAGAT GGCGGCGCCG 240 

AAGGGAGCCT CTGGGTGAGG ACCCAACTCG GGCTCCCGCC GCTGCTGCTG CTGACCATGG 300 

CCTTCGCCGG AGGTTCGGGG ACCGCTTCGG CTGAAGCATT TGACTCGGTC TTGGGTGATA 360 
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coGCorcPTG ccACOGOXc TOicASi^ ccr;«xxrrr go^cacctac octaaogaag 
;^cn^A CGCA^^ AG«»nGCA c«ci«rmc «.t™tc«5 Tr«ncG.«. 
A^x^Ti^ crrA;^«:GA actamtiog «vt«igaatc TOOvaerAO. G^AGC^TAT^ 

COCAA«n«V aCAGOATAT GCrreCOVTC n^BTBSCCA GAATC«GCTO OCAnOGCTC 

;,«:,««acA AOAACAAcrr Aixnmnx^ t^caaaaat gcacctactc rmr«rPAA 

„,,,«H«AG CnCATTCICG AOI«ACA«A tOOACICCCC ACAGAGCnC ATAACCTCTT 

ovTOGAcm rmicncAA gccgatg^cg gaaaaatast tatattcc^ tctaagcc«. 

;^TCC«(^A CGCACCACAT r«X.«^ 

GCAAAA1X7IC CTA«n«CAA AT«AGAAATT CACAAfiOGCA CAGC^Trrr CH^AAGATG 

c-i^cT c«x»«;ato c^in«:i^ ^ 

,«;«3CAC^;. «n.iccc«:T <»gaagcica otatotat^; ^.ac^gag TrrA«AA«5 

;,;,CAAAAGCT AAACAGATAT CCAGCTKHT CTCrTOTGGT ,<jrTAG«rcr AAAACTGAAG 
;.^^GA AGCAGGGCCT CTACCTACAA AAG,«AATCT IGCTCATTCT OAAATTI^ 

CATrmcrr ttaaaagaca agi^aatag acatctaaaa ticcactcct catagagctt 



rrAAAA^;.^ T^^GAT ATAGOCCTTA ACAAA^CT ATAAAA^ AATAAAOTTA 

arc^.^ .^AAGACi^ ArnocTATA ACTrrArKK. ta^i^ <^^™ 

35 AfiGAGCnOGA a«m««AT T^rATTArrA rmACTAAT ATCnTTAGCr ATm^ 

™,em«^ TAnarrrrr n«xrrrm: ttagctatga acx«AaK:Arr oc^ccn^ 
,a:,ccr«x: atoataci^ cach^aoctt Acnn-AACAAO ctgaatattt agtaoaaa^ 

„AA,OACAC TACATrmrA GGAACI^AAA OCATTAAAAT rrTATTOOAA TAATTAAAAA 
45 AAAAAAAAAA AANCT 



1380 
1440 
1500 

1560 



1680 
1695 



50 (2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1601 base pairs 

(B) TYPE: nucleic acid 
-c (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

60 cAcn^c. ..--rm^: 
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AACMAGftGAG AGACTOITCr CTTOTAAAAC TCriTCAAAA ATTCTGATAT GCTAAGGTAC 
40 riGAGACCCT TCACCAGAAT GTCAATCTTT TXTTCTOrGT AACATGGAAA OTrGTGTGAC 

CArrAGCArr gttatcagct totactcgtc tcataactct GGrrrrrGGAA gaataatitc 

GAAATTCTTC CTGTGTTCTG TGAAAATAAC CTCCCCAAAA TAATTAGTAA CraGTKTTrC 
TACnXSGTAA TITCACACCC TOTTAATAAC GCAATTATTT CTGTCTrTCTT AAACAGTATA 
AATACyiTOTA ACTITOCATC CATXSATOGAA AAATAAAAAC CTGTATCTCT GTTAAAAAAA 

50 A 



120 
180 
240 
300 
360 
420 
480 



TAGTCrCCAA TATCGAGCAT CTCAAGCTTC TCCTGGGGGA TGGGGATTGG GATGGGCAGA 
ATCTCTrrTC GWTCTCCGGG TTATTTCCAG TGGGTGTAAA AGCAGAGCTG GGCCrTTCCC 
^ TCTCTTATCC CIGAGGOrGG GTAAGAAGGA CIGTATCTAC ACCTGTTCTT CCCTACCTTC 
TCmTOTTA GGGAGGCCTC ATTCTAAGTT CCTCAAGAGA GTCCTTGGCT TAAAGCTGTA 
10 GCAAGQOTOT GCTAGGTCGG QGATITOGAG CAAAACCGTC GAGTAGGCAT GATACTCGTA 
OaaOTOGGC CTOCAAAATC AGACAGAAAT GGCTTGAGAA GCCGCAGGQG AGCATGCCTG 
TCTCTCACTG ATAGAGTATC GGAGGGACCT CCCTAGCTTG GAAAATGAGA ATTGAAGGGG 
TTATCAACAA ATAGGATCCC TAOTTCAGGA TGTTCCCAAA GTITrGTCCA ATCTTATCAT 540 
TAGTAGATTT TATAAGCXAC AGAGACAAAC CAGAAACGGA ATAATGTTAC TrTGGATGCT 600 

20 TTATrrrrrr GTrcvMsatG TCGcmCTA catgcagaag aatgctatat gctgcacatt 660 

ITCCCTTTAA ACTICTTACGA CTITCCCCAT rTTAGTCTAA TGQGAAGATA CAGATOIGCA 720 

A(yixrrcCTrT rrrcrrrrrr GrrATTATTT rnrmTrr gctctgtgtt atggacattt 780 

25 

TCAGACATGC acagaagt:x3g agaggatggt ccttggaccc matgtgtcca tcacctagct 
gcatcactta tcagctatog tcaacctcgt ttcatctgta tctctctctt ttcacctgta 

30 TTOnTAITC AAAATCCAAG ACACTATCCC AATGCAACCG TGACTACrrr gggagattgg 
TAOTCTCTTT TGATOGTCAT AOTX^TGGGG TGCACTATCA TAATCACATC AGGTCTGCTT 

CTiGcrrrrA atottaacta atoaagitcc agagatgggc cttagaaatg TGnrrAAGA 

ATTAACAAGG A^JTCTCAAAA AGAAATCAGA GGGATGCTTC CrrTOCCCTT GCATCTACAA 1140 

1200 

1260 

1320 

1380 

1440 

1500 

1501 



840 
900 

960 
1020 
1080 



55 (2) INFORMATION FOR SEQ ID NO: 294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2683 base pairs 

(B) TYPE; nucleic acid 
50 (C) STRANDEDNESS: double 
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(D) TOPOLOGV: linear 
(xi) SBQOENCE DESCRIPTICW : SEQ ID NO: 294: 
TCMmnOCJP CCXS3C3(SreCN GATreOCACN C3CCrCCC3COG CGGCTCGTGG TTGTCCCQCC 

A-reocAcror cgcgggggct gccccgggag croarrGAOG cggtcgccgg csggccgggto 

CIOGIGOIOG GQGOGGGCGS CAiaSGCIOC GAGCTCCTCA ACaMCTCGT GCTCftCOGGT 



60 
120 
IBO 



nxrroccACA togbcctcat tcmciggat ACTATTGATG TAAGCAACCT CAACAGACKS 240 
TrmCTm: aaaasaaaca totiggaaga tcaaaggcac aggttoccaa ggaaactota 

15 CTCCAG-mr ACCCGAAAGC TAATATOGCT CCaa^XfOG ACAGCATCAT GAACOCTOaC 
TATAATOTCG AATTmCCG ACACTTTATA CTGGTTATGA ATGCmAGA TAACAGMCT 
GCXXGAAACC ATGTTAATAG AATOTGCCTC GCAGCTGATG rrCCTCTTAT TGAAAGTGGA 
ACAGCTOGCT AICITCGACA AOTAACTACT ATCAAAAAGG OTGTCACCGA GTOTTATCAG 540 
TOTCATCCTA AGCCGACCCA GAGAACCTTT CXTCGCTGTA CAAITCGTAA CACACCTTCA 
25 GAACCPATAC AnCCATCOP TTCGGCAAAG TAdTOTTCA ACCAGTIGTr TCGGGAAGAA 
GATCCTCATC AAGAACTATC TCrTOftCASA GCTGACCCTC AAGCTOCCTO GGAACCAACG 
GAAOCCGAAG CCAGAGCTAG AOCAOCTAAT GAflGATGGTO ACATTAAACG TATTTCTACP 
^® AfiGGAAlGGG CTAAATCAAC TCGATATOAT Oa«?TO«AA CmTTACCA AGCTmTAA 840 

AGA-TCACATC AGCTATCTCT TCACAATQC5A CAAACTATQG CGGftAAAGGA AACCTCCAKT 
35 TCCG1TCGAC ■TOGGCTCAAG TACAAAGTCA AGGAGAflGAA ACGAATGCAT CAGATCAACA 
GAATCAACCC CACTTAQGCC OX^GACCA GCAOGTTCTA GAICTAAAGA QCTATOCAOG 

TcmrrrcA aagagca-k:g agactttcag AcyrrcATTTA gcagaaaagg gggatggaqc iobo 



300 
360 
420 
480 



600 
660 
720 
780 



900 
960 
1020 



TCAGCTCATA 1GGGATAAGG AIGACCCATC TCCAAT^GAT TnCTCACCT CTCCIGCAAA 1140 

1200 

1260 

1320 

1380 



CCrCAGGATC CATA-mTCA OTATCAATAT GAAGAGTAGA TTTGATATCA AATCAATOGC 
45 AGOGAACATT ATTCCTOCTA TIGCrACrAC TAATGCAGTA AnOCIGGOT TGATAGTATT 
CxaAGGATTC AAGATTTTAT CAGGAAAAAT AGACCAGTIGC AGAACAATTT ITTIGAATAA 
ACAACCAAAC CX:AAGAAAGA AGCnCnGT GCCT^ 
TICTTAIOTA WreCCAGCA AGCCAGAGGT 6ACTCTGCGG CW5AATGTCC ATAAAGTGAC 1440 
TOITCia^CCTTACAAGACAAGATACnGAAAGAAAAATITGCTATGGrAG 1^00 
55 CCAAAITCAA GAIGGGAAAG GAACAA-TCCT AATAICTTCX: GAAGAGGGAG AGACXK3VAGC 
TAATAATK:AC AAGAAGTK?! CAGAAirrGG AATTAGAAAT GGCAfiCCXXSC TICAAGCAGA 

rcAcrrccTC caggactata crrrAircAT caacatcctt catagtgaag acctaggaaa 

60 



1560 
1620 
1680 
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GGACGTTGAA TTrGAAGTTG TTGGTGATGC CCCGGAAAAA GPTGGGGSCCA AACAAGCTGA 1740 

AGATQCTGCC AAAAGCATAA CCAATGGGCA GTGATGATGG AGCTCAGCCC TCCACCTCCA 1800 

CAGCTCAAGA GCAAGAOXSAC GTTCTCATAG TTGATTCX5GA TGAAGAAGAT TCTTCAAATA 1860 

ATGCCGACGT CATGAAGAAG AGAGAAOCCG CAAGAOGAAA TTAGATGAGA AAGAGAATCT 1920 

CAGTTGCAAAG AGGTCJVCGTA TAGAACAGAA GGAAGAGCTT GATGATCrPCA TAGCATTAGA 1980 

TTGAACAGAA ATGCCTCTAA ACAGAACCCT CTTACTATTT AGTTTATCTG GGCAGAACCA 2040 

GATTGTTATG TCCTTTGTTC CAAAGGGAAA AAATTGACAG CAGTGACTT6 AAAATGATTC 2100 

15 TGCTCCCTTT GAAAGCATTC ATTTTGCTAG AACTGTTAGA CACATTQCAG TATGCTGTAT 2160 

TGAAAGrTAGG AATATAGTTT TAAAAACCCT TTGAACAAAG TGTGTGCATA ACCAGTCATG 2220 

AGATAAAACA ACACAATGCA TGTTGCCTTT TTAATGTAAA TACXXOTAGG TATCATTAAT 2280 

20 

AGrrrrCAAAA TATTOTGGTT TAGTAAAGTT GATACCTGGT TATAAATATT ATGCCTTTAT 2340 

TTTTGGCTAG AAGAAGAATT ATTTTTAGCC TAGATCTAAC CATTTTCATA CTCTTAACTG 2400 

25 ATTGAAACAG ATTCAAAGAA GTATCGAGTG CTATGCATTG AAA C TT G TT T TTAAATGTTA 2460 

GATGGCACTA TGTATATTAA TCTAAAACAA TGTTAATTTA CTCAAGTrTT CAGTTTGTAC 2520 

CGCCTGGTAT G TCTGT G TAA GAAGCCAATT TTTGTGTATT GTTACAGTTr CAGGTTATTT 2580 

ATATTCGAT6 TTTTGTAAAA CTCAAATAAC GACTATACTT ATGGACCAAA TAAATGGCAY 2640 

TGCATTCTKG TKAAAAAAAN NACAGAAAAA AAAAAAAACA AGA 2683 



30 



35 



(2) INFORMATION FOR SEQ ID KO: 295: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
GGACTCGGGG TGGCTCTAAG GGGCAGGGAT AGGGCTGGGG AGCGCCGGCC TGTGGCCCTG 60 
50 ACCAGCCCCT TCTCGTGCAG GTTCCACCCC GATGCAGGTG GTCACGTGCT TGACGCGGGA 120 
CAGCTACCTG ACGCACTGCT TCCTCCAGCA CCTCATGGTC GTGCTGTCCT CTCTGGAACG 180 
CACGCCCTCG CCGGAGCCTG TTGACAAGGA CTTCTACTCC GAGTTTGGGA ACAAGACCAC 240 

55 

AGGGAAGATG GAGAACTACG AGCTGATCCA CTCTAGTCGC GTCAAGTTTA CCTACCCCAG 300 
TGAGGAGGAG ATTGGGGACC TGACGTTCAC TGTGGCCCAA AAGATGGCTG AQCCAGAGAA 360 
60 GGCCCCAGCC CTCAGCATCC TGCTGTACGT GCAGGCCTTC CAGGTGGGCA TGCCACCCCC 420 
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TOGGTOCTOC AGGGGCCCCC TGCQCCCCAA GACACTCCTG CTCACCAGCT CCGAGATCTT 480 

CCTCCTOGAT GAGGACTGTG TCCACTACCC ACTGCCOGAG TTTQCCAAAG AGCCGCCGCA 540 

5 

GAGAGACAGG TACCGQCTGG ACGATGGCCG CCGCX3TCCGG GACCTGGAOC GAGTGCTCAT 600 

GQGCTACCAG ACCTAC0CX3C AGCCCTCACC CTCGTYTTCG ATGAC3GTGCA AGGTCATGAC 660 

10 CTCATOGGCA GTGTCACCCT GGACCACTTT QGGGAGGTGC CAGGTGGCCC GGCTAGAGCC 720 

AGCCAGQGCC GTGAAGTCCA GTQGCAGGTO TTTGTCCCCA GTGCTGAGAG CAGAGAGAAG 780 

CTCATCTCGC TGTTCGCrCG CCAGTGGGAG GCCCTGTGTG GCCTGAGCTG CCTGTCGAGC 840 

TCACCGGCTA GCCCAGGCCA CAGCCAGCCT GTCGTGTCCA GCCTGACX3CC TACTGGGGCA 900 

GGGCAGCAGG CTTTTGTGTT CTCTAAAAAT GTTTTATCCT CCCTITOGTA CCTTAATTTG 960 

20 ACTCTCCTCG CAGAAATGTG AACATGTGTG TGTGTTOTGT TAATTCTTTC TCATGTTGGG 1020 

AGIGAGAATG CCGGGCCCCT CAGGGCTGTT CGGTGTGCTG TCAGCCTCCC ACAGGTGGTA 1080 

CAGCCGTCCA CACCAGTGTC GTCTCTGCTG TTGTGGGACC GTTGTTAACA CGTGACACTG 1140 

TOQGTCIGAC TTTYTCTTCT ACACGTCCTT TCCTGAAGTG TCGAGTCXS^ TCCTTTGTTG 1200 

CTCTTQCTGT TOCTGTTGCT GITQCrGTTG GCATCTTGCT GCTAATCCTG AGGCTGGTAG 1260 

30 CAGAATGCAC ATTOGAAGCT CXXJ^CCCXAT ATTGTTCTTC AAAGTGGAGG TCTCCCCTGA 1320 

TCCAGACAAG TCGGAGAGCC CGTGGGGGCA GGGGACCTGG AGCTGCXAGC ACCAAGCGTG 1380 

ATTCCTGCTG CCTCTATTCT CTATTCCAAT AAAOCAGAGT TTGACACCGW MAAAAAAAAA 1440 
AAAAAAAAAA AACN 



25 



35 



40 



50 



(2) INFORMATION FOR SEQ ID NO: 296: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



60 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 
ACCCTOGCAT GCCCCACAAA CAGATCACCA GCCAGCTTAC ACAGGCATTA ACTCTCCTCA 

ATXSAOGAAGA ATCATTCACA ACTGAGCAAG ACATTCATAT GATCATITAA GGAAGTGTTT 120 

55 CCCTTATCTG TTAGCAAGTA TAATCQGCTA ACTCCTAAAT CCCAATGAAT AGTCCTAGGC 180 

TGGACAGCAA TGGGCTGCAA TTAGGCAGAT AAAGACATCA GTCCCAGTAA ATGAATCCAT 240 

AGACTCATCT AGCACCAACT ACCATTAGCA CTATGTTAGG AGCTGCAAGG CCCCAAAGTA 300 
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GAAGATGTGC ATAATGTCTG CTCTTGTGTA GCTCAGGAGA CAATTCCAGC ACAGACACTA 360 

CAGTTAACGC TGAACTGCAG CTOCAAGTAA TAGCAM3AAC AGTCAGAAAA ATACCTTAT6 420 

AGOGGGCAGG GCTGAAGCTG GGCCTTGAAG GATGGAT6AA ATTTGGATAG AGAATGAGGA 480 

AGACAGAGGG NCTCCAAGTG AGAGAAGCAT GAAAAATGAG CARGGGCCTG GATCAGTGGG 540 

GTGTATTCAG AGCACCTYTC CAGATGCACC ATGCATGCTC ACAGTCCCTT GCCTATGTGT 600 

GGCAGAGTGT CCX7MSCCAGA TGTGTGCCCC CACOCCATGT CCATTTACAir GTCCTTCAAT 660 

GCCXACCTCA AAAGGYACYT CTTCTGTAAA GCTTTCCCTK GGTATCAGGA ATCAAAATTA 720 

15 ATCAGGGATC TTTTCACACT GCTGTTTTTT CXrTCTTTGGT CCTTCTATCA CTAAAACTTCA 780 

TCTCATTCAG CCTTACAGCA TAACTAATTA TiTOmTCC TCACTACA 828 



10 



20 



(2) ItJFORMATION FOR SEQ ID NO: 297: 



(i) SEQUENCE CHARACTER STI CS : 
25 (A) LEtJGTH: 2416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

30 (xi) SBQUOKIE DESCRIPTION: SEQ ID NO: 297: 

TCAATTTCCA TTAACTCAGA TCAGCCATTG TGATTCACCA TTTGTCAGGC TCTCAGGTTT 60 

AACAAAACCT ACTATCACCA TCATCCTTCA ACAGCCACAG TCTGAATTGA GCCAACATTT 120 

35 

■rmiTcriT GAGAAAGAAG TQGACTGGGG CACAACTTTT AGTCTGAGGG GAGCTAGTGG 180 

AAATCTAGAC AATAGAAGTC ATCGATAGCA GCTTTTCCTC AAATGTC3TGA CTCCTCAGGG 240 

40 GCTAAACTGC TCTTAGCTTA GAATTATGCT TTACTAGAGA TCTAGCAGAT AAGTGGGTTA 300 

ATCACTACCA TCCTGTAACT AGTTATATAG CTTCCAGACA TGAGGGAGAC ATCAAACAGG 360 

GATGGAAGCA ACCCCAAGGA TATGCAAGAA GGGCATGATG AACCCCCTTC CCTCTGGCAG 420 

45 

GAGAACAAGG CCAACCAAGG GACAGACTGG AAAGCACTTA GATGTTTAAG GAGGAGAAAG 480 

GGGAAGCTTT GACCAGTCCT TGCCTTTTGC CAAGTTCAGC CAGTTCTCCG CTGCTTGCAA 540 

50 CCTCTAGCGC AGTAACATTT GCAGAATTGC AGATTTTCCC CCAGATACTA GGAQGAAAGG 600 

GACTTTQGGG GGTGGGGAAG GGGTCGTGGT GTTTTAAAAG CAT/^AGTTAC CTGTTTGCAC 660 

TGTTTTAAGA TAGGAAAAAA AAATAGTGGG CAAGGTGAAC ATCAGACGTA AATTTGTGTG 720 

55 

TTTTTATTTT GTCATGCTCT TGAAAATGTT TGACCATTTG TAGTATACAC AGTGAAACTT 780 

GATTCTCTGT TGCATAAAAC ACTATATTTT TTTGGAAATG TTACTGTCCA AAAGCCTCTT 840 

60 CCCTCCCTTT CCTTTTCCTA TGTACTTCCT TCATACTTGC TTTACTGATC AGCCAGGCAA 900 
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TAGCCATCCA AGAGCTAGAG CATGAAACAG GGCCCTTTCC AAGTAGQCTC TGGGTGTCCT 960 

AAGCCAGCOT GTGCXrCTCTG GTTTAGTCAG TGTAATAGAG TCXXTTGOCAC CTTTCTTTGC 1020 

5 

AAATGAGGCT AACAGACCAG ACTGCAGCAA GTTATCAGAT TCCTCAATCA GATGCACTAG 1080 

GAGTGAGGAG CCCAGGGATG GAOGGGGTTC CTGAAGTATT GCAGTTGGCT GTAGTAOCTG 1140 

10 AGTTCTTTTC CATGTTACCG AAACTGTAGC CAGTITACAGT TTACTCAGGA AAACGGTAGA 1200 

TCAATTCAGC CATQGTAGTO CTGGTTGGCA GGGATTGGTA ACGGAGAGAA CTGCTCATCA 1260 

GCCAAAACTC AAGCCTTOCX: TTTTAGGAGG CCACCAGCAG AGGGACTTGG TCCTCCTTGT 1320 

15 

CTGCTACTTG TGTACATGCC GGTGACCTGA GGACTCCACT CACACTGGCG AGCAAAAAGG 1380 

GAGCAGTGAT TCTCTTTTCT CTCCCCACCC CCTGCCCTTT GTTACCAACA CCAGTTTCCC 1440 

20 AGGGGGTACA TGAGTTTCTG AATTTTTAAA AAATGTTTTT GGTTTGGTTT TTCTGGQGAC 1500 

TQATAAGTGC TTTAAGCAAT GTCCATACXX: CX5TCAAGACT CCCAfiCTTAG TCATTTTCTT 1560 

GTATTTTTCT GTTCACAGTA TTTGTGTGTG TGCTTC5TTTT GGCAQCTCAT TTTGGCTGTA 1620 

25 

TTATATATTG AGTGATGAAT TGATCCTCTT TTTTCCCTAA GGGATATGAA TIXJlTmvr 1680 

TGTGTTATAT TCTGCTTGTG AATAGCTGGA GCAAACCTQG QGCTGACACG CGTAAGSTAG 1740 

30 GGCTGCAAAR CGAGAAGAGA GCXXXSTGGAG TGTACTTGTC CCTGACAOGC TGACCTACCT 1800 

GAGTCTCTGA GCTTTTCAGT CCAAATCTTT QCAAGGCTCA AAATGCCACA GAACCTCTCC 1860 

TCTTCTCCCC ACTCCCCATG GCAOGGACCG GACCATCCXTT ACATQCAACA TGCTGTTCCT 1920 

35 

ccAGcxxxrrc CCATTGCCAT GGCAAAACAG GTACCTTTGG GGCATGGGGG CATTACATQG 1980 

GATGCTTGTG TAATCGACCA CCTAGCCTTC TCTCTCCCCT CCCGTCCTCC CCCAGAATCA 2040 

40 CTTCCTAGGA CACCCGAGCT GCTTGCCCAG GGTCXrTGTTT CCCTGCTAAC TCCAGAGAAG 2100 

CATCCCAGGG CTTTGTGACA GTCTCTAATT CCCTTCCCTT CTCGrTTAAGA ATCATATTGT 2160 

ATAGTAGCTT TCAGACCATA CAGTATTCAT TGGGTTACTC CTATTATTAT CAAGTAGCTG 2220 

45 

GAATTGTGAA GGTCGGAGTA GTTAGATCTT TAGCTTTTAT TCCTTATTTT TTTGTATTAC 2280 

TCTCCATGTG TATAAATTAT TGATCATGTT QCTGGCTTTT ATAAACTCTA AGCGAAGGAG 2340 

50 GAGCACTGCC TCAGCCTTTG CACATGGTAA TGAAGCACTG TTTTTAAATA AAAGRGRGAA 2400 

MCMCCAAAAA AAAAAA 2416 



55 



<2) INFORMATION FOR SEQ ID NO: 298: 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 545 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 298: 

GAATTCX^GCA CGAGCXTATGC YTGGCCTCTC CTTGATTCTT ACAGTCACTT TGTTGGCTGT 60 

TTCTGACTCA GCAGCTACCT GCATTGTGGC CAAAGGATGA OCTATTCCTT CTCAGC5AGGG 120 

CAAAAATGTG GAATAGTGTC TGTCCATGCC TCTCCTCATG GGCTACCACC TCTOCCflCCG 180 

TGGTTAATCX GTAACAACCA GGAGAGAAGC TGCTGGAACT GACCTCTGGG AACTC<XTGG 240 

15 ATGGTTTC5GT GCAQGAATGT AGTAGGCATA CACGTGGTTG CGTGGATCTG GGCCCTCCTG 300 

ATGTGAGTAG AGAGGTAAAA GGSCACCATC TCCTTGACCT YTGGGGAACT CATCCACAAA 360 

GAAGATGTTT CCAAGATGCT TCTGAAGATT GSCTAAAAAT AGCCGGTTTC CACCCCCGTG 420 

20 

AATGCATCCA TTCTAGAATG CTCCTTCACC AGGACCAGAG AACTGATTTA CAGAAGTGAC 480 

ATGAAAACAT TCCATCCCAG AATTTGCANT ACCTCAAATT NAATTTCTAC CTATTAAAAA 540 

25 NAAAA 545 



30 (2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1530 base pairs 

(B) TYPE: nucleic acid 
35 to STRANDEDNESS: double 

(D) TOPOLOGY": linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 299: 

40 GGCTCTGCTG GGCATCATAC TTGTCACTGG GTAAACAGTT TGCCCACTTA CCGCAGATGA 60 

AGCTGCTTGC CAGGGCTCTC CGGCTCTGTG AGTTTGGGAG GCAQGCATCT TCCAGGAGGC 120 

TGGTGGCTGG CCAGGGATGT GTGGGGCCCC GGCGAGGGTG CTGCGCTCCC GTCCAGGTGG 180 

45 

TTGGGCCCAG GGCTGATCTC CCACCCTGTG GAGCCTGCAT TACTGGAAGG ATCATGCGGC 240 

CAGATGATGC CAACGTGGCC GGCAATGTCC ACGGGGGGAC CATCCTGAAG ATGATCGAGG 300 

50 AGGCAGGCGC CATCATCAGC ACCCGQCATT GCAACAGCCA GAACGGGGAG OGCTGTGTGG 360 

CCGCCCTGGC TCGTGTCGAG CGCACCGACT TCCTGTCTCC CATGTGCATC GGTGAGGTGG 420 

CGCATGTCAG CGCGGAGATC ACCTACACCT CCAAGCACTC TGTGGAGGTG CAGGTCAACG 480 

55 

TGATGTCCGA AAACATCCTC ACAGGTQCCA AAAAGCTGAC CAATAAQGCC ACCCTGTGGT 540 

ATGTGCCCCT GTCGCTGAAG AATGTQGACA AGGTCCTCGA GGTGCCTCCT GTTGTGTATT 600 

60 CCCGGCANGA GCAGGAQGAG GAGGGCCGGA AGCGGTATGA AGCCCAGAAG CTGGAGCGCA 660 
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TGGAGACCAA GTGGAGGAAC GGGGACATCG TCCAGCCAGT CCTCMXCA GAGCCGAACA 720 

CTGTCAGCTA CAGCCAGTCC AGCTTGATCC ACCTGGTGGG QCXnTCAGAC TGCACCCTGC 780 

5 

ACGGCTTTGT GCACGGAGGT GTGACCATGA AGCTCATGGA TGAGGTCGCC GGGATCGTOG 840 

ctgcacgcx:a CTGCAAGACC AACATCGTCA CAGCTTCCGT GGAOGCTATT AATTTTCATG 900 

10 ACAAGATCAG AAAACSGCTGC GTCATCACCA TCTCQGGACG CATGACCTTC ACGAGCAATA 960 

AGTCCATGGA GATCGAGGTG TTGGTGGACG CCGACCCTGT TGTGGftCAGC TCTCAGAAGC 1020 

GCTACCQGGC CGCCAGTGCC TTCTTCACCT ACGTGTCGCT GAGCCAGGAA QGCAQGTCGC 1080 

15 

TGCCTGTGCC CCAGCTGGTG CCCGAGACCG AGGACGAGAA GAAGCGCTTT GAGGAAGGCA 1140 

AAGGGCGGTA CCTGCAGATG AAGGCX3AAGC GACAGGGCCA CGCQGAGCCT CAGCCCTAGA 1200 

20 CTCCCTCCTC CTGCCACTGG TGCCTCGAGT AGCCATGGCA ACGGGCCXAG TGTCCAGTCA 1260 

CTTAGAAGTT CCCCCCTTGG CCAAAAACCC AATTCACATT GAGAGCTGGT GTTGTCTGAA 1320 

GTTTTCGTAT CACAGTGTTA ACCTGTACTC TCTCCTGCAA ACCTACACAC CAAAGCTTTA 1380 

25 

TTTATATCAT TCCAGTATCA ATGCTACACA GTGTTGTCCC GAQCX^CCOGG AGGCGTTGGG 1440 

CAGAAACCCT CGGGAATGCT TCCGAGCACG CTGTAGGGTA TGGGAAGAAC CCAGCACCAC 1500 

30 TMATAAAGCT GNTGCTTGGC TGGGGAAGNA 1530 



35 (2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 997 base pairs 
<B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: double 

(D) TOPOIXXV: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

45 AGGTAGTGAG AGACACATTA CACCTAACCA ACAAGAAGAA GGATCCTCCC CCTTATAATT 60 

TAACTATGTT TACAGGGAAT GCGTACATTG TGGCTTCCCG AGNATTTCGT CCAACATGTT 120 

TTGAAGAACC CTAAATCCCA ACAACTGATT GAATQGGTAA AAGACACTTA TAGCCCAGAT 180 

50 

GAACACCTCT GGGCCACCCT TCAGCGTQCA CGGTGGATGC CTGGCTCTGT TCCCAACCAC 240 

• CCCAAGTACG ACATCTTCAG ACATGACTTC TATTGCCAGG CTGGTCAAGT QGCAGGGTCA 300 

55 TGAGGGAGAC ATCGATAAGG GTGCTCCTTA TGCTCCCTGC TCTGGAATCC ACCAGCGGGC 360 

TATCTGCGTT TATGGGGCTG GGGACTTGAA TTGGATGCTT CAAAACCATC ACCTGTTGGC 420 

CAACAAGTTT GACCCAAAGG TAGATGATAA TGCTCTTCAG TGCTTAGAAG AATACCTACG 480 
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TTATAAGC5CC ATCTATQQC3A CTGAACTTTG AGACACACTA TGAGAGCGTT GCTACCTGTG 540 

GGGCAAGAGC ATGTACAAAC ATGCTCAGAA CTTGCTGGGA CAGTCTGGGT GGGAGACCAG 600 

5 GGCTTTQCAA TTCGTGGCAT CCTTTAGGAT AAGAGGGCTG MTATTAGATT GTGGGTAAGT 660 

AGATCTTITC CCTTQCAAAT TGCTGCXTTCG GTGRATGCTG CrTGTTCTCT CACCCCTAAC 720 

CCTAGTAGTT CCTOCACTAA CTTTCTCACT AAGTGAGAAT GAGAACTGCT GTGATAOGGA 780 

10 

GAGTGAAGGA GGGATATGTG GTAGAGCACT TGATTTCAGT TGAATGCCTG CTGGTAOCTT 840 

TTCCATTCTC TGGAGCTGCC GTTCCTAATA ATTCCAGGTT TGGTAGCGTG GAGGAGAACT 900 

15 TTGATGGAAA GAGAACCTTC CCTTCTGTAC TGTTAACTTA AAAATAAATA GCTCCTGATT 960 

CAAAGTAAGG AAAAARAAAA AAAGAAAAAA AACTCGA 991 



20 



35 



45 



(2) INFORMATION FOR SEQ ID NO: 301: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 2345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEE4JESS: double 

(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

TTGAGGCXX3A CQCTAGQGGC CCGGAAGRAA ACTGCGAGGC GAAGGTGACC GQGGACCGAG 60 

CATTTCAGAT CTGCTCGGTA GACCTGGTGC ACCACCACCA TGTTGGCTGC AAGGCTGGTG 120 

TGTCTCCGGA CACTACCTTC TAGGGTITTC CACCCAGCTT TCACCAAGGC CTCCCCTGTT 180 

GTCAAGAATT CCATCACGAA GAATCAATGG CTGTTAACAC CTAGCAGGGA ATATGCCACC 240 

40 AAAACAftGAA TTGGGATCCG GCGTGGGAGA ACTGGCCAAG AACTCAAAGA GGCAGCATTG 300 

GAACCATCGA TGGAAAAAAT ATTTAAAATT GATCAGATGG GAAGATQGTT TGTTGCTQGA 360 

GGGGCTGCTG TTGGTCTTGG AGCATTGTGC TACTATGGCT TGGGACTGTC TAATGAGATT 420 

GGAGCTATTG AAAAGGCTGT AATTTGGCCT CAGTATGTCA AGGATAGAAT TCATTCCACC 480 

TATATGTACT TAGCAGQGAG TATTGGTTTA ACAGCnTGT CTGCCATAGC AATCAGCAGA 540 

50 ACGCCTGTTC TCATGAACTT CATGATGAGA GGCTCTTGGG TGACAATTGG TGTGACCTTT 600 

GCAGCCATGG TTCGAGCTGG AATGCTGGTA CGATCAATAC CATATGACCA GAGCCCAQGC 660 

CCAAAGCATC TTGCTTQGTT GCTACATTCT GGTGTGATGG GTGCAGTGGT GGCTCCTCTG 720 

55 

ACAATATTAG GGGGTCCTCT TCTCATCAGA GCTGCATGGT ACACAQCTGG CATTGTGGGA 780 
GGCCTCTCCA CTGTGGCCAT GTGTGCGCCC AGTGAAAAGT TTCTGAACAT GQGTGCACCC 840 
60 CIGGGAGTCG GCCTGGGTCT CGTCTTTGTG TCCTCATTGG GATCTATGTT TCTTCCACCT 900 
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ACCACCGTGG CIX3GTCCCAC TCTTTACTCA C5TGGCAATGT ACGCTTGGATr AGTICTTTTC 960 

AGCATCTTCC TTCTGTATCA TACCCAGAAA GTAATCAAGC GTGCAGAAGT ATCACCAATG 1020 

^ TATOGACTTC AAAAATATCA TCCCATTAAC TCGATC3CTGA GTATCTACAT GGATACATTA 1080 

AATATATTTA TCCGAOTK3C AACTATGCTG GCAACTGGAG GCAACAGAAA GAAATGAACST 1140 

10 c»craw3CTr cksgcttctc tcctacatca aatatcttgt tpaatgcsqcx: agatatgcat 1200 

TAAATACTTT GTACAAGCAG CITTCGTTGA AGTITAGAAG ATAAGAAACA TGTCATCATA 1260 
TTTAAATOrr CCGGTAATOT GATGCCTCAG GTCTGCCTTT TTTTCrGGftG AATAAATGCA 1320 
GTAATCCTCT CCXS^TAAG GACACACATT TTCAATTCTC ATGrTTTGAGT GATTTTAAAA 1380 
TOmTCGTC AATCTGAAAA CTAAAGTTTG TGTCATGAGA ATGTAAGTCT TTTTTCTACT 1440 
20 TTAAAATTTA GTAGGTTCAC TGAGTAACTA AAATTTAGCA AACCTGTGTT TGCATATITr 1500 
rrrCGAOTGC AGAATATIXST AATTAATGTC ATAAGTGATT TCGAGCTTTG GTAAAGGGAC 1560 
CAGAGAGAAG GAGTCACCTG CAGTCITrrG TTTTTrrAAA TACTTAGAAC TTAGCACTTG 1620 

25 

TOTTATTGAT TAOT5AGGAG CCAGTAAGAA ACATCTGQGT ATTIGGAAAC AAGTCGTCAT 1680 
TGTTACAfPrC ATCTOCKSAA CTTAACAAAA CTGTTCATCC TGAAACAGGC ACAGGTGATG 
30 CATTCTCCTC CTCTTOCTrC TOVGTCCTCT CTTTCCAATA TAGATGTGGT CATGITrGAC 
TTGTACAGAA TOTTAATCAT ACAGAGAATC CTTGATGGAA TTATATATGT GTGTmACT 
TTPGAATGTT ACAAAAGGAA ATAACnTAA AACTATTCTC AAGAGAAAAT ATTCAAflOCA 

35 ^......., ,j,sn 

TGAAATATCT TGCTTTTTCC AGAATACAAA CAOTATACTC ATGATTGCTA AGTGTTXTIT 



1740 
1800 
1860 
1920 
1980 
2040 
2100 



TATTrrrcCA TAirrATIGA ACTGTCTAAT TGAATACAGC TTOCTCTTGT CACCTCTTCA 
40 AGCTTTCAAG CCTTTATAGA AAAGCTTCTT TGTGGCTTAC ACTGGAAATT ATGAAAGCAG 

TTmCTCCT AAGACrTTTG GTITCTCGCA TTCCCTCTCA GACTAAGCAC TAAAAAGCAA 2160 
AGCAAAACAG AACTAGTNCT GTCTTAATGA AATATATCAA CCCAAAAGTG TAATGAGGAA 

45 

AATGCrrCAT TAOTITCCXrC TAGCAGACTT TTACTTCTCT TACACTGCTA CACCATTACT 
TTCTIGAGAC ATTTOrAAGT CCTTTCATAC AGAAGAGTTA TATTTAGGAG GNCTTTAATG 
50 AAGGG 



2220 
2280 
2340 
2345 



55 (2) INFORMATION FOR SEQ ID NO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



533 



(D) TOPOLOGY: linear 
(xi) SEQUENCE INSCRIPTION: SEQ ID NO: 302: 

rrxTi ' m 'T T iTrmTm ' tttttncaag atcattgttt atttattact tcagataaaa 
agatagtata catattaggg aatcccttaa aattcaactc tagagttata caccatctag 
tactittgca atgaatgtta acaacaacaa aaaaaatctc taaacacctg aaaoccccac 
tattaacatg gactatggta ataaaaaatt ttgacattta atttgttcaa catatagtat 
ttacattatg aaaccaatgg tgatgataca ataaagtgat aaagaaatag taaatuvtaaa 

CTTTAAAAAG CAAAGGTTTA TAGTCTGACA ATGCTAATTA TCCTAATTGT ATATAAAAAA 
TTAAAACATA GAGCTTTCTG TTACAAAATT CTTAATCCTC TGGGTTGTAA TCATTACTTG 
CTACCAATTT ACATGCAACA TCTGCTAOGA CTGACATTTG ATTTTTTTCC CCAAGAATGT 
GTGAGTAGAT AAATGACATT TCAGAGCAGA TATTAATTTA CTTGTGGACA GAAAAAGAAA 
CTCAAGATTG GTACTQGTCA CAAGCCTCTT CCCAATAGAA ATTATAAAAA CAGTAAGATA 
AAATTTAAAA AAAATCTAAA AAGGGGATGC ATAOGCAAAG AGTACCATAA ATGGCACAGC 
TCAAAAAATC CCAGGACCAA TCAGACACAC ATCTTTTCTC TCTCCTTCAG CGACAAGAGG 
TCGATTTTGC CATCAAATAA CCATGATTGA AGCAAGCGAG GGGCACCAGG TGTACAACTG 
ATTAGATCTT GCAAAATACT AAGATGGGAG CAGGGGTGGC CAGAAGAAGG QGTAATTTAT 
ATATAATTCA AACTATATAC AGCATAAATG GAATGCAGCC CATCCCAAAC TGGCTCTGTG 
AAACAATTGG ACCTTTATAG TTAAAATTAT AACAAGTGTA ATAATACAAT AGATTTACAT 
GGGAAGCAAA ATCCAAGGGA CATTTTATAT TAAGTATTTA CTGTGCTGTT TCAATTTAAA 
AATAATTTTG CTAAGTATAC ATCTCAACTG AAGTCTATGT AAAAAATGTC CTAATAGATA 
CAGATATTTA CCmOGTGA GTTGAAGGCC TTTTTGTGAC TTCTGTCTGA ACTGTAQGCA 
GAATCCTAGA TCTACATGCA CATATGGAGA AACTCAAGCT GAGGTCATCC AAAAGCTGTG 
CGTATCAGGA GGCTGGAGGT ACTTTGAAAG TCAAAGTAGA CCAGAAACCC AAAACAGGTA 
ACAGTCAGGA TCGCAACAGG GAATGGAATG CCAATATGGC AGTAAAACTT TTTTTAAAAA 
CAGAAAGAGG AAGGCCTCTC GrTACCAGCAG AATCCTGTAC ACGTACAAAA AAGAAAAAGC 
CACCCACCAT TTTCTAAAAC AGAAGCCAAT TATAGTGTGG GAAAGTACAA ATTACAGAAA 
ACCAGAAGTC AACAGAAGAA AAACTACTGG TTTACTTGAG AGAAAGGAGA ATGGTTCACC 
CCGAGCAGAG TTACTIOGTG AACGCCGCCA CCACCGCCCA CAGAACCTCA TTGGTGTTGG 
CCTTCAGACA TTCCACTTCA GGGTCTAAGT CGAGAARNTG CCGCACTCTC TTGGTAGCCA 
AATCATACTG CTCGTCCAGA AGAQGAGCAA AAGCATTCTC CAGGACGTCC GAGGCATGAG 
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20 



25 



45 



(2) INFORMATION FOR SEQ ID NO: 303: 



55 



CCAGCTAAAT GAGGGCXMC AAGCGCCTGT CCATGCQGrTG AGGGTCATTC ACCCATTTCT 1740 

CAAGAACGGC TTCCTCTACT TTCTTCATGA GGCX3CTG5CTT AATGTTGrTA TTQGTGAGGG 1800 

GATOTGTTCT CATGTCAAAA AGTAGGAAGT TCTGTTTCTC TCTTC5TCAAT ACACCCTPIT 1860 

CCACCAQGTr TITAGCTAAT CGTTCCCGTA CATTTCTTAA CTGATAATGC AATTTTAATG 1920 

GATTCCA3X3T CTCACCACTA AGTAATTCAA TCCAGTTCTG GACCGTTTCT GGAGGCTGAG 1980 

TTTCCTTAAC ATGCTTCAGA GCTTCATCAA GAAGAACATC CCCTGTTQGA GCATCTGACT 2040 

TACAGATTAC CriTCITCTT AATAGACTTT TACXHTTCAT TCCACAAGCC TCTAGTTGTA 2100 

15 ACCrrCCTCT CAATGCTAAT TCAATTAACA TACAGCCACG TAATCCAGAT GATATACAGT 2160 

CATICCAAAA TCATOIOTAA ACCTTCQCGG TCCITCftGGC CCAGCAGGAG CaCTrCCTCC 2220 

ATCAQCSGTCA CSCTGCGrrTC CTTCGACSTCG CCCTTGTCGT CGTCGTCCTG CTCGTCGCGG 2280 

CGGCTCTCCXS CXSTOTTCCTC GCTCCTAGCC GCGCCGCCGC CCGCCGCCCX; CTCCITOTCG 2340 
GCX5GCGTTGC GQGAQGCCTC QGTGCGCCG 



2369 



30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : double 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 
GGGACGTCTG GTTTCACSCTC C3TCCGCCTCC CCGTC5GGTTT GCGACGTTTA GCGACTATTC 60 
40 CQCCTX3CGCC ACGCCGGCTC CGAGACTGGG GCCGTGGYTG CTGGTCCCGG GTGATGCTAG 120 
GCGGCTCCCT GGGCTCCAGG CTOrTCOGGG GTGTAGGTGG GAGTCACGGA CGGTTCGGGG 180 
CCCGAGOTGT CCGCGAAGGT GQCGCACATG GGCQGCAGGG GAGAGCATQG CTCAGCQGAT 240 
CGTCTGGGVG GACCTGGAGA TGACAGGATT GGACATTGAG AAGGACCAGA TTATTGAGAT 300 
GGCCTCTCTG ATAACTGACT CTGATCTCAA CATTTTC3GCT GAAGGTCCTA ACCTCaOTAT 360 
50 AAAACAACCA GATGACTTCC TQC3ACAGCAT GTCAGATTGG TGTAAGGAGC ATCACGGGAA 420 
GlCTOCXXrrr ACCAAGQCAG TGAAGGAGAG TACAATTACA TTGCAGCAQG CAGAGTATGA 480 
ATTTCTGrCC TTTGTACGAC AGCAGACTCC TCCAGGGCTC TGTCCACTTG CAGGAAATTC 540 
AGTTCAIGAA GATAAGAAGT TTCITCACAA ATACATGCCC CAGTTCATGA AACATCTTCA 600 
TTATAGAATA ATTGATCTGA GCACTGrrAA AGAACTGTGC AGACGCTGGT ATCCAGAAGA 660 
60 ATATCAATTT GCACCAAAGA AGGCTCCTTC TCATAGGGCA CTTGATGACA TTAGTGAAAG 720 



535 



CATCAAAGAG CTTCAGTTTT ACCGAAATAA CATCTTCAAG AAAAAAATAG ATGAAAAGAA 
GAGGAAAATT ATAGAAAATG GGGAAAATGA GAAGACCGTG AGTTGATGCC AGTTATCATG 
CTGCXACTAC ATCGTTATCT GGAGGCAACT TCTGGTGGTT TTTTTTTCTC ACGCTGATGG 
CTTGGCAGAG GMOTTCQGrT AACTTGCATC TCCAGATTGA TTACTCAAQC AGACAGCACA 
CGAAATACTA TTTTTCTCCT AATATGCTGT TTCCATTATG ACACAGCAGC TCCTTTGTAA 
GTACCAGGTC ATOTCCATCC CTTGGTACAT ATATGCATTT GCTTTTAAAC CATTTCmT 
GTTTAAATAA ATAAATAAGT AAATAAAGCT AGTTCTATTG AAATGCAAAA AAAAAAAAAA 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA N 

{2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1537 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

CTTTTTGTCT TCCGGCCGAT CCCACCTCTC CTCGACCCTG GACGTCTACC TTCCGGAGGC 

CCACATCTTG CCCACTCCGC GCGCGGGGCT AGCGCGGGTT TCAGCGACGG GAGCCCTCAA 

GGGACATGGC AACTACAGCG GCGCCGGCGG GCGOCGCCCG AANATGGAGC TGGCCCGGAA 

TGGGGAGGGT TCGAAGAAAA CATCCAGGGC GGAGGCTCAG CTGTGATTGA CATGGAGAAC 

ATG6ATGATA CCTCAGGCTC TAGCTTCGAG GATATGGGTG AGCTGCATCA GCGCCTGCGC 

GAGGAAGAAG TAGACGCTGA TGCAGCTGAT GCAGCTQCTG CTGAAGAGGA GGATGGAGAG 

TTCCTGGGCA TGAAQQGCTT TAAGGGACAG CT6AGCCQGC AGGTGGCAGA TCAGATGTQG 

CAGGCTGGGA AAAGACAAGC CTCCAGGGCC TTCAGCTTGT ACGCCAACAT CGACATCCTC 

AGACCCTACT TTGATGTGGA GCCTGCTCAG GTGCGAACAG QGCTCCTGGA GTCCATGATC 

CCTATXiJ^GA TGGTCAACTT CCCCCAGAAA ATTGCAGGTG AACTCTATGG ACCTCTCATG 

CTGGTCTTCA CTCTGGTTGC TATCCTACTC CATGGGATGA AGACGTCTGA CACTATTATC 

CGQGAGQGCA CCCTGATGGG CACAGCCATT GGCACCTGCT TCGGCTACTG GCTGGGAGTC 

TCATCCTTCA TTTACTTCCT TGCCTACCTG TGCAACGCCC AGATCACCAT GCTGCAGATG 

TTGGCACTCC TQGGCTATGG CCTCTTTGGG CATTGCATTG TCCTGTTCAT CACCTATAAT 

ATCCACCTCC ACGCCCTCTT CTACCTCTTC TGGCTGTTQG TGGGTQGACT GTCCACACTG 
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CX3CATQGTAG CAGTGTTGGT GTCTCQGACC GTGGGCCCCA CACAGCQGCT GCTCCTCTGT 960 

GOCACCCTGG CTGCCCTACA CATGCTCTTC CTGCTCTATC TGCATTTTCC CTACCACAAA 1020 

GTCNTAGAGG GGATCCTGGA CACACTGGAG GGCCCCAACA TCCCCCCCKT CXAGAGGGTC 1080 

CCCAGAGACA TCCCTGCXZAT GCTCCCTGCT GCTCGGCTTC CCACCACCGT CCTCAAOGCC 1140 

ACAGCCAAAG CTGTTGCGGT GACCCTGCAG TCACACTGAC CXXIACCTGAA ATTCTTGGCC 1200 

AlCTCCTCTTT CCCGCAGCTG CAGAGAGGAG GAAGACTATT AAAGGACAGT CXTTGATGACA 1260 

TGTTTCGTAG ATOQQGTTTG CAGCTGCCAC TGAGCTGTAG CTGCGTAAGT ACCTCCTTGN 1320 

15 AGCTGTOGGC ACTTCTGAAA GCACAAGGCC AAGAACTCCT QGCCAGGACT GCAAGGCTCT 1380 

• GCAGCCAATG CAGAAAATGG GTCAGCTCCT TTGAGAACCX: CTCCCCACCT ACCCCTTCCT 1440 

TCCTCTTTAT CTCTCCCACA TTGTCTTGCT AAATATAGAC TTGGTAATTA AAAAAAAAAA 1500 

AAAAAAAAAA AAAAAAAAAA AAAAAAGGGG GGNCXXX: 1537 



20 



25 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 305: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1493 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 305: 

TCCA'TCCCAA AACCAATGCC TGCCAAACAA AATCTTAGAC ATCCCAATAT AATATGTTAG 60 

TTATATTTCT ATTCACATCA TTATTGAAAA TACCCAGCTC AGTGCCTGGC TTAATAAATG 120 

40 TTTAATTCCC TTACCTACTC TTGCTCTATT TTTTTATTTG AAATGGAGAT GAGCAAAATA 180 

ACACATTCAT GQCTCAAGCA ATTTTTTGGA CATTTCTTGT TACCAAAAGA TCTATAATCA 240 

GGATXSATCCT GAGCTGTTCA AACAAGCTGT ATATAAACAG ACAATGAAAC TCTTTGCAGA 300 

GCTGGAAATT AAAAGGAAAG AGAGAGAAGC CAAAGAGATG CATGAAAGGA AACGACAAAG 360 

GGAAGAAGAG ATTCAAGCTC AAGAAAAAGC CAAACGGGAA AGAGAGTGGC AGAAAAACTT 420 

50 TGAGGAAAGT CGAGATGGTC GTGTGGACAG CTGGCGAAAC TTCCAAGCCA ATACGAAGQG 480 

GAAGAAAGAG AAGAAAAATC GGACCTTCCT GAGACCACCG AAAGTAAAAA TGGAGCAACG 540 

TCAGTCACCG CCCAAGGTCA CAGGCACAGA ACCTTTCCCC TGCTATCTCC CTTCCTGCTT 600 

CGAAGGACTC ATTCTTTCCT CCCACTTCCA CCCCAACATA GAGTAGTATT TGCTTnTAG 660 

TCCATTTTGT TTTCAATACG ATTTAATATC GATCAGAGTA ATTCTTTTGT ACATTGAAAT 720 

60 GAGGGGCTTC CTTTAAAAAA AGACCTTTCC CTCTCCCTGC CCCTAGAACA ACCAGTATTA 780 
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GAAGGTGCCA CCATTGGTGC TGCCTTCTCT 
TTCACTGAAT TOTGATGGTT AGAAACTTCG 

5 

CATACTGCTT AAAACAGTGT TGCTGTGACT 
GAAGCCCCTT CGCTTCAGTT GCTCGCTTCT 
10 ACftGGGAACA CTTGTGAGCA CACAGAGCAG 
CCCCCTGTCT ACTGACCAAT CAGTCTQGCA 
TCCAAAGAGC TAGCCGTCCT CCACCCAGTA 

15 

GGTAATATTC TTTATGTATA ATAAATTTTT 
TACTCTCCCT TGTGGGTCCC TTGTCTGGCT 
20 ACAGACCTGG CTGAACCTTA GTTTCTTCAT 
AGCTTTTAGG GCAGATTTGC CATGGCATAT 
ATTGCCAATA TCCTATTATT TCTGTGTAAA 

25 



TCCCACAGCC TCTAACTCAG TGTTTTGTAC 840 

TCGATAGTTT GTGGAAATCA TCCAATTAAA 900 

TCAGAGACAA QCCTGGAAGG GGCACCTTAG 960 

GGGTCTGCTC CCTTCGAAGG CXTCAGATAAG 1020 

CATCTGATGC CCTGTOGTGT TTGQCATCTG 1080 

TGAGGCCCAC GCCACCCAAA CCTTTCACTT 1140 

CCATGTCCTA GCCTGTCTGC ATTTGTTAGT 1200 

ATACCXy^AGC CATTGATGTA CTTTTCCTrG 1260 

TGGCTGAACC CCAAAATGCT TTGGGGTTOG 1320 

CTATGAAATG GGAATATGAA TTACTGCAGC 1380 

ACAAGGTAAC TACCATAGTG CTCCTTGGGT 1440 

ATGAAGATAC TGATTGTTTT GAG 1493 



30 



(2) INFORMATION FOR SEQ ID NO: 306: . 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEJGTH: 577 base pairs 
<B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
35 (D) TOPdJOGY: linear 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

AATTCGGCAG AGGNATTATA TACACTATAC TGGCATTTAC TGTTTCACCC AGCCCGGAAA 60 

CnCAGAGATG TATATTGGAA AATTTACAAC TCCATCTACA TTGGTTCCCA GGACGCTCTC 120 

ATAGCACATT ACCCAAGAAT CTACAACGAT GATAAGAACA CCTATATTCG TTATGAACTT 180 

45 GACTATATCT TATAATTTTA TTGTTTATTT TGTGTTTAAT GCACAQCTAC TTCACACCTT 240 

AAACTTGCTT TGATTTGGTG ATGTAAACTT TTAAACATTG CAGATCAGTG TAGAACTGGT 300 

CATAGAGGAA GAGCTAGAAA TCCAGTAGCA TGATTTTTAA ATAACCTGTC TTTGTTTTTG 360 

ATOTTAAACA CTAAATCCCA GTAGTGACCA AGAACACAGT GATTATATAC ACTATACTGG 420 

AGGGATTTCA TTTTTAATTC ATCTTTATGA AGATTTAGAA CTCATTCCTT GTGTTTAAAG 480 

55 GGAATOTTTA ATTGAGAAAT AAACATTTGT GWACAAAATG YTAAAAAAAA AAAAAAAAAA 540 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AACTCGA 577 

60 



50 
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(2) INFORMATION FOR SEQ ID NO: 307: 

(i) SECPENCE CHARACTERISTICS: 
5 (A) LEIXSTH: 2860 base pairs 

(B) TYPE: nucleic acid 

(C) STEU^NDEDNESS: double 

(D) TOPOLOGY: linear 

10 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307: 



GTGTNC5ACCG CTCTCNCAAT ATGGCTCCCC CGGGCTQGCA GRWRKTCRGT CWCKRGTGGC 60 



TAGCCTGTCC TGACAGGGGA GAGTTAAGCT CCCGTTCTCC ACCtSTGCCGG CTGGCCAGGT 120 

15 

GGGCTGAGGG TGACCGAGAG ACCAGAACCT GCTTQCTQGA GCTTAGTGCT CAGAGCTGGG 180 



GAGGGAGGTT CCGCCGCTCC TCTGCTGTCA GCGCCGGCAG CCCCTCCCGG CTTCACTTCC 240 



20 TCCCGCAGCC CCTGCTACTG AGAAGCTCCG GGATCCCAGC AGCCGCCACG CCCTGGCCTC 300 



AGCCTGCGGG GCTTCCAGTC AGGCCAACAC CGACGCGCAC TGGGGAQGAA GACAGGACCC 360 



TTGACATCTC CATCTGCACA GAQGTCCTGG CTGGAACCGA GCAGCCTCCT CCTCCTAGGA 420 

25 

TGACCTCACC CTCCAGCTCT CCAGTTTTCA GGTTGGAGAC ATTAGATGGA GGCCAAGAAG 480 



ATGGCTCTGA GGCGGACAGA GGAAAGCTGG ATTTTGGGAG CGGGCTGCCT CCCATGGAGT 540 



30 CACAGTTCCA GGGCGAGGAC CGGAAATTCG CCCCTTCAGA TAAGAGTCAA CCTCCAACTA 600 



CCGAAAGGGA ACAGGTGCCA GTCAGCCGGA TCCAAACCGA TTTGACCGAG ATCGGCTCTT 660 



CAATGCGGTC TCCCGGGGTG TCCCCGAGGA TCTGGCTGGA CTTCCAGAGT ACCTGAGCAA 720 

35 

GACCAGCAAG TACCTCACCG ACTTCGGAAA TACACAGAGG GCTCCACAGG TAAGACGGCC 780 



TGATGAAGGC TGTGCTGAAA CCTTAAGGAC GGGGTCAATG CCTGCATTCT GCCACTGCTG 840 



40 CAGATCGACC GGGACTCTGG CAATCCTCAG CCCCTGGTAA ATGCCCAGTG CACAGATGAC 900 



TATTACCGAG GCCACAGCGC TCTGCACATC GCCATTGAGA AAGAGGAGTC TGCAGTGTGT 960 



GAAGCTCCTG GTGGAGAATG GGGCCAATGT GCATGCCCGG GTCTGCGGCG ACTTCTTCCA 1020 

45 

GAAGGGCCAA GGGACTTGCT TTTATTTCGG TGAGCTACCC CTCTCTTTGG CCGCTTGCAC 1080 



CAAGCAGTGG GATGTGGTAA GCTACCTCCT GGAGAACCCA CACCAGCCCG CCAGCCTGCA 1140 



50 GGCCACTGAC TCCCAGGGCA ACACAGTCCT GCATGCCCTA GTGGATGATC TCGGACAACT 1200 



CAGCTGAGAA CATTGCACTG GTGACCAGCA TGTATGATGG GCTCCTCCAA GCTKGGGSCC 1260 



SCCYTCTGCC CTACCGTGCA GCTTGAGGAC ATCCGCAACC TGCAGGATCT CACGCCTCTG 1320 

55 

AAGCTGGCCG CCAAGGAGGG CAAGATCGAG ATTTTCAGGC ACATCCTGCA GCGGGAGTTT 1380 



TCAGGACTGA GCCACCTTTC COGAAAGTTC ACCGAGTGGT GCTATGGGCC TGTCCGGGTG 1440 



60 TCGCTGTATG ACCTGGCTTC TGTGGACAGC TGTGAOGAGA ACTCAGTGCT GGAGATCATT 1500 
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GCCTTTCATT GCAAGAGCCC GCACCGACAC CGAATGC5TCG TTTTGGAGCC CCTGAACAAA 1560 

CTGCTGCAGG CGAAATGGGA TCTGCTCATC CCCAAGTTCT TCTTAAACTT CXrTGTGTAAT 1620 

5 

CTGATCTACA TOTTCATCTT CACCGCTGTT GCCTACCATC AGCCTACCCT GAAGAAGCAG 1680 



GCXXKXrCCTC ACXrrCSAAAGC C3GAGGTTCGA AACTCCATGC TC5CTGACGGG CXACATCCTT 1740 
10 ATCCTGCTAG GGG0C5ATCTA CCTCCTOGTG GQGCCAGCTG TOisTACTTCT GGCGQCX3CCA 1800 



CGTOPTCATC TQGATCTCGT TCATAGACAG CTACTTTQGA AATOCTCTTC CTGTTCCAQG 1B60 

CXTTQCTTCA CAGTGGTOTC CXAGGTGCTG TOTTTCCTGG GCCATOGAGT GGTACCTGCX: 1920 

15 

CCTGCTTGTG TCTGCGCTCG TGGCTGGGCT GGCTGAACCT GCTTTACTAA TACACGTQQC 1980 



GTTCCAOCAC ACAC3QCAGTC TACAGTTTCA TGWTCCCTGA AGCCCTGGTG AGCCTGAGCC 2040 



20 AGGAGGCTTG GCGCCCCGAA GCTCCTACAG GCCCCAATGC CACAGAGTCA GTGCAGCCCA 2100 



TGGAGGGACA GGAGGACGAG GGCAACGGGG CCCAGTACAG GGGTATCCTG GAAGCCTCCT 2160 



TGGAQCTCTT CAAATTCACC ATCGGCATGG GCGAGCTGGC CTTCCAGGAG CAGCTGCACT 2220 

25 

TCCGCGGCAT GGTGCTGCTG CTGCTGCTGG CCTACGTGCT GCTCACCTAC ATCCTGCTGC 2280 



TCAACATGCT CATCGCXXTTC ATGAAGCGAA CGTCACAGTG TCGCCACTGA CAGCTGGAGC 2340 



30 ATCIX3GAAGC TOCAGAAAGC CATCTCTGTC CTGGAGATGG AGAATGQCTA TTQGTGGTQC 2400 



AGGAAAAAGC AGCGGGCAGG TGTGATGCTG ACCGTTGGCA CTAAGCCCAG ATGGCAGCCC 2460 



CGATCAGCGC TCGTGCTTCA GGGTGGAGGA GGTGAACTGG GCTTCATC30G GAGCAGACGC 2520 

35 

TGCCTACX3CT GTGTGAGGAC CXOTCAGGGG CAGGTGTCCC TCGAACTCTC GAGAACCCTG 2580 



TCCTGGCTTC CCCTCCCAAG GAGGATGAGG ATQGTGCCTC TGAGGAAAAC TATGTGCCCG 2640 



40 TCCAGCTCCT CCAGTCCAAC TGATGGCCCA GATGCAGCAG GAGGCCAGAG GACAGAGCAG 2700 



AGGATCTTTC CAACCACATC TGCTGGCTCT GGGGTCCXrAG TGAATTCTGG TGGCAAATAT 2760 



ATATTTTCAC TAACTCAAAA AAAAAAAAAA AAAAAAAAAA AAAAVGAGGG GQGGCCCXSKT 2820 

45 

ASCCAAWTTC GCCXTTATAAG TGAGTGCCWA TTACGATAAA 2860 



50 

(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs 

55 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 
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CTGCTTGrrGT CTGCGCTGGT GCTQGGCTGG 
CAGCACACAG GCATCTACAG TGTCATGATC 
5 TTGGCGCCCC GAAGCTCCTA CAGGCCCCAA 
ACAGGAGGAC GAGGGCAACG GGGCCCAGTA 
CTTCAAATTC ACCATCGGCA TGGGCGAGCT 

10 

CATGGTGCTG CTGCTGCTGC TGGCCTACGT 
GCTCATCGCC CTCATGNAGC GAGACCGWCA 
15 GGAAQCTGCA GAAAGCCATC TCTOTCCTCG 
AGAAGCAGCG GGCAGGTGTG ATGCTGftCCG 
AGCGCTGGTG CTTCAGQGTG GAGGAGGrTCA 

20 

CGCTGTOTGA GGACCCGTCA GGGGCAGGTG 
CTTCCCCTCC CAAGGAGGAT GAGGATGGTG 
25 TCCTCCAGTC CAACTGATGG CCCAGATGCA 
TTTCCAACCA CATCTGCTGG CTCTGGGGTC 
TCACTAAMWM AAAAAAAAAA AAAAAAAAAA 

30 



540 

CTGAACCTGC TTTACTATAC ACOrGGCTTC 60 

CAGAAGCCCT QGTGAGCCTG AGCCAQGANN 120 

TCCCACAGAG TCAGTGCAGC CCATQGAGGG 180 

CAGGGGTATC CTGGAAGCCT CCTTGGAGCT 240 

GGCCTTCCAG GAGCAGCTGC ACTTCCGCGG 300 

GCTGCTCACC TACATCCTGC TGCTCAACAT 360 

ACAGTGTCGC CACTGACAGC TGGAGCATCT 420 

AGATGGAGAA TGGCTATTGG TGGTGCAQGA 480 

TTGGCACTAA GCCAGATGGC AGCCTCGATG 540 

ACTGQGCTTC ATGGGAGCAG ACGCTGCXITA 600 

TCCCTCX3AAC TCTCGAGAAC CCTGTCCTGG 660 

CCTCTGAGGA AAACTATGTG CCCGTCCAGC 720 

GCAGGAGGCC AGAGGACAGA GCAGAGGATC 780 

CCAGTGAATT CTGGTGGCAA ATATATATTT 840 

ACTCGA 876 



35 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 309: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2025 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : double 
40 (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

CATGACCCGC CTCATCCGAT CCCGCACAGC CTCTGGTTCC AGCGTCACTT CTCTGGATGG 60 

CACCCGCAGC CGCTCCCACA CCAGCGAGGG CACCCGAAGC CGCTCCCACA CCAGCGAGGG 120 

CACCCGCAGC CGCTCCCACA CCAGCGAGGG GGCCCACCTG GACATCACCC CCAACTCGGG 180 

50 TGCTGCTGGG AACASGCCGG GCCCAAGTCC ATGGAGGTCT CCTGCTAGGC GGCCTGCCCA 240 

GCTGCCGCCC CCGGACTCTG ATCTCTGTAG TGGCCCCCTC CTCCCCGGCC CCTTTTCGCC 300 

CCCTCCCTX3C CATACTCCGC CTAACTCGGT ATTAATCCAA AGCTTATTTT GTAAGAGTGA 360 

GCTCTOGTOG AGACAAATCA GGTCTATTAC GTGGGTGCCC TCTCCAAAGG CGGGGTGGCG 420 

G1GGACCAAA GGAAGGAAGC AAGCATCTCC GCATCGCATC CTCTTCCATT AACCAGTGGC 480 

60 CGGTTGCCAC TCTCCTCCCC TCCCTCAGAG ACACCAAACT GCCAAAAACA AGACGCGTAC 540 
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15 



541 



AGCACACACT TCACAAAGCC AAGCCTAGGC CX3CCCTGAGC ATCCTGGTTC AAACGGGTQC 600 

CTGGTCAGAA GGCCAGCCGC CCACTTCCCG TTTCCTCTTT AACTGAGGAG AAGCTGATCC 660 

5 

AGTTTCCGGA AACAAAATCC TTTTCTCATT TGGGGAGGGG GGTAATAGTG ACATGCAGGC 720 

ACXrrCTTTTA AACAGGCAAA ACAGGAAGQG GGAAAAGGTG GGATTCATGT CGAGGCTAGA 780 

10 GGCATTTQGA ACAACAAATC TACGTAGTTA ACTTGAAGAA ACCGATTTTT AAAGTTQGTG 840 

CATCTAGAAA GCTTTGAATG CAGAAGCAAA CAAGCTTGAT TTTTCTAGCA TCCTCTTAAT 900 

GTGCAGCAAA AGCAGGCRAC AAAATCTCCT GGCTTTACAG ACAAAAATAT TTCAQCAAAC 960 

GTTGGGCATC ATGGTTTTTG AAGGCTTTAG TTCTGCTTTC TGCCTCTCCT CCACAGCCCX: 1020 

AACCTCCCAC CCCTGATACA TGAGCCAGTG ATTATTCTTG TTCAGQGAGA AGATCATTTA 1080 

20 GATTTGTTTT GCATTCCTTA GAATGGAGGG CAACATTCCA CAGCTGCCCT GGCTGTGATG 1140 

AGTGTCCTTG CAGGGGCCX5G AGTAGGAGCA CTGGGGTGGG GGCGGAATTG GGGITACTCG 1200 

ATGTAAGGGA TTCCTTGTTG TTGTGTTGAG ATCCAGTGCA GTTGTGATTT CTGTGGATCC 1260 

25 

CAGCTTQGTT CCAGGAATTT TGTGTGATTG GCTTAAATCC AGTnTCAAT CTTOGACAGC 1320 

TGGGCTOGAA CGTGAACTCA GTAGCTGAAC CTGTCTGACC CX3GTCACGTT CTTGGATCCT 1380 

30 CAGAACTCTT TXXrTCTTGTC GQGGTGGGGG TGGGAACTCA CGTQGGGAGC GGTGGCTGAG 1440 

AAAATGTAAG GATTCTQGAA TACATATTCC ATQGGACTTT CCTTCCCTCT CCTGCTTCCT 1500 

CTTTTCCTGC TCCCTAACCT TTCGCCGAAT GGGGCAGCAC CACTGACGTT TCTQGGCGGC 1560 

CAGTGCX3GCT GCCAGGTTCC TGTACTACTG CCTTGTACTT TTCATTTTGG CTCACCGTGG 1620 

ATTTTCTCAT AGGAAGTTTG GTCAGAGTGA ATTGAATATT GTAAGTCAGC CACTGGGACC 1680 

40 CGAGGATTTC TGGGACCCCG CAGTTGGGAG GAGGAAGTAG TCXAGCCTTC CAGGTGGCGT 1740 

GAGAQGCAAT GACTCGTTAC CTGCXGCCCA TCACXTTTOGA GGCCTTCCCT GGCCTTGAGT 1800 

AGAAAAGTCG GGGATCGGGG CAAGAGAQGC TGAGTACGGA TGGGAAACTA TTGTGCACAA 1860 

GTCTTTCCAG AGGAGTTTCT TAATGAGATA TTTGTATTTA TTTCCAGACC AATAAATTTG 1920 

TAACTTTGCA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAACTC 1980 

50 GAGGGGGGCC CX3TACCCAAT TCGCCGTATA TGATCGTAAA CAATC 2025 



35 



45 



55 (2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3026 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNESS: double 
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10 



(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 
TAGGCAGCAC TGAAATATCC TAACCCCCTA AGCTCCAGGT GCCCTGTGGN ACGAGCAACT 60 

GGACTATAGC AGGGCTGGGC TCTGTCTTCC TGGTCATAGG CTCACTCTTT CCCCCAAATC 120 

TTCCTCTQGA GCTTTGCAGC CAAGGTGCTA AAAGGAATAG GTAGGAGACC TCTTCTATCT 180 

AATCCTTAAA AGCATAATGT TGAACATTCA TTCAACAGCT GATGCCCTAT AACCCCTGCC 240 

TOGATTTCTT CCTATTAGGC TATAAGAAGT AGCAAGATCT TTACATAATT CAGAGTGGTT 300 

15 TCATTGCCTT CCTACCCTCT CTAATGGCCC CTCCATTTAT TTGACTAAAG CATCACACAG 360 

TX3GCACTAGC ATTATACCAA GAGTATGAGA AATACAGTGG TTTATGGCTC TAACATTACT 420 

GCCTTCAGTA TCAAGGCTGC CTGGAGAAAG GATGGCAGCC TCAGQGCTTC CTTATGTCCT 480 

20 

CCACCACAAG AGCTCCTTGA TGAAGGTCAT CTTTTTCCCC TATCCTGTTC TTCCCCTCCC 540 

CGCTCCTAAT GGTACGTGGG TACCCAGGCT GGTTCTTGGG CTAGGTAGTG GGGACCAAGT 600 

25 TCATTACCTC CCTATCAGTT CTAGCATAGT AAACTACGGT ACCAGTGTTA GTGGGAAGAG 660 

CTCQGTTTTC CTAGTATACC CACTGCATCC TACTCCTACC TGGTCAACCC GCTGCTTCCA 720 

GGTATOGGAC CTGCTAAGTG TGGAATTACC TGATAAGGGA GAGGGAAATA CAAGGAGGGC 780 

CTCTOGTGTT CCIGGCCTCA GCCAGCTGCC CACAAGCCAT AAACCAATAA AACAAGAATA 840 

CTGAGTCAGT TTTTTATCTG GGTTCTCTTC ATTCCCACTG CACTTGGTGC TGCTTTGGCT 900 

35 GACTGGGAAC ACCCCATAAC TACAGAGTCT GACAGGAAGA CTQGAGACTG TCCACTTCTA 960 

GCTCGGAACT TACTGTGTAA ATAAACTTTC AGAACTGCTA CCATGTUIGTG AAAATGCCAC 1020 

ATTITCCTTT ATAATTTCTA CCCATGTTGG GAAAAACTGG CTTTTTCCCA GCCCTTTCCA 1080 

40 

GGGCATAAAA CTCAACCCCT TCGATAGCAA GTCCCATCAG CCTATTATTT TTTTAAAGAA 1140 

AACTIGCACT TGTTTTTCTT TTTACAGTTA CTTCCTTCCT GCCCCAAAAT TATAAACTCT 1200 

45 AAGTGTAAAA AAAAGTCTTA ACAACAGCTT CTTGCTTGTA AAAATATGTA TTATACATCT 1260 

GTATTTTTAA ATTCTQCTCC TGAAAAATGA CTGTCCCATT CTCCACTCAC TGCATTTGGG 1320 

GCCTTTCCCA TTGGTCTGCA TGTCTTTTAT CATTGCAGGC CAGTGGACAG AGGGAGAAGG 1380 

GAGAACAGGG GTCGCCAACA CTTGTGTTGC TTTCTGACTG ATCCTGAACA AGAAAGAGTA 1440 

ACACTCAGGC GCTCGCTCCC ATGCACAACT CTCCAAAACA CTTATCCTCC TGCAAGAGTG 1500 

55 GGCTTTCCAG GGTCTTTACT GGGAAGCAGT TAAGCCCCCT CCTCACCCCT TCCTTTTTTC 1560 

TTTCTTTACT CCTTTGGCTT CAAAGGATTT TGGAAAAGAA ACAATATGCT TTACACTCAT 1620 

rrrCAATTTC TAAATTTGCA GGGGATACTG AAAAATACGG CAGGTGGCCT AAGGCTGCTG 1680 



30 



50 
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TAAAGTTGAG GGGAGAGGAA ATCTTAAGAT 
AAAGAACAAT AGAACTGGTC TTCCATTTTG 
5 CTQGAGACAG TAACATTTCA TTAACCAAAG 
AGTACrCAQG CCACTCCAAT CACCCTACAA 
CCTTAAACTG AOGCTAGNMA ATAAACCTGG 

10 

CCATCTGTGA GGTGAYAGGC AAGGATGAAA 
AAAGGAGATC ATTTAGTTGG GTCTGAAAGG 
15 AGTACCTGTA AGCATTTTAG GTCCCAGAAT 
TAATGTOCTT TCCCTGGAGT CAGTTTTTTT 
AATTCTAAAA GAGAAGGGAG CTGAGGCCAT 

20 

AAAAGATTCA AAGCTCTAAT AGAGTCACAG 
AGTACAATAA GCAGAGGTGG AAAATGATCT 
25 ATTTATAAAT TTGAAATCCA AACTACTTTC 
GGACAGGAAA TATGTCCCCC CCTAACTTTC 
AGATCATTCT ACAAGTAATT TTGCACAGAC 

30 

CACCCAAGGT CANCCAAACA ACTTGGTTGT 
GGOGATTAGC TAGACTAGGA GACCCAGAAG 
35 TTGGCOTGTC AGftGCTTGAT TAGAAGCCAA 
GGAAAAACCT GTGGGTTGrTG CTAATTTCTG 
GGGTGCATGG AQGAATTGGG ACCTGGTTAT 

40 

ATGTAAAACA GAATATTCTG TAAACCTAAT 
AAAATATTCA ATAAGAAGTC AAAAAAAAAA 
45 AATTTNCCAA ATAGAGATNG TATTAC 



TACAAGATAA AAAACGAATC CCCTAAACAA 1740 

CCACCTTTCC TGTTCATGAC AGCTACTAAC 1800 

AAAGTQGGTC ACCTGACCTC TGAAGAGCTG 1860 

GATGCCAAGG AGGTCCCAGG AAGTCCAGCT 1920 

OCAAGTGAGG CAAGAGAAAT GAGGAAGAAT 1980 

GACAAAGAAG GAAAAGAGTA TCAAAQGCAG 2040 

AAAAGTCTTT GCTATCCX3AC ATGTACTGCT 2100 

GGAAAAAAAA ATCAGCTATT GGTAATATAA 2160 

AAAAAGTTAA CTCTTAGTTT TTACTTOTPr 2220 

TCCCTGTAGG AGTAAAGATA AAAGGATAGG 2280 

CTTTCCCAGG TATAAAACCT AAAATTAAGA 2340 

AGTTCCTGAT AGCTACCXy^ AGAGCAAGTC 2400 

TTAATATCAC TTTQGTCTCX: ATTTTTCCCA 2460 

TTGCTTCAAA AATTAAAATC CAGCATCCCA 2520 

ATCTCCTCAC CCXAGTGCCT GTCTGGAGCT 2580 

GAACCX:y\ACT GCCTTAACCT TCTGGGGGAG 2640 

TGAATGGGAA AGGGTGAGGA CTTCACAATG 2700 

GACAGTQGCA GCAAAGGAAG ACTTGGCCCA 2760 

TCCAGAAAAT AQGGTGGACA GAAGCTTGTG 2820 

GTTGTTATTC TCGGACTGTG AATTTrQGTG 2880 

GTCTGTATAA ATAATGAGCG TTAACACAGT 2940 

AAAAAAAACT CGAGGQGGGG CCCQGTACCC 3000 

3026 



50 (2) INFORMATIC»I FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 712 base pairs 

(B) TYPE: nucleic acid 
55 (C) STRANDEDNESS: 'double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 



60 



GCAGGCTTTG TGCTCACCTA CAAGCTGGGT GAGCAGGGTG CCAGCAGCCT GTTTCCTCTT 



60 
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CTCCTGCTGG ACCACGGCGT TTCTGCTCCC GAGTTGGGAC TGTGGAATGG TGTGGGTGCT 120 

GTGGTCTGCT CCATCGCTGG CTCCTCCCTG GGTGGGACCT TGCTGGCCAA GCACTGGAAA 180 

5 

CTGCTGCCTC TGTTGARGTC GGTGCTQCGC TTCCGCCTCG GGGGCCTAGC CTGTCAGACT 240 

GCCTTGGTCT TCCACCTGGA CACCTTQGGG GCCAGCATGG ACGCTQGCAC AATCTTGAGA 300 

10 GGGTCAGCCT TGCTGAGCCT ATGTCTGCAG CACTTCTTGG GAGGCCTGGT CACCACAGTC 360 

ACCTTCACTG GGATGATGCG CTCCAGCCAG CTGGCCCCCA GQGCCTGCAG GCCACACACT 420 

ACAGCCTTCT GQCCACXXrTG GAGCTGCTGG QGAAGCTGCT GCTGGGCACT CTGCQGAGGC 480 

15 

CTGGCTGATG QGTTQGGGCC ACATCCCTGC TTCTTGCTCC TGCTCATCCT CTCTQCCTTT 540 

CCCGTTCTGT ACCIX3GACCT AGCACCCAGC ACCTTTCTCT GAGCTGAGTG GCTGGAGTGG 600 

20 TCAATAAAGC CACATGTGCC TGTC3GCCCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 660 

AACTGGAGQG GGC3GCCCGGT ACCCAAATCG CCGGATATGA TCGTAAACAA TC 712 



25 



(2) INFORMATION FOR SEQ ID NO: 312: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: double 

(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

CAAAATTTCA GAACTTTCAG GAGGGCAAGA GAATATCAAA CAAAGATTTC TGGAAGTATT 60 

TTGCCAACCT TCTGGTTGAG CTGCAAGAAA ATATTTATGG TGAGAACTTT TCTGTTTCCC 120 

40 

GTTATTGGGT TTTTGGTTGG TTTTTGTTTG TTTTTTACTA TGCTTTQGTC TGTAAAAATA 180 

TGCAACTGAA CTACATTCAG AAGGAAATAT TGTCTACATA GAATATTATA TGAAGTTGGT 240 

45 ACATAATTCT GATGAGGAAA AAAAATCTTT GCAATTCTTT AAGCCATATT GTTGTTTTTC 30O 

TGTGTTGTTT TCCCTGGATG AAAATATCAG TATTAAGTAG ACAGCATATT ATTCAAGTGT 360 

TTAGACTTAT TAATATGTTC TTGTCCTGTA TTTATACATA TGTGTATTTT GGAAAGTATT 420 

50 

GCCTTTTTTA AGGGAAGCTA TAATTCGATA CATAGTGAAA AAGGGAATGG TGACCCCTTT 480 

GTGCCTCTTC CACTGAGGAT AACAAACAGC ATTGTAATCC ATTCTCTTGC ACCTTCTTCT 540 

55 TCTTATCTTG TTATTACGGT TTTATTAATT TTGTAGAGGG ACAGGGAGTG GGCAAGQGGA 600 

AGAAGCAGCT TATTTGACTA ACCAGCCCCT CTGTQGTCCA CCAGCGTCTT GGCTTGGTGG 660 

GAGGGCTCTC AATCAGCAGG GCCCCAGGAG GGAAGAAGAA GTGGGGCAAA GCCTGGCCTC 720 
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GCCGCTCGGG AGCTTTGCCA TCTGAGCX::AC 
GAAATGTCAA CCGGAGCCCT TACACCAGCC 
5 TAAACGAATA TTTMTCCAA CCTCACTACA 
ATGGGGGTGT TCCAGCCTTC AGCGAGATGG 
GGCTTATCCT TCTCTGTTGC CAACCTTGCC 

10 

CCCGTCCGTG TCTGTGTCTG TTCCATACGTC 
CX3TGGGCCCA GCTCGGAAGG TGCGTGGAGA 
15 GGC5ATGGCAT TCCGTTGTGT GCCTTATTCC 
AATATAGCCT CTTCATGCTG TATTAAAAGG 
CTTGAGQQGG GGNCCGGTAC CCAATTWTC 

20 



GCCTCCTCCA GGCCATGCTC CTTGAACTTG 780 

CTCCAGCATC TAATAGACTT GAATCTACTC 840 

TTGTAGCTCA GTCCAACGAC TAACCCTGAA 900 

CCAAGCQGTC CCCTGGQGGC TGTGGCAGCG 960 

GTCCXSACXrrC CTCCGCCCCC ATGCGGTGAC 1020 

TGAGTCCAGC TAAAAAGACA AAACAGAACX: 1080 

AGGCTCCGAC GTCTCCGAAG TGCAGCCTTT 1140 

TGGAGAATCT GTATACGGCT CGCCTATAGA 1200 

ACTTTTAAAA OCAAAAAAAA AAAAAAAAAA 1260 

1289 



25 



(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTICMa : SEQ ID NO: 313: 

Met Phe Leu lie Phe Val Tyr Phe Leu Lys lie Leu Phe Ser Ser Ser 
15 10 15 

35 Leu Pro Phe Leu Trp Leu 
20 



40 (2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L^XTTH: 128 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Met Met Phe Leu Thr Gin Gly Gly Pro Leu Pro Ser Thr Arg Ala Arg 
15 10 15 

Pro Thr Cys Gin Ala Gly Ala Leu Pro Lys Pro Ser Gly Leu Leu Gly 
20 25 30 



50 



Val Thr Cys Trp Asn Gly Leu Lys Gly Pro Leu Cys Gly Asn Arg Cys 
55 35 40 45 

Ser Pro Asn Thr Leu Leu Leu Ala Ala Arg Gin Ala Leu Trp Lys Gly 
50 55 60 

60 Arg Gly Arg Thr His Gin Asp Leu Pro Gly Pro Leu Gin Gly Arg Gin 



wo 98/39448 



PCT/US98/04493 



546 



65 



70 



75 



80 



Leu Gly Pro Glu Pro Lys His Leu Ala Leu Leu Pro Pro Arg Gly Gin 
85 90 95 

Glu Ala Ser Trp Ala Ser Ser Leu Pro Gly Gin Gly Pro Leu Pro Leu 
100 105 110 



Pro His lie Asn Cys Thr Val Phe Ser Leu Lys Ala Ser Phe lie Lys 
10 115 120 125 



15 



(2) INFORMATICS FOR SEQ ID NO: 315: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) I-ENGTH: 28 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 315: 

25 Met Gin Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu 
15 10 15 

Cys Asp Val Pro lie Ser Leu Gly Phe Ser Pro Ser 
20 25 

30 



(2) INFORMATION FOR SEQ ID NO: 316: 

35 (i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 64 amino acids 
{B) TYPE: amino acid 
(D) T0P0LCX3Y: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 316: 

40 

Met Asp Gly Phe Ser Ser Arg Leu Phe Ser Ser Leu Pro Phe Val Ala 
15 10 15 

Leu Gin Trp Phe He Val He Ser His Leu Leu Ser Leu Ser Leu Ser 
45 20 25 30 

Ala Cys Cys Tyr Gin Thr His Cys Ser Leu Xaa Gin Leu Ser Ser Ala 
35 40 45 

50 Phe Ser Xaa Met Gly Glu Ser Cys Val Gly Glu Arg Glu Tyr Xaa Phe 
50 55 60 



55 



(2) INFORMATION FOR SEQ ID NO: 317: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 amino acids 

(B) TYPE; amino acid 
(D) TOPOIiOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

5 

Met Pro Leu lie Asn Leu Leu Leu Leu Tyr Tyr Val Pro Asn Gly Gly 
1 5 10 15 

Lys Gin Asp Lys Lys 
10 20 



15 



30 



(2) INFORMATION FOR SEQ ID NO: 318: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

Met Gly Arg His Leu Val Leu Val Met Phe lie Thr Thr Ser Leu His 
15 10 15 

25 Ser Gly Thr Pro Val Pro Glu Asn Val lie Cys Gly Val Thr Lys Gly 
20 25 30 



Pro Gin Gly Lys Lys Lys Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 319: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LErK5TH: 33 amino acids . 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

40 

Met Leu Trp Trp Ser Arg Asp Tyr Thr Met Val Phe Leu Leu Phe Thr 
1 5 10 15 

Met Val Phe Thr Gly Asp Leu Val He Arg Gly Arg Thr Glu Leu Ser 
45 20 25 30 

Leu 



50 

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

60 Met Val Cys Ser Ser Leu Cys Asp He Gly Gly He He Thr Pro Phe 
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10 



15 



lie Val Phe Arg Leu Arg Glu Val Trp Gin Ala Leu Pro Leu He Leu 
20 25 30 

5 

Phe Ala Val Leu Gly Leu Leu Ala Ala Gly Val Thr Leu Leu Leu Pro 

35 40 45 

Glu Thr Lys Gly Val Ala Leu Pro Glu Thr Met Lys Asp Ala Glu Asn 
10 50 55 60 

Leu Gly Arg Lys Ala Lys Pro Lys Glu Asn Thr He Tyr Leu Lys Val 
65 70 75 80 



15 Gin Thr Ser Glu Pro Ser Gly Thr 
85 



20 (2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 cutiino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

Met Gin Pro Gly Ala Gly Val Leu Val Leu Gly Leu Leu Leu Pro Pro 
15 10 15 

30 

Pro Gin Ser Pro Ser Leu Ser 
20 



35 

(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

Met Thr Phe Thr Leu Gly Asp Ser Gin Val Leu Leu He Asn Leu Phe 
45 1 5 10 15 

Pro Ser Met Pro Ser Gly Ser Cys Ala Arg Pro 
20 25 

50 

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

60 Met Cys Leu Glu Cys Trp Ala Glu Asn Leu Gly Pro His His Thr Ser 
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10 



15 



Ser Leu Leu Asn Pro Arg His Leu Pro Ser lie Pro Ala Met Phe Pro 
20 25 30 

Val Ser Ser Gly Cys Phe Gin Glu Gin Gin Glu Met Asn Lys Ser Leu 
35 40 45 



Val Ser Cys Leu Phe Val Leu His Phe Val Leu His Cys lie Phe Xaa 
10 50 55 60 



15 



(2) INFORMATION FOR SEQ ID NO: 324: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 196 andno acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

25 Met Leu Ser Thr Ser Glu Tyr Ser Gin Ser Pro Lys Met Glu Ser Leu 
15 10 15 

Ser Ser His Arg lie Asp Glu Asp Gly Glu Asn Thr Gin He Glu Asp 
20 25 30 



30 



Thr Glu Pro Met Ser Pro Val Leu Asn Ser Lys Phe Val Pro Ala Glu 
35 40 45 



Asn Asp Ser He Leu Met Asn Pro Ala Gin Asp Gly Glu Val Gin Leu 
35 50 55 60 

Ser Gin Asn Asp Asp Lys Thr Lys Gly Asp Asp Thr Asp Thr Arg Asp 
65 70 75 80 

40 Asp He Ser He Leu Ala Thr Gly Cys Lys Gly Arg Glu Glu Thr Val 

85 90 95 

Ala Glu Glu Val Cys He Asp Leu Thr Cys Asp Ser Gly Ser Gin Ala 
100 105 110 

45 

Val Pro Ser Pro Ala Thr Arg Ser Glu Ala Leu Ser Ser Val Leu Asp 

115 120 125 

Gin Glu Glu Ala Met Glu He Lys Glu His His Pro Glu Glu Gly Ser 
50 130 135 140 

Ser Gly Ser Glu Val Glu Glu He Pro Glu Thr Pro Cys Glu Ser Gin 
145 150 155 160 

55 Gly Glu Glu Leu Lys Glu Glu Asn Met Glu Ser Val Pro Leu His Leu 
165 170 175 



Ser Leu Thr Glu Thr Gin Ser Gin Gly Leu Cys Leu Arg Arg His Pro 
180 185 190 
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Lys Lys Lys Lys 
195 



5 

(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE EHSCRIPTICW: SEQ ID NO: 325: 

Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg Lys 
15 1 5 10 15 

Arg Leu Leu Glu Gin Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu 
20 25 30 

20 Ala Xaa Xaa Gly lie Gly Leu Met Val Leu His Ala Glu Met Leu Trp 
35 40 45 

Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr 
50 55 60 

25 

Leu Trp Leu lie Pro lie Thr Phe Leu Thr He Gly Tyr Gly Asp Val 
65 70 75 80 

Val Pro Gly Thr Met Trp Gly Lys He Val Cys Leu Cys Thr Gly Val 
30 85 90 95 

Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys 
100 105 110 

35 Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp 
115 120 125 

He Gin Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gin 
130 135 140 

40 

Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala 
145 150 155 160 

Ala Arg Xaa His Gin Arg Xaa Leu Leu Ala Ala He Asn Ala Phe Arg 
45 165 170 175 

Gin Val Arg Leu Lys His Arg Lys Leu Arg Glu Gin Val Asn Ser Met 
180 185 190 

50 Val Asp He Ser Lys Met His Met He Leu Tyr Asp Leu Gin Gin Asn 
195 200 205 

Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gin He Asp Thr Leu Ala 
210 215 220 

Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro 
225 230 235 240 



Arg Gin Leu Pro Glu Pro Ser Gin Gin Ser Lys Xaa 
60 245 250 
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(2) INFORMATICS FOR SEQ ID NO: 326: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENC3TH: 68 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val 
15 10 15 

15 lie Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala 
20 25 30 



20 



Val lie Lys Asn Asn Ser His Tyr Gin Thr Ser Lys Ala Leu Glu Leu 
35 40 45 

Glu Lys Thr Thr Glu Asn Lys Glu Ser Asn Pro Phe He Leu Gin Val 
50 55 60 



Asn Lys Leu Xaa 
25 65 



30 



(2) INFORMATION FOR SEQ ID NO: 327: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

Met Gly Glu Gly Lys Asn Gly Phe Gly Gly Phe Val His Thr Ala Asp 
15 10 15 

40 Ala Cys Trp Glu Gly Val His Ser Glu Pro Val Cys Arg Thr Val His 
20 25 30 

Thr Val His Thr Cys His His Gin Ala Phe Leu Val Leu He Gly Trp 
35 40 45 

45 

Ser Lys Ser Gly Lys Glu Arg Lys Glu /Via Phe Leu Thr Ala He He 
50 55 60 

Leu Asn Ser Arg Ser He His He Ser Cys Ser Trp Pro Pro Ser Pro 
50 65 70 75 80 

Val Pro Gin Xaa 



55 

(2) INFORMATIC»J FOR SEQ ID NO: 328: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 



wo 98/39448 



552 



PCT/US98/04493 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

5 Met Leu Leu lie Asn Leu Leu Trp Leu Val Thr Met He Lys Ser Val 
15 10 15 

He Asn Asn Asn He He Leu Phe Leu Lys Lys Lys Ser Leu Phe Phe 
20 25 30 

10 

He Asp Ser Val 
35 



15 

(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) hENSTH: 63 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

Met Thr Phe Pro Phe Glu Lys Lys He Val Ala Phe Ser Ala Phe Tyr 
25 1 5 10 15 

Leu He Pro Gly Glu Ser Arg Leu Ala Pro Thr Phe Asn Pro Ser Ala 
20 25 30 

30 Asp Met Thr Val He Leu Arg Gly Arg Ala Gin His Lys Thr Ala Met 
35 40 45 

Leu Glu Ser Tyr Asn Trp Lys Val Ser Cys Gin Leu Arg Glu Xaa 
50 55 60 

35 



(2) INFORMATION FOR SEQ ID NO: 330: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

45 

Met His Ser Lys Gly Ser Ser Leu Leu Leu Phe Leu Pro Gin Leu He 
15 10 15 

Leu He Leu Pro Val Cys Ala His Leu His Glu Glu Leu Asn Cys Cys 
50 20 25 30 

Phe His Arg 
35 

55 

(2) INFORMATION FOR SEQ ID NO: 331: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»I: SBQ ID NO: 331: 

5 Met Gly Ala Leu Val Leu Leu Leu Cys Leu Leu Val Gly Val Gin Gin 
15 10 15 

Ser Gly Ser Val Trp Asp Ser 
20 

10 



(2) INFORMATION FOR SEQ ID NO: 332: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 



20 



Met Gin Ser Ala Glu He Leu Ser Trp Thr Asp Val Leu His Asp Phe 
15 10 15 



Leu Phe Ser Leu Phe Leu Trp Pro Ala Phe Glu Asp Arg Ala Leu Leu 
25 20 25 30 



30 



He Phe Thr Leu Asn Gin He Val 
35 40 



(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 111 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

40 Met Gin Ser Leu Val Gin Trp Gly Leu Asp Ser Tyr Asp Tyr Leu Gin 
15 10 15 

Asn Ala Pro Pro Gly Phe Phe Pro Arg Leu Gly Val He Gly Phe Ala 
20 25 30 

45 

Gly Leu He Gly Leu Leu Leu Ala Arg Gly Ser Lys He Lys Lys Leu 
35 40 45 

Val Tyr Pro Pro Gly Phe Met Gly Leu Ala Ala Ser Leu Tyr Tyr Pro 
50 50 55 60 

Gin Gin Ala He Val Phe Ala Gin Val Ser Gly Glu Arg Leu Tyr Asp 
65 70 75 80 

55 Trp Gly Leu Arg Gly Tyr He Val He Glu Asp Leu Trp Lys Glu Asn 
85 90 95 



Phe Gin Lys Pro Gly Asn Val Lys Asn Ser Pro Gly Thr Lys Xaa 
100 105 110 
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(2) INFORMATION FOR SEQ ID NO: 334: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

10 

Met Ala Pro Ser Leu Leu Leu Leu Ala Pro Leu Cys Ser Leu Glu Ala 
1 5 10 15 

Val Leu Ser Ser Pro Leu Glu Lys Gin Cys Gin Leu Pro Gly He Phe 
15 20 25 30 

Cys Gin Leu Gin Leu Pro Cys Pro Leu Leu Leu Ser Ala Gin Leu Leu 
35 40 45 

20 Lys Gly He Val Xaa Pro Arg Cys Pro Ala Ser Leu Pro Gin Pro Pro 
50 55 60 

His Pro Ala Pro Ser Trp His Leu Pro Leu His Cys Thr Glu Arg Xaa 
65 70 75 80 

25 

Pro His His Leu Pro Leu Gin Gly Gly Ser Ser Asn Met Glu Glu Xaa 
85 90 95 

Asn Tyr Arg Gly Tyr Xaa Asp Ala Gin I*eu 
30 100 105 



35 



(2) INFORMATION FOR SEQ ID NO: 335: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Met Thr Thr Cys Leu Phe Gly Leu Leu Ser Cys Glu Met Ser Ala Gin 
15 10 15 

45 Val Ser Gin Lys Ser Cys Val Tyr Asp Glu Ser Glu Cys Phe Ser Ser 
20 25 30 

Val Gly Gin Leu Leu Ala Leu Leu lie Leu Val Tyr Val Leu Pro Ser 
35 40 45 



50 



He Xaa 
50 



55 

(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 
60 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

Met Leu Trp Lys Cys Ser Gin Asn lie Ala Arg Cys Leu Leu Leu Leu 
5 1 5 10 15 

Leu Ala Leu Val Glu He Lys Leu Glu Asp Leu Gin Ser Gin Leu His 
20 25 30 

10 Pro Thr Trp Lys Ser He Pro Gly Pro Ser Pro Arg Asn Gin His Arg 
35 40 45 



15 



(2) INFORMATION FOR SEQ ID NO: 337: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

25 

Met Leu He Pro Leu Gin Cys Leu Phe Ser Ser Asp Arg Met Leu Thr 
15 10 15 

Phe Leu Thr Pro Trp Gin Lys Gly Glu Lys Cys Val Leu Gly Trp Val 
30 20 25 30 

Thr Lys Phe Leu Ser Glu He Ser Xaa 
35 40 

35 

(2) INFORMATION FOR SEQ ID IK): 338: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 76 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 



45 Met Thr Phe Ser Ser Leu Lys 
1 5 

Gly Leu Glu Arg Phe He He 
20 

50 

Gin Arg Ser Leu Ser Ser Asn 
35 

Thr Phe Lys Asp Val Leu Val 
55 50 55 



Leu Phe Val Leu Thr Cys He He Lys 
10 15 

Leu Arg Glu Val Cys Asn Gin Glu He 
25 30 

Leu Val His Val Leu Leu Gin Pro Ala 
40 45 

Thr Glu He He Cys Leu Cys Met Cys 
60 



Leu Tyr Ser He Lys Tyr Met Pro Pro Gin Lys Lys 
65 70 75 



60 
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(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 
5 <A> LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 

10 Lys Val Tyr lie Phe Leu lie Phe Met Val Leu He Leu Pro Ser Leu 
15 10 15 



15 



Gly Leu Thr Arg Tyr Met Pro Pro Xaa Ser Xaa Leu Asn Ser Glu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 340: 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



25 



35 



50 



Met Ala Lys He Ser Pro Phe Glu Val Val Lys Arg Thr Ser Val Pro 
15 10 15 



Val Leu Val Gly Leu Val He Val He Val Ala Thr Glu Leu Met Val 
30 20 25 30 



Pro Gly Thr Ala Ala Ala Val Thr Gly Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 341: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

45 Met Arg Leu Phe Phe He Gly Phe Leu Leu Leu Phe Ser Phe Gly Leu 
15 10 15 



Leu Arg Gin Pro Ser Leu Ser Ala Glu His 
20 25 



(2) INFORMATION FOR SEQ ID NO: 342: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SBQUEtKTE DESCRIPTION: SEQ ID NO: 342: 
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Met Val Phe Ser Val Ser Ser Ala Leu Ala Leu Leu Leu Met Leu Leu 
15 10 15 

Arg Ser Ser Asp Leu Ala Lys Lys Thr Glu 
5 20 25 



(2) INFOPMATION FOR SEQ ID NO: 343: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 343: 

Met Ser Leu Glu Phe Tyr Gin Lys Lys Lys Ser Arg Trp Pro Phe Ser 
15 10 15 

20 Asp Glu Cys lie Pro Trp Glu Val Trp Thr Val Lys Val His Val Val 
20 25 30 

Ala Leu Ala Thr Glu Gin Glu Arg Gin He Cys Arg Glu Lys Val Gly 
35 40 45 



25 



40 



Glu Lys Leu Cys Glu Lys He He Asn He Val Glu Val Met Asn Arg 
50 55 60 



His Glu Tyr Leu Pro Lys Met Pro Thr Gin Ser Glu Val Asp Asn Val 
30 65 70 75 80 

Phe Asp Thr Gly Leu Arg Asp Val Gin Pro Tyr Leu Tyr Lys He Ser 
85 90 95 

35 Phe Gin He Thr Asp Ala Leu Gly Thr Ser Val Thr Thr Thr Met Arg 
100 105 110 



Arg Leu He Lys Asp Thr Leu Pro Ser Glu Arg Arg Trp He Ser Gly 
115 120 125 

Ser Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly He Ala Leu Leu 
130 135 140 



Gly Leu Trp Ala Leu Glu Pro Ala Leu Gly His Trp Xaa 
45 145 150 155 



(2) INFORMATION FOR SEQ ID NO: 344: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

Met Phe Leu Leu Pro Leu Pro Ala Ala Gly Arg Val Val Val Arg Arg 
15 10 15 



60 



Leu Ala Val Arg Arg Phe Gly Ser Arg Ser Leu Ser Thr Ala Asp Met 
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20 



25 



30 



Thr Lys Gly Leu Val Leu Gly He Tyr Ser Lys Glu Lys Glu Asp Asp 
35 40 45 

5 

Val Pro Gin Phe Thr Ser Ala Gly Glu Asn Phe Asp Lys Leu Leu Ala 
50 55 60 

Gly Lys hexi Arg Glu Thr Leu Asn He Ser Gly Pro Pro Leu Lys Ala 
10 65 70 75 BO 

Gly Lys Thr Arg Thr Phe Tyr Gly Leu His Gin Asp Phe Pro Ser Val 
85 90 95 

15 Val Leu Val Gly Leu Gly Lys Lys Ala Ala Gly He Asp Glu Gin Glu 

100 105 110 



20 



25 



30 



35 



40 



Asn Trp His Glu Gly Lys Glu Asn He Arg Ala Ala Val Ala Ala Gly 
115 120 125 

Cys Arg Gin He Gin Asp Leu Glu Leu Ser Ser Val Glu Val Asp Pro 
130 135 140 

Cys Gly Asp Ala Gin Ala Ala Ala Glu Gly Ala Val Leu Gly Leu Tyr 
145 ^ 150 155 160 

Glu Tyr Asp Asp Leu Lys Gin Lys Lys Lys Met Ala Val Ser Ala Lys 
165 170 175 

Leu Tyr Gly Ser Gly Asp Gin Glu Ala Trp Gin Lys Gly Val Leu Phe 
180 185 190 

Ala Ser Gly Gin Asn Leu Ala Arg Gin Leu Met Glu Thr Pro Ala Asn 
195 200 205 

Glu Met Thr Pro Thr Arg Phe Ala Glu He He Glu Lys Asn Leu Lys 
210 215 220 

Ser Ala Ser Ser Lys Thr Glu Val His He Arg Pro Lys Ser Trp He 
225 230 235 240 

Glu Glu Gin Ala Met Gly Ser Phe Leu Ser Val Ala Lys Gly Ser Asp 
245 250 255 



45 



50 



55 



Glu Pro Pro Val Phe Leu Glu He His Tyr Lys Gly Ser Pro Asn Ala 
260 265 270 

Asn Glu Pro Pro Leu Val Phe Val Gly Lys Gly He Thr Phe Asp Ser 
275 280 285 

Gly Gly He Ser He Lys Ala Ser Ala Asn Met Asp Leu Met Arg Ala 
290 295 300 

Asp Met Gly Gly Ala Ala Thr He Cys Ser Ala He Val Ser Ala Ala 
305 310 315 320 

Lys Leu Asn Leu Pro He Asn He He Gly Leu Ala Pro Leu Cys Glu 
325 330 335 



60 Asn Met Pro Ser Gly Lys Ala Asn Lys Pro Gly Asp Val Val Arg Ala 
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340 



345 



350 



Lys Asn Gly Lys Thr lie Gin Val Asp Asn Thr Asp Ala Glu Gly Arg 
355 360 365 

5 

Leu lie Leu Ala Asp Ala Leu Cys Tyr Ala His Thr Phe Asn Pro Lys 
370 375 380 

Xaa He Leu Asn Ala Ala Thr Leu Thr Gly Ala Met Asp Val Ala Leu 
10 385 390 395 400 

Gly Ser Gly Ala Thr Gly Val Phe Thr Asn Ser Ser Trp Leu Trp Asn 
405 410 415 

15 Lys Leu Phe Glu Ala Ser He Glu Thr Gly Asp Arg Val Trp Arg Met 
420 425 430 



20 



Pro Leu Phe Glu His Tyr Thr Arg Gin Val Val Asp Cys Gin Leu Ala 
435 440 445 

Asp Val Asn Asn He Gly Lys Tyr Arg Ser Ala Gly Ala Cys Thr Ala 
450 455 460 



Ala Ala Phe Leu Lys Glu Phe Val Thr His Pro Lys Trp Ala His Leu 
25 465 470 475 480 

Asp He Ala Gly Val Met Thr Asn Lys Asp Glu Val Pro Tyr Leu Arg 
485 490 495 

30 Lys Gly Met Thr Gly Arg Pro Thr Arg Thr Leu He Glu Phe Leu Leu 
500 505 510 



35 



Arg Phe Ser Gin Asp Asn Ala Xaa 
515 520 



(2) INFORMATION FOR SEQ ID NO: 345: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) T^PE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 345: 

45 

Thr He Leu Phe Leu Phe Leu Gin Leu Ser Ala Leu Arg Leu He Val 
15 10 15 

Gly Lys Asp Ser He Asp He Asp He Ser Ser Arg Arg Arg Glu Asp 
50 20 25 30 

Gin Ser Leu Arg Leu Asn Ala 
35 

55 

(2) INFORMATICS^ FOR SEQ ID NO: 346: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LQ3GTH: 234 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLCXSfY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 

5 Met Thr Ser Glu Leu Asp lie Phe Val Gly Asn Thr Thr Leu He Asp 
15 10 15 

Glu Asp Val Tyr Arg Leu Trp Leu Asp Gly Tyr Ser Val Thr Asp Ala 
20 25 30 



10 



25 



40 



50 



Val Ala Leu Arg Val Arg Ser Gly He Leu Glu Gin Thr Gly Ala Thr 
35 40 45 



Ala Ala Val Leu Gin Ser Asp Thr Met Asp His Tyr Arg Thr Phe His 
15 50 55 60 

Met Leu Glu Arg Leu Leu His Ala Pro Pro Lys Leu Leu His Gin Leu 
65 70 75 80 

20 He Phe Gin He Pro Pro Ser Arg Gin Ala Leu Leu He Glu Arg Tyr 
85 90 95 



Tyr Ala Phe Asp Glu Ala Phe Val Arg Glu Val Leu Gly Lys Lys Leu 
100 105 110 

Ser Lys Gly Thr Lys Lys Asp Leu Asp Asp He Ser Thr Lys Thr Gly 
115 120 125 



He Thr Leu Lys Ser Cys Arg Arg Gin Phe Asp Asn Phe Lys Arg Val 
30 130 135 140 

Phe Lys Val Val Glu Glu Met Arg Gly Ser Leu Val Asp Asn He Gin 
145 150 155 160 

35 Gin His Phe Leu Leu Ser Asp Arg Leu Ala Arg Asp Tyr Ala Ala He 
165 170 175 



Val Phe Phe Ala Asn Asn Arg Phe Glu Thr Gly Lys Lys Lys Leu Gin 
180 185 190 

Tyr Leu Ser Phe Gly Asp Phe Ala Phe Cys Ala Glu Leu Met He Gin 
195 200 205 



Asn Trp Thr Leu Gly Pro Val Asp Ser Gin Met Asp Asp Met Asp Met 
45 210 215 220 



Asp Leu Asp Arg Asn Phe Ser Arg Thr Xaa 
225 230 



(2) INFORMATION FOR SEQ ID NO: 347: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 169 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 



60 Met Ala Ala Ala Val Ala Gly Met Leu Arg Gly Gly Leu Leu Pro Gin 
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10 



15 



Ala Gly Arg Leu Pro Thr Leu Gin Thr Val Arg Tyr Gly Ser Lys Ala 
20 25 30 

5 

Val Thr Arg His Arg Arg Val Met His Phe Gin Arg Gin Lys Leu Met 
35 40 45 

Ala Val Thr Glu Tyr He Pro Pro Lys Pro Ala He His Pro Ser Cys 
10 50 55 60 

Leu Pro Ser Pro Pro Ser Pro Pro Gin Glu Glu He Gly Leu He Arg 
65 70 75 80 

IS Leu Leu Arg Arg Glu He Ala Ala Val Phe Gin Asp Asn Arg Met He 
85 90 95 

Ala Val Cys Gin Asn Val Ala Leu Ser Ala Glu Asp Lys Leu Leu He 
100 105 110 



20 



Ala Thr Pro Ala Ala Glu Thr Gin Asp Pro Asp Glu Gly Leu Pro Gin 
115 120 125 



Pro Gly Pro Glu Ser Pro Ser Trp Arg He Pro Ser Thr Lys He Cys 
25 130 135 140 



Cys Pro Phe Leu Trp Gly Thr Thr Cys Cys Trp Ser Val Lys Ser Pro 
145 150 155 160 

30 Arg Ser Arg Arg Trp Tyr Gly Ser Xaa 
165 



35 (2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 43 amino acids 

(B) TYPE: andno acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

Met Lys Arg Ser Phe Leu Leu Pro Leu Leu Leu Val Gly Phe Leu Asp 
15 10 15 

45 

Thr Ala His Leu He Leu Leu Glu Thr Leu Ser Val Cys Leu Trp Leu 
20 25 30 

Pro Ser Leu lie Asp Ser Arg Cys Val Met Ser 
50 35 40 



(2) INFORMATION FOR SEQ ID NO: 349: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
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Met Lys Glu Gly Pro Pro Cys Lys Arg His His Tyr Tyr Gin Asn Cys 
1 5 10 15 

5 Gly Ala Lys Leu Leu Val Ser Leu Phe Gly Glu Thr Asn Gin He His 
20 25 30 



10 



Leu Leu Glu Thr Gin Val Gly Thr Glu Lys Gly Gly Glu Arg He Trp 
35 40 45 

Glu Glu Lys Trp Arg He Ser Ser Thr Val Leu Phe He Ser Val Asn 
50 55 60 



Ser Tyr Val Glu Gly Ser Val Leu Glu He Lys Leu Phe Tyr 
15 65 70 75 



20 



(2) INFORMATION FOR SEQ ID NO: 350: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LBIGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

Met Ser Glu He Leu Ser Leu Leu Phe Cys Leu Leu Gly Pro Ala Leu 
15 10 15 

30 Asp Glu Arg Arg Glu Glu Lys Asp 
20 



35 (2) INFORMATIC»J FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

Met Ser Ser Ala Gly Thr Ala Thr Pro Leu Glu Met Asp His Lys Leu 
15 10 15 

Thr Ser Gin Pro Gly Arg Pro Ser Phe Tyr Cys Asn Ser Arg His Ser 
20 25 30 



45 



He Val Gly Ser Ser His Gin Leu Gly Phe Trp Phe Ser His Leu Glu 
50 35 40 45 

Ser Ser Gly Leu Lys Val Phe Gin Val Ser Leu Pro Cys Glu Cys Val 
50 55 60 

55 Asn Leu Pro Thr Arg He Ala Ser Val Val Leu Ser Leu Met Ser Leu 
65 70 75 80 



Leu Val Val Gly Gin Ala Pro Ala Trp Glu Gly Ser Leu Leu Arg Gly 
85 90 95 



60 
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Arg Pro Ala Gly Gly Ala His Leu Cys Ala Met Xaa Val lie Glu Gly 
100 105 110 

lieu Val Val Asp Val Gly Glu Arg lie Leu His Gly Gin Arg Glu Val 
5 115 120 125 

Gly Gin Val Ser Gin Val Leu Pro Ala Leu Ser Leu Gly Leu Val Phe 
130 135 140 

10 Leu Cys Gin Gly Thr Val Glu Lys Val Ser Gly Ala Ala His Cys Ser 
145 150 155 160 



15 



Ser Leu Leu Cys Cys Leu Pro Trp Gin Cys Ser Gly Gly Gly Phe Pro 
165 170 175 

Thr Xaa Arg Cys Ser Arg Pro Tyr Phe Ser Ser His Lys Gly Val Ala 
180 185 190 



Ala Thr Leu Ala Leu Thr Cys His Cys Asp Lys Val His Val Ala Gly 
20 195 200 205 

Leu Gly Lys Asp Trp Ala He Glu Gin Arg Arg Arg Thr Cys Glu Ser 
210 215 220 

25 Asp Xaa Glu Xaa Xaa Pro Phe Thr Leu Ala Gly Leu Val Leu Val Leu 
225 230 235 240 



30 



Arg Phe Cys Gin Val Val Leu Val Trp He Pro Gin Leu Gly Asp Lys 
245 250 255 

His Trp Arg Gly Met Thr Arg Leu Gly Arg Val Ser Leu Thr Ser Ser 
260 265 270 



35 



He Xaa 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 352: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

Met He Phe Thr Ser Val Thr Lys Gly He Leu Leu He Ala Leu Trp 
15 10 15 

50 Val Pro Leu Phe His Phe Met Leu He Asp Ser He Leu Gly Pro Ser 
20 25 30 



Arg Leu Leu Thr Asp Gly Val Pro Phe Asn Pro Trp His Val Xaa 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 353: 
60 (i) SEQUENCE CHARACTERISTICS: 



wo 98/39448 



PCT/US98/04493 



564 



(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SBQUOaCE DESCRIPTION: SBQ ID NO: 353: 

5 

Met Lys Thr 
1 



10 

(2) INFORMATION FOR SBQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEC3UENCE DESCRIPTION: SEQ ID NO: 354: 

Met Ser lie Ser Gly Thr Asp Gly Leu He Leu Leu Leu Val Gly Leu 
20 1 5 10 15 

Glu Ala Xaa Val Arg Ser Ser Lys Lys Trp He Pro Lys Ala Leu Xaa 
20 25 30 

25 Val Thr Gin Ala Lys Trp Asn Ser Trp Pro Ser Arg Arg Asn Ala Gly 
35 40 45 

Phe Ala Leu His 
50 

30 



(2) INFORMATION FOR SEQ ID NO: 355: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

40 

Met Glu His Cys Leu Tyr His Ser Val His Gly He Asn Pro Tyr He 
15 10 15 

His Lys Asn Thr His Pro Ser He Asn He Tyr Met Val Trp Asp Glu 
45 20 25 30 

Gin Val Asn Ser Phe Glu Arg Glu Phe Val Pro Phe Phe Phe Leu He 
35 40 45 

50 He Leu Leu Asn Cys Cys Gin Leu Ser Asn Lys Gin Thr Glu Lys Leu 
50 55 60 

Phe Gly Lys Thr Leu His Thr Pro Phe Leu Ser Ser Ala Leu Lys Tyr 
65 70 75 80 

55 

Arg Leu Asn Thr His He Leu Pro Val Phe Ser Tyr Ser Asp Ser He 
85 90 95 



60 



Leu Thr Cys His Leu He Leu Ala Ser Tyr Phe Ser His Val Tyr Leu 
100 105 110 
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Pro Val Thr Cys lie Cys Tyr Leu Asn Arg Lys Lys Asn lie Gin Lys 
115 120 125 

5 Lys Lys Asn Xaa 
130 



10 (2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

Met Gly Ser Arg Asp His Leu Phe Lys Val Leu Val Val Gly Asp Ala 
15 10 15 

Ala Val Gly Lys Thr Ser Leu Val Gin Asp Tyr Ser Gin Asp Ser Phe 
20 25 30 



20 



Ser Lys His Tyr Lys Ser Thr Val Gly Val Asp Phe Ala Leu Lys Val 
25 35 40 45 

Leu Gin Trp Ser Asp Tyr Glu He Val Arg Leu Gin Leu Trp Asp He 
50 55 60 

30 Ala Gly Gin Glu Arg Phe Thr Ser Met Thr Arg Leu Tyr Tyr Arg Asp 
65 70 75 80 

Ala Ser Ala Cys Val He Met Phe Asp Val Thr Asn Ala Thr Thr Phe 
85 90 95 

35 

Ser Asn Ser Gin Arg Trp Lys Gin Asp Leu Asp Ser Lys Leu Thr Leu 
100 105 110 

Pro Asn Gly Glu Pro Val Pro Cys Leu Leu Leu Ala Asn Lys Cys Asp 
40 115 120 125 

Leu Ser Pro Trp Ala Val Ser Arg Asp Gin He Asp Arg Phe Ser Lys 
130 135 140 

45 Glu Asn Gly Phe Thr Gly Trp Thr Glu Thr Ser Val Lys Glu Asn Lys 
145 150 155 160 

Asn He Asn Glu Ala Met Arg Val Leu He Glu Lys Met Met Arg Asn 
165 170 175 

50 

Ser Thr Glu Asp He Met Ser Leu Ser Thr Gin Gly Asp Tyr He Asn 
180 185 190 

Leu Gin Thr Lys Ser Ser Ser Trp Ser Cys Cys Xaa 
55 195 200 



60 



{2) INPORMATI(»l FOR SEQ ID NO: 357: 
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(i) SE3QUENCE CHARACTERISTICS: 

(A) LEMGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Met lie Ser Leu lie Phe Gin Leu Glu Glu Glu Lys Leu Val Glu Lys 
15 10 15 

10 Phe Phe Phe Phe Leu Phe Phe Phe Leu Lys Lys Gly Ser Gin Gly Ser 
20 25 30 

Asn Leu Lys lie Val Pro Arg His Met Arg Val Val Leu Arg Gly 
35 40 45 

15 



<2) INFORMATION FOR SEQ ID NO: 358: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTK^: SEQ ID NO: 358: 

25 

Met Thr Tyr Val Thr Cys Leu His Val Cys Leu Leu Val Glu Phe Leu 
15 10 15 

Asn Ser Gin Leu Thr Asn His Arg Lys Tyr Tyr Phe Leu Ser Tyr Gly 
30 20 25 30 

Phe Trp Phe Thr Gly Leu Arg Gly Phe Ser Glu Tyr Leu Trp Pro Gin 
35 40 45 

35 Gin His Thr Ser Phe His Pro Asn Arg Asn Glu He Asn Phe Val Ser 
50 55 60 

Thr Asp Asn Arg He Trp Val Thr Xaa 
65 70 

40 



(2) INFORMATION FOR SEQ ID NO: 359: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

50 

Met Ser Asp Gin Glu Ala Lys Pro Ser Thr Glu Asp Leu Gly Asp Lys 

.1 5 10 15 

Lys Glu Gly Glu Tyr He Lys Leu Lys Val He Gly Gin Asp Ser Ser 
55 20 25 30 

Glu He His Phe Lys Val Lys Met Thr Thr His Leu Lys Lys Leu Lys 
35 40 45 

60 Glu Ser Tyr Cys Gin Arg Gin Gly Val Pro Met Asn Ser Leu Arg Phe 
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50 55 60 

Leu Phe Glu Gly Gin Arg He Ala Asp Asn His Thr Pro Lys Glu Leu 
65 70 75 80 

5 

Gly Met Glu Glu Glu Asp Val He Glu Val Tyr Gin Glu Gin Thr Gly 
85 90 95 

Gly His Ser Thr Val Xaa 

10 100 



15 



30 



(2) INFORMATION FOR SBQ ID NO: 360: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 360: 

Met Gly Phe Pro Gin Trp His Leu Gly Asn His Ala Val Glu Pro Val 
15 10 15 

25 Thr Ser He Leu Leu Leu Phe Leu Leu Met Met Leu Gly Val Arg Gly 
20 25 30 



Leu Leu Leu Val Gly Leu Val Tyr Leu Val Ser His Leu Ser Gin Arg 
35 40 45 



35 

(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

Met Ser Ala Glu Val Lys Val Thr Gly Gin Asn Gin Glu Gin Phe Leu 
45 1 5 10 15 

Leu Leu Ala Lys Ser Ala Lys Gly Ala Ala Leu Ala Thr Leu He His 
20 25 30 

50 Gin Val Leu Glu Ala Pro Gly Val Tyr Val Phe Gly Glu Leu Leu Asp 
35 40 45 

Met Pro Asn Val Arg Glu Leu Ala Glu Ser Asp Phe Ala Ser Thr Phe 
50 55 60 

55 

Arg Leu Leu Thr Val Phe Ala Tyr Gly Thr Tyr Ala Asp Tyr Leu Ala 
65 70 75 80 

Glu Ala Arg Asn Leu Pro Pro Leu Thr Glu Ala Gin Lys Asn Lys Leu 
60 85 90 95 
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Arg His Leu Ser Val Val Thr Leu Ala Ala Lys Val Lys Cys lie Pro 
100 105 110 

5 Tyr Ala Val Leu Leu Glu Ala Leu Ala Leu Arg Asn Val Arg Gin Leu 
115 120 125 

Glu Asp Leu Val He Glu Ala Val Tyr Ala Asp Val Leu Arg Gly Ser 
130 135 140 

10 

Leu Asp Gin Arg Asn Gin Arg Leu Glu Val Asp Tyr Ser He Gly Arg 
145 150 155 160 

Asp He Gin Arg Gin Asp Leu Ser Ala lie Ala Arg Thr Leu Xaa Lys 
15 165 170 175 

Asn His Xaa 



20 



35 



45 



60 



(2) INFORMATION FOR SEQ ID NO: 362: 



(i) SEQUENCE CHARACTERISTICS; 
25 (A) LENGTH: 25 _ amino acids 

<B) TVPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SBQUESraE DESCRIPTION: SEQ ID NO: 362: 

30 Met Lys Ser Ser Ser Leu Phe Phe Phe Phe Leu Ala His Phe He His 
15 10 15 



Ser His Asp Leu Pro Gly Leu Cys Arg 
20 25 



{2) INFORMATION FOR SEQ ID NO: 363: 



40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 363: 



Met Lys Phe Ala Ala Ser Gly Xaa Phe Leu His His Met Ala Gly Leu 
15 10 15 



Ser Ser Ser Lys Leu Ser Met Ser Lys Ala Leu Pro Leu Thr Lys Val 
50 20 25 30 

Val Gin Asn Asp Ala Tyr Thr Ala Pro Ala Leu Pro Ser Ser He Arg 
35 40 45 

55 Thr Lys Ala Leu Thr Asn Met Ser Arg Thr I^u Val Asn Lys Glu Glu 
50 55 60 



Pro Pro Lys Glu Leu Pro Ala Ala Glu Pro Val Leu Ser Pro Leu Glu 
65 70 75 80 
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Gly Thr Lys Met Thr Val Asn Asn Leu His Pro Arg Val Thr Glu Glu 
85 90 95 

Asp lie Val Glu Leu Phe Cys Val Cys Gly Ala Leu Lys Arg Ala Arg 
5 100 105 110 

Leu Val His Pro Gly Val Ala Glu Val Val Phe Val Lys Lys Asp Asp 
115 120 125 

10 Ala He Thr Ala Tyr Lys Lys Tyr Asn Asn Arg Cys Leu Asp Gly Gin 
130 135 140 

Pro Met Lys Cys Asn Leu His Met Asn Gly Asn Val He Thr Ser Asp 
145 150 155 160 

15 

Gin Pro He Leu Leu Arg Leu Ser Asp Ser Pro Ser Met Lys Lys Glu 
165 170 175 

Ser Glu Leu Pro Arg Arg Val Asn Ser Ala Ser Ser Ser Asn Pro Pro 
20 180 185 190 

Ala Glu Val Asp Pro Asp Thr He Leu Lys Ala Leu Phe Lys Ser Ser 
195 200 205 

25 Gly Ala Ser Xaa Thr Thr Gin Pro Thr Glu Phe Lys He Lys Leu Xaa 
210 215 220 



30 



(2) INFORMATION FOR SEQ ID NO: 364: 

35 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 364: 

40 

Met Ser Lys Asn Cys He Lys Leu Leu Cys Glu Asp Pro Val Phe Ala 
15 10 15 

Glu Tyr He Lys Cys He Leu Met Asp Glu Arg Thr Phe Leu Asn Asn 
45 20 25 30 

Asn He Val Tyr Thr Phe Met Thr His Phe Leu Leu Lys Val Gin Ser 
35 40 45 

50 Gin Val Phe Ser Glu Ala Asn Cys Ala Asn Leu He Ser Thr Leu He 
50 55 60 

Thr Asn Leu He Ser Gin Tyr Gin Asn Leu Gin Ser Asp Phe Ser Asn 
65 70 75 80 

55 

Arg Val Glu He Ser Lys Ala Ser Ala Ser Leu Asn Gly Asp Leu Arg 
85 90 95 



Ala Leu Ala Leu Leu Leu Ser Val His Thr Pro Lys Gin Leu Asn Pro 
60 100 105 110 
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Ala Leu lie Pro Thr Leu Gin Glu Leu Leu Ser Lys Cys Arg Thr Cys 
115 120 125 

Leu Gin Gin Arg Asn Ser Leu Gin Glu Gin Glu Ala Lys Glu Arg Lys 
130 135 140 



10 



Thr Lys Asp Asp Glu Gly Ala Thr Pro lie Lys Arg Arg Arg Val Ser 
145 150 155 160 

Ser Asp Glu Glu His Thr Val Asp Ser Cys lie Ser Asp Met Lys Thr 
165 170 175 



Glu Thr Arg Glu Val Leu Thr Pro Thr Ser Thr Ser Asp Asn Glu Thr 
15 180 185 190 

Arg Asp Ser Ser lie lie Asp Pro Gly Thr Glu Gin Asp Leu Pro Ser 
195 200 205 

20 Pro Glu Asn Ser Ser Val. Lys Glu Tyr Arg Met Glu Val Pro Ser Ser 
210 215 220 



25 



Phe Ser Glu Asp Met Ser Asn lie Arg Ser Gin His Ala Glu Glu Gin 
225 230 235 240 

Ser Asn Asn Gly Arg Tyr Asp Asp Cys Lys Glu Phe Lys Asp Leu His 
245 250 255 



Cys Ser Lys Asp Ser Thr Leu Ala Glu Glu Glu Ser Glu Phe Pro Ser 
30 260 265 270 

Thr Ser He Ser Ala Val Leu Ser Asp Leu Ala Asp Leu Arg Ser Cys 
275 280 285 

35 Asp Gly Gin Ala Leu Pro Ser Gin Asp Pro Glu Val Ala Leu Ser Leu 
290 295 300 



40 



Ser Cys Gly His Ser Arg Gly Leu Phe Ser His Met Gin Gin His Asp 
305 310 315 320 

He Leu Asp Thr Leu Cys Arg Thr He Glu Ser Thr He His Val Val 
325 330 335 



Thr Arg He Ser Gly Lys Gly Asn Gin Ala Ala Ser Xaa 
45 340 345 



{2) INFORMATIC»« FOR SEQ ID NO: 365: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365: 

Met Leu His Gin Asp His He Thr Phe Ala Met Leu Leu Ala Arg He 
16 10 15 



60 Lys Leu Lys Gly Thr Val Gly Glu Pro Thr Tyr Asp Ala Glu Phe Gin 
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20 



25 



30 



His Phe Leu Arg Gly Asn Glu lie Val Leu Ser Ala Gly Ser Thr Pro 
35 40 45 

5 

Arg He Gin Gly Leu Thr Val Glu Gin Ala Glu Ala Val Val Arg Leu 
50 55 60 

Ser Cys Leu Pro Ala Phe Lys Asp Leu He Ala Lys Val Gin Ala Asp 
10 65 70 75 80 

Glu Gin Phe Gly He Trp Leu Asp Ser Ser Ser Pro Glu Gin Thr Val 
85 90 95 

15 Pro Tyr Leu Trp Ser Glu Glu Thr Pro Ala Thr Pro He Gly Gin Ala 
100 105 110 



20 



He His Arg Leu Leu Leu He Gin Ala Phe Arg Pro Asp Arg Leu Leu 
115 120 125 

Ala Met Ala His Met Phe Val Ser Thr Asn Leu Gly Glu Ser Phe Met 
130 135 140 



Ser He Met Glu Gin Pro Leu Asp Leu Thr His He Val Xaa Thr Glu 
25 145 150 155 160 

Val Lys Pro Asn Thr Pro Val Leu Met Cys Ser Val Pro Gly Tyr Asp 
165 170 175 

30 Ala Ser Gly His Val Glu Asp Leu Ala Ala Glu Gin Asn Thr Gin He 
180 185 190 



35 



Thr Ser He Ala He Gly Ser Ala Glu Gly Phe Asn Gin Ala Asp Lys 
195 200 205 

Ala He Asn Thr Ala Val Lys Ser Gly Arg Trp Val Met Leu Lys Asn 
210 215 220 



Val His Leu Ala Pro Gly Trp Leu Met Gin Leu Glu Lys Lys Leu His 
40 225 230 235 240 

Ser Leu Gin Pro His Ala Cys Phe Arg Leu Phe Leu Thr Met Glu He 
245 250 255 

45 Asn Pro Lys Val Pro Val Asn Leu Leu Arg Ala Gly Arg He Phe Val 
260 265 270 



50 



Phe Glu Pro Pro Pro Gly Xaa Lys Ala Asn Met Leu Arg Thr Phe Ser 
275 280 285 

Ser He Pro Val Ser Arg He Cys Lys Ser Pro Asn Glu Arg Ala Arg 
290 295 300 



Leu Tyr Phe Leu Leu Ala Trp Phe His Ala He He Gin Glu Arg Leu 
55 305 310 315 320 



Arg Tyr Ala Pro Leu Gly Trp Ser Lys Lys Tyr Glu Phe Gly Glu Ser 
325 330 335 



60 Asp Leu Arg Ser Xaa Cys Asp Thr Val Asp Thr Trp Leu Asp Asp Thr 
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340 



345 350 



Ala Lys Gly Arg Gin Asn He Ser Pro Asp Lys He Pro Trp Ser Ala 
355 360 365 

Leu Lys Thr Leu Met Ala Gin Ser He Tyr Gly Gly Arg Val Asp Asn 
370 375 380 

Glu Phe ASP Gin Arg Leu Leu Asn Thr Phe Leu Glu Arg Leu Phe Thr 
10 385 390 395 400 

Thr Arg Ser Phe Asp Ser Glu Phe Lys Leu Ala Cys Lys Val Asp Gly 
405 410 415 

15 His Lys Asp He Gin Met Pro Asp Gly Met Gin Ala Arg Gly Val Cys 
420 425 430 

Ala val Gly Gly Val Ala Pro Arg His Pro Asp Ala Leu Leu Ala Gly 
435 440 445 

20 

Pro Ala Gin Gin Arg Arg Glu Ser Pro Pro Tyr His Thr Gly Cys Gly 
450 455 460 

His Asp Gin 
25 465 



30 



(2) INFORMATION FOR SEQ ID NO: 366: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

Met Ala ASP Glu Ala Thr Arg Arg Val Val Ser Glu He Pro Val Leu 
1 5 10 15 

40 Lys Thr Asn Ala Gly Pro Arg Asp Arg Glu Leu Trp Val Gin Arg Leu 
20 25 30 

Lys Glu Glu Tyr Gin Ser Leu He Arg Tyr Val Glu Asn Asn Lys Asn 
35 40 45 

45 

Ala Asp Asn Asp Tip Phe Arg Leu Glu Ser Asn Lys Glu Gly Thr Arg 
50 55 60 

Trp Phe Gly Lys Cys Trp Tyr He His Asp Leu Leu Lys Tyr Glu Phe 
50 65 70 75 80 

ASP He Glu Phe Asp He Pro He Thr Tyr Pro Thr Thr Ala Pro Glu 
85 90 95 

55 He Ala val Pro Glu Leu Asp Gly Lys Thr Ala Lys Met Tyr Arg Gly 
100 105 110 

Gly Lys He Cys Leu Thr Asp His Phe Lys Pro Leu Trp Gly Gin Glu 
115 120 125 

60 
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Cys Ala Gin He Trp Thr Ser Ser Ser His Gly Ser Gly Ala Gly Ser 
130 135 140 

Met Xaa Gly Ser Gly Asn Pro Xaa 
5 145 150 



(2) INFORMATION FOR SEQ ID NO: 367: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQiGTH: 373 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
15 (xi) SEQUENCE DESCRIPTIOI: SEQ ID NO: 367: 

Met Tyr Asp Gly Thr Lys Glu Val Pro Met Asn Pro Val Lys He Tyr 
15 10 15 

20 Gin Val Cys Asp He Pro Gin Pro Gin Gly Ser lie He Asn Pro Gly 
20 25 30 

Ser Thr Gly Ser Ala Pro Trp Asp Glu Lys Asp Asn Asp Val Asp Glu 
35 40 45 

25 

Glu Asp Glu Glu Asp Glu Leu Asp Gin Ser Gin His His Val Pro He 
50 55 60 

Gin Asp Thr Phe Pro Phe Leu Asn He Asn Gly Ser Pro Met Ala Pro 
30 65 70 75 80 

Ala Ser Val Gly Asn Cys Ser Val Gly Asn Cys Ser Pro Glu Ala Val 
85 90 95 

35 Trp Pro Lys Thr Glu Pro Leu Glu Met Glu Val Pro Gin Ala Pro He 
100 105 UO 

Gin Pro Phe Tyr Ser Ser Pro Glu Leu Trp He Ser Ser Leu Pro Met 
115 120 125 

40 

Thr Asp Leu Asp He Lys Phe Gin Tyr Arg Gly Lys Glu Tyr Gly Gin 
130 135 140 

Thr Met Thr Val Ser Asn Pro Gin Gly Cys Arg Leu Phe Tyr Gly Asp 
45 145 150 155 160 

Leu Gly Pro Met Pro Asp Gin Glu Glu Leu Phe Gly Pro Val Xaa Leu 
165 170 175 

50 Glu Gin Val Lys Phe Pro Gly Pro Glu His He Thr Asn Glu Lys Gin 
180 185 190 

Lys Leu Phe Thr Ser Lys Leu Leu Asp Val Met Asp Arg Gly Leu He 
195 200 205 

55 

Leu Glu Val Ser Gly His Ala He Tyr Ala He Arg Leu Cys Gin Cys 
210 215 220 



Lys Val Tyr Trp Ser Gly Pro Cys Ala Pro Ser Leu Val Ala Pro Asn 
60 225 230 235 240 
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Leu He Glu Arg Gin Lys Lys Val Lys Leu Phe Cys Leu Glu Thr Phe 
245 250 255 

5 Leu Ser Asp Leu He Ala His Gin Lys Gly Gin He Glu Lys Gin Pro 
260 265 270 

Pro Phe Glu He Tyr Leu Cys Phe Gly Glu Glu Trp Pro Asp Gly Lys 
275 280 285 

Pro Leu Glu Arg Lys Leu He Leu Val Gin Val He Pro Val Val Ala 
290 295 300 

Arg Met He Tyr Glu Met Phe Ser Gly Asp Phe Thr Arg Ser Phe Asp 
15 305 310 315 320 

Ser Gly Ser Val Arg Leu Gin He Ser Thr Pro Asp He Lys Asp Asn 
325 330 335 

20 He Val Ala Gin Leu Lys Gin Leu Tyr Arg He Leu Gin Thr. Gin Glu 
340 345 350 

Ser Trp Gin Pro Met Gin- Pro Thr Pro Ser Met Gin Leu Pro Pro' Ala 
355 360 365 

25 

Leu Pro Pro Gin Xaa 
370 



30 

(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 

Met Gly ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser 
40 1 5 10 IS 

Leu Gly Gly Arg Cys He Gin Gly Arg Phe Ala Ser His Ser Lys Phe 
20 25 30 

45 Trp Gly Phe Gly Arg Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr 
35 40 45 

Pro Pro Asp Gin Glu Pro Gin Lys Glu Thr Glu Pro Ala Thr Ser Ser 
50 55 60 

50 

His Ala Arg Pro Trp Ala Arg Val He Gly Leu Arg He Trp Pro Gin 
65 70 75 80 

Pro Asn Xaa 

55 



(2) INFORMATION FOR SEQ ID NO: 369: 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 

Met Leu Leu Ser Val Ala lie Phe lie Leu Leu Thr Leu Val Tyr Ala 
15 10 15 

10 Tyr Trp Thr Met Xaa 
20 



15 (2) INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TY^PE: amino acid 
20 (D) TOPOLOGY: lineau: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 

Met Gly Ala Ser Ala Arg Leu Leu Arg Ala Val lie Met Gly Ala Pro 
15 10 15 

25 

Gly Ser Gly Lys Gly Thr Val Ser Ser Arg He Thr Thr His Phe Glu 
20 25 30 

Leu Lys His Leu Ser Ser Gly Asp Leu Leu Arg Asp Asn Met Leu Arg 
30 35 40 45 

Gly Thr Glu He Gly Val Leu Ala Lys Ala Phe He Asp Gin Gly Lys 
50 55 60 

35 Leu He Pro Asp Asp Val Met Thr Arg Leu Ala Leu His Glu Leu Lys 
65 70 75 80 

Asn Leu Thr Gin Tyr Ser Trp Leu Leu Asp Gly Phe Pro Arg Thr Leu 
85 90 95 

40 

Pro Gin Ala Glu Ala Leu Asp Arg Ala Tyr Gin He Asp Thr Val He 
100 105 110 

Asn Leu Asn Val Pro Phe Glu Val He Lys Gin Arg Leu Thr Ala Arg 
45 115 120 125 

Trp He His Pro Ala Ser Gly Arg Val Tyr Asn He Glu Phe Asn Pro 
130 135 140 

50 Pro Lys Thr Val Gly He Asp Asp Leu Thr Gly Glu Pro Leu He Gin 
145 150 155 160 

Arg Glu Asp Asp Lys Pro Glu Thr Val He Lys Arg Leu Lys Ala Tyr 
165 170 175 

55 

Glu Asp Gin Thr Lys Pro Val Leu Glu Tyr Tyr Gin Lys Lys Gly Val 
180 185 190 



Leu Glu Thr Phe Ser Gly Thr Glu Thr Asn Lys He Trp Pro Tyr Val 
60 195 200 205 
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Tyr Ala Phe Leu Gin Thr Lys Val Pro Gin Arg Ser Gin Lys Ala Ser 
210 215 220 

5 Val Thr Pro 
225 



10 (2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGV: linear 

(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 371: 



20 



Met Phe Leu Asn Cys Glu He Leu Glu Tyr Cys Tyr Tyr Leu Thr Gin 
15 10 15 

Leu Lys He Ser Met Gly Lys Tyr Leu Ser He Pro Thr Val Leu Leu 
20 25 30 



Lys He He Arg Cys Ser He Thr Ala Val Ser Asp Ser Ser Thr Ser 
25 35 40 45 

Trp Ala He Lys Ala Gin Leu Lys He Glu Asn Lys Asp Leu Asp Asn 
50 55 60 

30 Lys Thr Ala Lys Gly Gly Gly Gin Glu Ala Leu Thr Cys Thr Xaa 
65 70 75 



35 (2) INFORMATION FOR SEQ ID NO: 372: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

Met Arg Ala Val Phe Pro Cys Cys Pro Phe Leu Thr Leu Met Leu Pro 
1 5 10 15 

Leu Leu Glu Cys Leu Val Gly Met He Met Cys Tyr Leu Gly He Ser 
20 25 30 



45 



Phe Thr Asp Thr Arg Lys Thr Ala Gly Leu Lys Lys Lys Lys Lys Lys 
50 35 40 46 



55 



Lys Xeia Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 373: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY; linear 
(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 373: 

Met Phe Leu Met Arg Met His Leu Cys Phe Cys Lys Tyr Cys Cys Ser 
15 10 15 

Phe He Val Thr Pro Thr Ser Thr Ser Asn Thr Ala Ser Tyr Leu Trp 
20 25 30 

Pro Trp He Ser Ala Ser Met Ala Gly Arg Gly Ser Ser Trp Ala Cys 
35 40 45 



Thr Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu Xaa 
15 50 55 60 



20 



35 



(2) INFORMATION FOR SEQ ID NO: 374: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe 
15 10 15 

30 Thr Leu Ser He Asn Gin Glu Lys Leu Ala Asn His Leu Ala Phe Arg 
20 25 30 



He Leu Phe Phe He Val Phe Xaa 
35 40 



(2) INFXDRMATION FOR SEQ ID NO: 375: 

40 (i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 44 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 375: 

45 

Met Cys Ser Gly Gin Ser Gin Val Trp Lys Met Ala Leu Gin Ala Leu 
15 10 15 

Asp Ser Glu Thr Val Val He Leu Pro Asp Met His Leu He Leu Ser 
50 20 25 30 

Leu Arg Leu He His Asn Ala Arg Pro Cys Leu Xaa 
35 40 

55 

(2) INFORMATION FOR SEQ ID NO: 376: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

5 Met Leu He Ser Glu Glu Glu He Pro Phe Lys Asp Asp Pro Arg Asp 
15 10 15 



10 



25 



40 



Glu Thr Tyr Lys Pro His Leu Glu Arg Glu Thr Pro Lys Pro Arg Arg 
20 25 30 

Lys Ser Gly Lys Val Lys Glu Glu Lys Glu Lys Lys Glu He Lys Val 
35 40 45 



Glu Val Glu Val Glu Val Lys Glu Glu Glu Asri Glu He Arg Glu Asp 
15 .50 55 60 

Glu Glu Pro Pro Arg Lys Arg Gly Arg Arg Arg Lys Asp Asp Lys Ser 
65 70 75 80 

20 Pro Arg Leu Pro Lys Arg Arg Lys Lys Pro Pro He Gin Tyr Val Arg 
85 90 95 



Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala His Pro Arg Tyr Leu 
100 105 110 

Gin His His He Lys Tyr Gin His Leu Leu Lys Lys Lys Tyr Val Cys 

115 120 125 



Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu Gin Lys Gin Leu Leu 
30 130 135 140 

Arg His Ala Lys His His Thr Asp Gin Arg Asp Tyr He Cys Glu Tyr 
145 150 155 160 

35 Cys Ala Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met 
165 170 175 



He His Thr Gly Glu Lys His Tyr Asn Val Arg Ser Val Asp Leu Leu 
180 185 190 

Val Asp Lys Arg His Leu Leu He Gly Thr Xaa 
195 200 



45 

(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

Met Leu Pro Arg Arg Thr Phe Tyr Phe Tyr Phe He Phe He Phe Phe 
55 1 5 10 15 

Leu Ala Ser Phe Trp Gly Phe Thr Leu Arg Ala Ser Phe 
20 25 



60 
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(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 

<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378: 

10 Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu Met 
15 10 15 

Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gin His Ala Val Val Leu 
20 25 30 

15 

Tyr Leu His Asn His Ser Tyr Ala Ser lie Gin Ser Asp Asp Leu Trp 
35 40 45 

Asp Ser Phe Asn Glu Val Thr Asn Gin Thr Leu Asp Val Lys Arg Met 
20 50 55 60 

Met Lys Thr Trp Thr Leu Gin Lys Gly Phe Pro Leu Val Thr Val Gin 
65 70 75 80 

25 Lys Lys Gly Lys Glu Leu Phe lie Gin Gin Glu Arg Phe Phe Leu Asn 
85 90 95 

Met Lys Pro Glu lie Gin Pro Ser Asp Thr Arg Tyr Met Pro Ser Phe 
100 105 110 

30 

Phe Ser Cys His Leu Phe Cys Thr Leu Arg Trp Lys Tyr Phe Glu Val 
115 120 125 

Phe Tyr Asn His Lys Phe Leu Xaa 
35 130 135 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 379: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379: 

Met Ala Trp Arg Arg Arg Glu Pro Ala Ser Gly Leu Ala Ala Cys Trp 
15 10 15 

50 Leu Trp Arg Cys Ser Pro Trp Pro Cys Ala Cys Pro Gly Pro Gly Ala 
20 25 • 30 



Gly Leu Ser Ser Gly Ser Arg Pro Trp 
35 40 



(2) INFORMATION FOR SEQ ID NO: 380: 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 468 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

^ Met Glu Phe Leu Lys Val Ala Arg Arg Asn Lys Arg Glu Gin Leu Glu 
15 10 15 

Gin He Gin Lys Glu Leu Ser Val Leu Glu Glu Asp He Lys Arg Val 
10 20 25 30 

Glu Glu Met Ser Gly Leu Tyr Ser Pro Val Ser Glu Asp Ser Thr Val 
35 40 45 

15 Pro Gin Phe Glu Ala Pro Ser Pro Ser His Ser Ser He He Asp Ser 
50 55 60 

Thr Glu Tyr Ser Gin Pro Pro Gly Phe Ser Gly Ser Ser Gin Thr Lys 
65 70 75 80 

Lys Gin Pro Trp Tyr Asn Ser Thr Leu Ala Ser Arg Arg Lys Arg Leu 
85 90 95 



20 



Thr Ala His Phe Glu Asp Leu Glu Gin Cys Tyr Phe Ser Thr Arg Met 
25 100 105 . HO 

ser Arg He Ser Asp Asp Ser Arg Thr Ala Ser Gin Leu Asp Glu Phe 
115 120 125 

30 Gin Glu Cys Leu Ser Lys Phe Thr Arg Tyr Asn Ser Val Arg Pro Leu 
130 135 140 

Ala Thr Leu Ser Tyr Ala Ser Asp Leu Tyr Asn Gly Ser Ser He Val 
145 150 155 160 



35 



ser Ser- He Glu Phe Asp Arg Asp Cys Asp Tyr Phe Ala He Ala Gly 
165 170 175 



Val Thr Lys Lys He Lys Val Tyr Glu Tyr Asp Thr Val He Gin Asp 
40 180 185 190 

Ala Val Asp He His Tyr Pro Glu Asn Glu Met Thr Cys Asn Ser Lys 
195 200 205 

45 He Ser Cys He Ser Trp Ser Ser Tyr His Lys Asn Leu Leu Ala Ser 
210 215 220 

ser Asp Tyr Glu Gly Thr Val He Leu Trp Asp Gly Phe Thr Gly Gin 
225 230 235 240 

Arg ser Lys Val Tyr Gin Glu His Glu Lys Arg Cys Trp Ser Val Asp 
245 250 255 

Phe Asn Leu Met Asp Pro Lys Leu Leu Ala Ser Gly Ser Asp Asp Ala 
55 260 265 270 

Lys Val Lys Leu Trp Ser Thr Asn Leu Asp Asn Ser Val Ala Ser He 
275 280 285 

60 Glu Ala Lys Ala Asn Val Cys Cys Val Lys Phe Ser Pro Ser Ser Arg 
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290 295 300 

Tyr His Leu Ala Phe Gly Cys Ala Asp His Cys Val His Tyr Tyr Asp 
305 310 315 320 

5 

Leu Arg Asn Thr Lys Gin Pro He Met Val Phe Lys Gly His Arg Lys 
325 330 335 

Ala Val Ser Tyr Ala Lys Phe Val Ser Gly Glu Glu He Val Sex Ala 
10 340 345 350 

Ser Thr Asp Ser Gin Leu Lys Leu Trp Asn Val Gly Lys Pro Tyr Cys 

355 360 365 

15 Leu Arg Ser Phe Lys Gly His He Asn Glu Lys Asn Phe Val Gly Leu 
370 375 380 



20 



Ala Ser Asn Gly Asp Tyr He Ala Cys Gly Ser Glu Asn Asn Ser Leu 
385 390 395 400 

Tyr Leu Tyr Tyr Lys Gly Leu Ser Lys Thr Leu Leu Thr Phe Lys Phe 
405 410 415 



Asp Thr Val Lys Ser Val Leu Asp Lys Asp Arg Lys Glu Asp Asp Thr 
25 420 425 430 

Asn Glu Phe Val Ser Ala Val Cys Trp Arg Ala Leu Pro Asp Gly Glu 
435 440 445 

30 Ser Asn Val Leu He Ala Ala Asn Ser Gin Gly Thr He Lys Val Leu 
450 455 460 

Glu Leu Val Xaa 
465 

35 



(2) INFORMATION FOR SEQ ID NO: 381: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381: 

45 

Met Arg Lys Glu Asp Gly Phe Trp Phe Phe Phe Phe Leu Phe Phe Phe 
15 10 15 

Val Val Gly Ser Lys Phe Val Asn Gly Asn Lys Leu Val 
50 20 25 



55 



(2) INFORMATION FOR SEQ ID NO: 382: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 
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Met Pro Leu Ala Pro Tyr Cys Asp Leu Leu Val Ala Leu Ser Phe Ala 
15 10 15 

5 Leu Val Leu Glu Ser Pro Val Asp Ser Ser Asp Phe Thr 
20 25 



10 (2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE2«3TH: 138 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 



20 



Met Asn Ser Leu Val Ser Trp Gin Leu Leu Leu Phe Leu Cys Ala Thr 
15 10 15 

His Phe Gly Glu Pro Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg 
20 25 30 



Pro Thr Gly Gin Gin Leu Glu Ser Leu Gly Leu Leu Ala Pro Gly Glu 
25 35 40 45 

Gin Ser Leu Pro Cys Ttir Glu Arg Lys Pro Ala Ala Thr Ala Arg Leu 
50 55 60 

30 Ser Arg Arg Gly Thr Ser Leu Ser Pro Pro Pro Glu Ser Ser Gly Ser 
65 70 75 80 

Pro Gin Gin Pro Gly Leu Ser Ala Pro His Ser Arg Gin He Pro Ala 
85 90 95 

35 

Pro Gin Gly Ala Val Leu Val Gin Arg Glu Lys Asp Leu Pro Asn Tyr 
100 105 110 

Asn Trp Asn Ser Phe Gly Leu Arg Phe Gly Lys Arg Glu Ala Ala Pro 
40 115 120 125 

Gly Asn His Gly Arg Ser Ala Gly Arg Gly 
130 135 

45 

(2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY; linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

55 Met Ser Cys Phe He Asp Ser Xaa Asp Ser Lys He Leu His Leu Leu 
15 10 15 

Val Val Ser Phe He Cys Xaa Leu Phe Leu Leu He Leu Thr His Gly 
20 25 30 

60 
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He Leu He Leu Arg Xaa Phe Phe Ser Val Xaa Xaa His Ser Leu Lys 
35 40 45 

Asn Asn Leu Glu Glu Tyr Leu He Leu Met Asn Lys Ala Leu Leu Thr 
5 50 55 60 

Arg Glu Asp Phe Phe Val Leu Pro Xaa Ala 
65 \ 70 

10 

(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 521 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

20 Met Ser Ala Gly Glu Val Glu Arg Leu Val Ser Glu Leu Ser Gly Gly 
15 10 15 

Thr Gly Gly Asp Glu Glu Glu Glu Trp Leu Tyr Gly Asp Glu Asn Glu 
20 25 30 

25 

Val Glu Arg Pro Glu Glu Glu Asn Ala Ser Ala Asn Pro Pro Ser Gly 
35 40 45 

He Glu Asp Glu Thr Ala Glu Asn Gly Val Pro Lys Pro Lys Val Thr 
30 50 - 55 60 

Glu Thr Glu Asp Asp Ser Asp Ser Asp Ser Asp Asp Asp Glu Asp Asp 
65 70 75 80 

35 Val His Val Thr He Gly Asp He Lys Thr Gly Ala Pro Gin Tyr Gly 
85 90 95 

Ser Tyr Gly Thr Ala Pro Val Asn Leu Asn He Lys Thr Gly Gly Arg 
100 105 110 

40 

Val Tyr Gly Thr Thr Gly Thr Lys Val Lys Gly Val Asp Leu Asp Ala 

115 120 125 

Pro Gly Ser He Asn Gly Val Pro Leu Leu Glu Val Asp Leu Asp Ser 
45 130 . 135 140 

Phe Glu Asp Lys Pro Trp Arg Lys Pro Gly Ala Asp Leu Ser Asp Tyr 
145 150 155 160 

50 Phe Asn Tyr Gly Phe Asn Glu Asp Thr Trp Lys Ala Tyr Cys Glu Lys 
165 170 175 

Gin Lys Arg He Arg Met Gly Leu Glu Val He Pro Val Thr Ser Thr 
180 185 190 

55 

Thr Asn Lys He Thr Val Gin Gin Gly Arg Thr Gly Asn Ser Glu Lys 
195 200 205 

Glu Thr Ala Leu Pro Ser Thr Lys Ala Glu Phe Thr Ser Pro Pro Ser 
60 210 215 220 
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Leu Phe Lys Thr Gly Leu Pro Pro Ser Arg Arg Leu Pro Gly Ala lie 
225 230 235 240 

Asp Val He Gly Gin Thr He Thr He Ser Arg Val Glu Gly Arg Arg 
245 250 255 



10 



Arg Ala Asn Glu Asn Ser Asn He Gin Val Leu Ser Glu Arg Ser Ala 
260 265 270 

Thr Glu Val Asp Asn Asn Phe Ser Lys Pro Pro Pro Phe Phe Pro Pro 

275 280 285 



Gly Ala Pro Pro Thr His Leu Pro Pro Pro Pro Phe Leu Pro Pro Pro. 
15 290 295 300 

Pro Thr Val Ser Thr Ala Pro Pro Leu He Pro Pro Pro Gly Phe Pro 
305 310 315 320 

20 Pro Pro Pro Gly Ala Pro Pro Pro Ser Leu He Pro Thr He Glu Ser 
325 330 335 



25 



Gly His Ser Ser Gly Tyr Asp Ser Arg Ser Ala Arg Ala Phe Pro Tyr 
340 345 350 

Gly Asn Val Ala Phe Pro His Leu Pro Gly Ser Ala Pro Ser Trp Pro 
355 360 365 



Ser Leu Val Asp Thr Ser Lys Gin Trp Asp Tyr Tyr Ala Arg Arg Glu 
30 370 375 380 



35 



Lys Asp Arg Asp Arg Glu Arg Asp Arg Asp Arg Glu Arg Asp Arg Asp 
385 390 395 400 

Arg Asp Arg Glu Arg Glu Arg Thr Arg Glu Arg Glu Arg Glu Arg Asp 
405 410 415 



40 



His Ser Pro Thr Pro Ser Val Phe Asn Ser Asp Glu Glu Arg Tyr Arg 
420 425 430 

Tyr Arg Glu Tyr Ala Glu Arg Gly Tyr Glu Arg His Arg Ala Ser Arg 
435 440 445 



Glu Lys Glu Glu Arg His Arg Glu Arg Arg His Arg Glu Lys Glu Glu 
45 450 455 460 

Thr Arg His Lys Ser Ser Arg Ser Asn Ser Arg Arg Arg His Glu Ser 
465 470 475 480 

50 Glu Glu Gly Asp Ser His Arg Arg His Lys His Lys Lys Ser Lys Arg 
485 490 495 



55 



Ser Lys Glu Gly Lys Glu Ala Gly Ser Glu Pro Ala Pro Glu Gin Glu 
500 505 510 

Ser Thr Glu Ala Thr Pro Ala Glu Xaa 
515 520 



60 
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(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LH^KjTH: 137 amino acids 
5 (B) TYPE: andno acid 

(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 

Met Asn Ser Arg Gly He Trp Leu Ala Tyr He He Leu Val Gly Leu 
10 1 5 10 15 

Leu His Met Val Leu Leu Ser He Pro Phe Phe Ser He Rro Val Val 
20 25 30 

15 Trp Thr Leu Thr Asn Val He His Asn Leu Ala Thr Tyr Val Phe Leu 
35 40 45 

His Thr Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ala 
50 55 60 

20 

Arg Leu Leu Thr His Trp Glu Gin Met Asp Tyr Gly Leu Gin Phe Thr 
65 70 75 80 

Ser Ser Arg Lys Phe Leu Ser He Ser Pro He Val Leu Tyr Leu Leu 
25 85 90 95 

Ala Ser Phe Tyr Thr Lys Tyr Asp Ala Ala His Phe Leu He Asn Thr 
100 105 110 

30 Ala Ser Leu Leu Ser Val Leu Leu Pro Lys Leu Pro Gin Phe His Gly 

115 120 125 

Val Arg Val Phe Gly He Asn Lys Tyr 
130 135 

35 



(2) INFORMATION FOR SEQ ID NO: 387: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID IK): 387: 

45 

Met Ala Ala Gin Lys Asp Gin Gin Lys Asp Ala Glu Ala Glu Gly I^u 
15 10 15 

Ser Gly Thr Thr Leu Leu Pro Lys Leu He Pro Ser Gly Ala Gly Arg 
50 20 25 30 

Glu Trp Leu Glu Arg Arg Arg Ala Thr He Arg Pro Trp Ser Thr Phe 
35 40 45 

55 Val Asp Gin Gin Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys 
50 55 60 



Gin Arg Leu Val Arg Asn Val Glu Tyr Tyr Gin Ser Asn Tyr Val Phe 
65 70 75 80 
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Val Phe Leu Gly I^eu lie Leu Tyr Cys Val Val Thr Ser Pro Met Leu 
85 90 95 

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr lie Leu Tyr Leu 
5 100 105 . 110 

Arg Thr Leu Glu Ser Lys Leu Val Leu Phe Gly Arg Glu Val Ser Pro 
115 120 125 

10 Ala His Gin Tyr Ala Leu Ala Gly Gly He Ser Phe Pro Phe Phe Trp 
130 135 140 

Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr Leu 
145 150 155 160. 

15 

Val Val He Gly Ser His Ala Ala Phe His Gin He Glu Ala Val Asp 
165 . 170 175 

Gly Glu Glu Leu Gin Met Glu Pro Val Xaa 
20 180 185 



25 



(2) INFORMATION FOR SEQ ID NO: 388: 



(X) SEQUENCE CHARACTERISTICS: 

(A) LE1K3TH: 1 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 388: 



35 



50 



Met 
1 



(2) INFORMATICS^ FOR SEQ ID NO: 389: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 299 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

45 Met Leu Ser He Phe Tyr Phe Ala He Pro Val Gly Ser Gly Leu Gly 
15 10 15 

Tyr He Ala Gly Ser Lys Val Lys Asp Met Ala Gly Asp Trp His Trp 
20 25 30 



Ala Leu Arg Val Thr Pro Gly Leu Gly Val Val Ala Val Leu Leu Leu 
35 40 45 



Phe Leu Val Val Arg Glu Pro Pro Arg Gly Ala Val Glu Arg His Ser 
55 50 55 60 

Asp Leu Pro Pro Leu Asn Pro Thr Ser Trp Trp Ala Asp Leu Arg Ala 
65 70 75 80 



60 Leu Ala Arg Asn Pro Ser Phe Val Leu Ser Ser Leu Gly Phe Thr Ala 
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85 90 95 

Val Ala Phe Val Thr Gly Ser Leu Ala Leu Trp Ala Pro Ala Phe Leu 
^ 100 105 110 

Leu Arg Ser Arg Val Val Leu Gly Glu Thr Pro Pro Cys Leu Pro Gly 
115 120 125 

Asp Ser Cys Ser Ser Ser Asp Ser Leu He Phe Gly Leu He Thr Cys 
10 130 135 140 

Leu Thr Gly Val Leu Gly Val Gly Leu Gly Val Glu He Ser Arg Arg 
145 150 155 160 

15 Leu Arg His Ser Asn Pro Arg Ala Asp Pro Leu Val Cys Ala Thr Gly 
165 170 175 

Leu Leu Gly Ser Ala Pro Phe Leu Phe Leu Ser Leu Ala Cys Ala Arg 
180 185 190 

20 

Gly Ser He Val Ala Thr Tyr He Phe He Phe He Gly Glu Thr Leu 
195 200 205 

Leu Ser Met Asn Trp Ala He Val Ala Asp He Leu Leu Tyr Val Val 
25 210 215 220 

He Pro Thr Arg Arg Ser Thr Ala Glu Ala Phe Gin He Val Leu Ser 
225 230 235 240 

30 His Leu Leu Gly Asp Ala Gly Ser Pro Tyr Leu He Gly Leu He Ser 
245 250 255 



35 



Asp Arg Leu Arg Arg Asn Trp Pro Pro Ser Phe Leu Ser Glu Phe Arg 
260 265 270 

Ala Leu Gin Phe Ser Leu Met Leu Cys Ala Phe Val Gly Ala Leu Gly 
275 280 285 



Gly Ala Leu Pro Gly His Arg His Leu His Xaa 
40 290 295 



45 



(2) INFORMATION FOR SEQ ID NO: 390: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

Met Gly Pro Gin Gly Trp Val Arg Pro Leu Lys Thr Ala Pro Lys Leu 
15 10 15 

55 Gly Glu Ala He Arg Leu He Leu Phe Leu Asn Phe Val Lys Gin Cys 

20 25 30 

He Ala Ser Val Asn Leu Cys He Leu Arg Leu Asn He Thr Pro Leu 
35 40 45 

60 
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Leu 



<2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMjTH: 61 amino acids 
10 (B) TY^PE: amino acid 

(D) TOPOIjOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

Met Tyr Val Asn Tyr Gly Thr Arg Asn Tyr Ser Thr Glu Gly Pro Ala 
15 1 5 10 15 

Ala Leu Leu Asp Gin Ala Lys Leu Ser Leu Leu Val Trp Val Leu Cys 
20 25 30 

20 Phe Val Leu Leu Phe Val Cys Phe Cys Gly Leu Ser Tyr Val Val lie 
35 40 45 



25 



Ala Gin Val Pro Val Gly Leu Leu Cys lie Thr Glu Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 392: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 392: 



35 



Met Leu Trp Phe Ala Asn Phe Phe Thr Tyr Leu Phe Leu Ser Gin Ser 
15 10 15 



Val Ala Phe Val His He Ser His He Gly Val Arg Gin Val Asn Thr 
40 20 25 30 

Asn Cys Tyr Phe Ser Arg Lys Ser Tyr Cys Tyr Gly He Leu Asn Pro 

35 40 45 

45 He Asn Cys He Lys Gly Lys Lys Lys Lys Lys Lys Lys Lys Lys Lys 
50 55 60 



50 



Lys Lys Lys Lys He Pro Ala Gly Arg Xaa Leu Phe Pro Phe Gly 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 393: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
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589 



Met Pro Gly Ala Phe Ser Glu Thr Val He Asn Asp Leu Leu Ser Leu 
15 10 15 

Phe Leu Val Leu Pro Ala Glu Leu Ser Tyr Ser Thr Leu Ser Gly Val 
5 20 25 30 

Tyr Arg Asn Ala 
35 

10 

(2) INFORMATION FOR SEQ ID NO: 394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 

20 Met Ala Gin Ser Arg Asp Gly Gly Asn Pro Phe Ala Glu Pro Ser Glu 
15 10 15 

Leu Asp Asn Pro Phe Gin Asp Pro Ala Val He Gin His Arg Pro Ser 

25 

Arg Gin Tyr Ala Thr Leu Asp Val Tyr Asn Pro Phe Glu Thr Arg Glu 
35 40 45 

Pro Pro Pro Ala Tyr Glu Pro Pro Ala Pro Ala Pro Leu Pro Pro Pro 
30 50 55 60 

Ser Ala Pro Ser Leu Gin Pro Ser Arg Lys Leu Ser Pro Thr Glu Pro 
65 70 75 80 

35 Lys Asn Tyr Gly Ser Tyr Ser Thr Gin Ala Ser Ala Ala Ala Ala Thr 
85 90 95 

Ala Glu Leu Leu Lys Lys Gin Glu Glu Leu Asn Arg Lys Ala Glu Glu 
100 105 110 

40 

Leu Asp Arg Arg Ser Glu Ser Cys Ser Met Leu Pro Trp Xaa Ala Gin 
115 120 125 

Leu Leu Asp Arg Thr lie Gly Pro Leu Tyr Leu Leu Phe Val Gin Phe 
45 130 135 140 

Ser Pro Ala Phe Ser Arg Thr Ser Pro Trp Arg Ser Pro Lys Asn Phe 
145 150 155 160 

50 Arg Arg Leu Tyr Pro Pro Cys Thr Thr Ser Gly Cys Ala Ala Arg Trp 
165 170 175 

Xaa Phe Ser Xaa 
180 

55 



60 



(2) INFORMATION FOR SEQ ID NO: 395; 

(i) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: 21 amino acids 
(B) TVPE: amino acid 
(D) TOPOLOGV: linear 
^ (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 395: 

Met Pro Thr Pro Cys Thr Ser Leu Pro Ser Cys Cys Gin His Arg Ser 
15 10 15 

lie Thr Met Thr Leu 
10 20 



15 



(2) INFORMATION FOR SEQ ID NO: 396: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 

Met Pro Leu Phe He Pro Leu He Phe Phe Leu Ser Leu Leu His Cys 
15 10 15 

25 Gin Ser Lys His Pro He Gin Met Ser Leu Cys Met Cys Val Asn He 
20 25 30 



30 



Ser Leu Val Trp Ser Pro Val Arg Trp He Phe Gly Ser Lys Gly Leu 
35 40 45 

Phe Ser Val His Leu Gin Ser Ser Gin Arg Pro Ser 
50 55 60 



40 



35 

(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397: 

Met Ala Gly Pro Arg Pro Xaa Trp Arg Asp Gin Leu Leu Phe Met Ser 
45 1 5 10 15 

He He Val Leu Val He Val Val He Cys Leu Met Leu Tyr Ala Leu 
20 25 30 



50 



55 



60 



Leu Trp Glu Ala Gly Asn Leu Thr Asp Leu Pro Asn Leu Arg He Gly 
35 40 45 

Phe Tyr Asn Phe Cys Leu Trp Asn Glu Asp Thr Ser Thr Leu Gin Cys 
50 55 60 

His Gin Phe Pro Glu Leu Glu Ala Leu Gly Val Pro Arg Val Gly Leu 
^5 70 75 80 

Gly Leu Ala Arg Leu Gly Val Tyr Gly Ser Leu Val Leu Thr Leu Phe 
85 90 95 
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Ala Pro Gin Pro Leu Leu Leu Ala Gin Cys Asn Xaa Asp Glu Arg Ala 
100 105 110 

5 Trp Arg Leu Ala Val Gly Phe Leu Ala Val Ser Ser Val Leu Leu Ala 
115 120 125 



10 



15 



Gly Gly Leu Gly Leu Phe Leu Ser Tyr Val Trp Asn Gly Ser Xaa Ser 
130 135 140 

Pro Ser Arg Gly Leu Gly Phe Xaa 
145 150 



(2) INFORMATION FOR SEQ ID NO: 398: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQJGTH: 480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398: 

Met Ser Asp Gly Phe Asp Arg Ala Pro Gly Ala Gly Arg Gly Arg Xaa 
1 5 . 10 15 

Arg Gly Leu Gly Arg Gly Gly Gly Gly Pro Xaa Gly Gly Gly Phe Pro 
20 25 30 

Xaa Gly Xaa Xaa Pro Ala Glu Arg Xaa Arg His Gin Pro Pro Gin Pro 
35 40 45 

Lys Ala Pro Gly Phe Leu Gin Pro Xaa Pro Leu Arg Gin Pro Arg Thr 
^0 55 60 

Thr Pro Pro Pro Gly Ala Gin Cys Glu Val Pro Ala Ser Pro Gin Arg 
70 75 80 

Pro Ser Arg Pro Gly Ala Leu Pro Glu Gin Thr Arg Pro Leu Arg Ala 

85 90 95 

Pro Pro Ser Ser Gin Asp Lys He Pro Gin Gin Asn Ser Glu Ser Ala 
100 105 110 

Met Ala Lys Pro Gin Val Val Val Ala Pro Val Leu Met Ser Lys Leu 
115 120 125 

Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr 
130 135 140 

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr Pro Thr Leu Ser Glu 
150 155 160 

Tyr Val Gin Asp Phe Leu Asn His Leu Thr Glu Gin Pro Gly Ser Phe 
165 170 175 

Glu Thr Glu He Glu Gin Phe Ala Glu Thr Leu Asn Gly Cys Val Thr 
180 185 190 

60 Thr Asp Asp Ala Leu Gin Glu Leu Val Glu Leu He Tyr Gin Gin Ala 



20 



25 



30 



35 



40 



45 



50 



55 
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592 



195 



200 



205 



Thr Ser He Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr 
210 215 220 

Leu Ser His His Leu Thr He Ser Pro Gin Ser Gly Asn Phe Arg Gin 
225 230 235 240 



10 



Leu Leu Leu Gin Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gin Ala 
245 250 255 



15 



Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu 
260 265 270 

Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu He Lys Gly Thr Asn Gly 
275 280 285 



20 



Gin Val Thr Arg Ala Asp He Leu Gin Val Gly Leu Arg Glu Leu Leu 
290 295 300 

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu He Cys Ala Val 
305 310 315 320 



Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu 
25 325 330 335 



30 



Lys Gly Lys Met Asp Met Glu Glu He He Gin Arg He Glu Asn Val 
340 345 350 

Val Leu Asp Ala Asn Cys Ser Arg Asp Val Lys Gin Met Leu Leu Lys 
355 360 365 



35 



Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser 

370 375 380 

Thr Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn 
385 390 395 400 



40 



Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp 
405 410 415 



45 



Pro Asp Tyr Gin Glu Lys Tyr Gin Glu Leu Leu Glu Arg Glu Asp Phe 
420 425 430 

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp 
435 440 445 



50 



Pro Tyr Leu Asp Asp He Asp Asp Glu Met Asp Pro Glu He Glu Glu 
450 455 460 

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gin Xaa 
465 470 475 480 



55 



60 



(2) INFORMATION FOR SEQ ID NO: 399: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LQIGTH: 423 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

Met Glu Pro Lys Thr lie Thr Asp Ala Leu Ala Ser Ser lie He Lys 
15 10 15 

10 Ser Val Leu Pro Asn Phe Leu Pro Tyr Asn Val Met Leu Tyr Ser Asp 
20 25 30 



15 



30 



45 



Ala Pro Val Ser Glu Leu Ser Leu Glu Leu Leu Leu Leu Gin Val Val 
35 40 45 

Leu Pro Ala Leu Leu Glu Gin Gly His Thr Arg Gin Tip Leu Lys Gly 
50 55 60 



Leu Val Arg Ala Trp Thr Val Thr Ala Gly Tyr Leu Leu Asp Leu His 
20 65 70 75 80 

Ser Tyr Leu Leu Gly Asp Gin Glu Glu Asn Glu Asn Ser Ala Asn Gin 
85 90 95 

25 Gin Val Asn Asn Asn Gin His Ala Arg Asn Asn Asn Ala He Pro Val 
100 105 110 



Val Gly Glu Gly Leu His Ala Ala His Gin Ala He Leu Gin Gin Gly 
115 120 125 

Gly Pro Val Gly Phe Gin Xaa Tyr Arg Arg Pro Leu Asn Phe Pro Leu 
130 135 140 



Arg He Phe Leu Leu He Val Phe Met Cys He Thr Leu Leu He Ala 
150 155 160 

Ser Leu He Cys Leu Thr Leu Pro Val Phe Ala Gly Arg Trp Leu Met 
165 170 175 

40 Ser Phe Trp Thr Gly Thr Ala Lys He His Glu Leu Tyr Thr Ala Ala 
180 185 190 



Cys Gly Leu Tyr Val Cys Trp Leu Thr He Arg Ala Val Thr Val Met 
195 200 205 

Val Ala Trp Met Pro Gin Gly Arg Arg Val He Phe Gin Lys Val Lys 
210 215 220 



Glu Trp Ser Leu Met He Met Lys Thr Leu He Val Ala Val Leu Leu 
50 225 230 235 240 

Ala Gly Val Val Pro Leu Leu Leu Gly Leu Leu Phe Glu Leu Val He 
245 250 255 

55 Val Ala Pro Leu Arg Val Pro Leu Asp Gin Thr Pro Leu Phe Tyr Pro 

260 265 270 



60 



Trp Gin Asp Trp Ala Leu Gly Val Leu His Ala Lys He He Ala Ala 
275 280 285 
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He Thr Leu Met Gly Pro Gin Trp Trp Leu Lys Thr Val He Glu Gin 
290 295 300 

Val Tyr Ala Asn Gly He Arg Asn He Asp Leu His Tyr He Val Arg 
5 305 310 315 320 

Lys Leu Ala Ala Pro Val He Ser Val Leu Leu Leu Ser Leu Cys Val 
325 330 335 

10 Pro Tyr Val He Ala Ser Gly Val Val Pro Leu Leu Gly Val Thr Ala 
340 345 350 

Glu Met Gin Asn Leu Val His Arg Arg He Tyr Pro Phe Leu Leu Met 
355 360 365 

15 

Val Val Val Leu Met Ala He Leu Ser Phe Gin Val Arg Gin Phe Lys 
370 375 380 

Arg Leu Tyr Glu His He Lys Asn Asp Lys Tyr Leu Val Gly Gin Arg 
20 385 390 395 400 

Leu Val Asn Tyr Glu Arg Lys Ser Gly Lys Gin Gly Ser Ser Pro Pro 
405' 410 415 

25 Pro Pro Gin Ser Ser Gin Glu 
420 



30 (2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 amino acids 

(B) TVPE: amino acid 

35 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 



40 



Met Leu Arg Leu Asp He He Asn Ser I^u Val Thr Thr Val Phe Met 
15 10 15 

Leu He Val Ser Val Leu Ala Leu He Pro Glu Thr Thr Thr Leu Thr 
20 25 30 



Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala 
45 35 40 45 

Asp Gly Ala Leu He Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro 
50 55 60 

50 Tyr Gin Lys Lys Pro Val His Glu Lys Lys Glu Val Leu Xaa 
65 70 75 



55 (2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 
60 (D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401: 

Met Leu Lys Gin Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser 
15 10 15 

5 

His Cys Trp Gly Leu Pro Leu His Val Ala Pro Leu Cys Arg Gly His 
20 25 30 

Gin Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gin Gly Ala Trp 
10 35 40 45 

Asn Arg Asn Leu Ala Asn Gin Arg His Phe Phe Cys Pro Ser lie Phe 
50 55 60 

15 His Thr Cys Pro Thr Val Leu Phe Phe Xaa 
65 70 



20 (2) INFORMATICaa FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 

Ala Arg Thr He Leu Val Leu Tyr Leu Ser Leu Gin Arg Leu Glu Asn 
15 10 15 

Leu Ala Tyr His 
20 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 403: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 
40 (B) TYPE: amino acid 

{D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 

Met Pro Leu Pro Ser Val Pro He Leu Gly He Phe Ser Phe Leu He 
45 1 5 10 15 

Pro Ser Ser Gin Gly Val Ser Tyr Thr Lys Leu Pro He Ser Ser Pro 
20 25 30 

50 Gin Tyr Ser Pro Phe Val Asn Asp His Phe Ser Phe Leu Asn Pro Phe 
35 40 45 

Pro Val Gin He His Thr Gly Phe Ala Arg Val Gly Ser Tyr Met Gin 
50 55 60 

55 

Met Pro Leu Val His Leu Cys Leu Leu Gin Thr Ser Leu Met Lys Asn 
65 70 75 80 



60 



Ser Gly Val Gin Gin Gly Ser 
85 
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(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIFTIC^: SEQ ID NO: 404: 

Met Asn Ala Ala Met Val His lie Asn Arg Ala Leu Lys Leu lie lie 
15 10 15 

IS Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala Val 
20 25 30 

Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly He Thr 
35 40 45 

20 

Leu Leu He Leu Ala Glu Leu Leu He Phe Ser Val Pro He Val Tyr 
50 55 60 

Glu Lys Tyr Lys Thr Gin He Asp His Tyr Val Gly He Ala Arg Asp 
25 65 70 75 80 

Gin Thr Lys Ser He Val Glu Lys He Pro Ser Lys 
85 90 

30 

(2) INFORMATICS FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS: . 
35 (A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405: 

40 Met Ala Cys Ser Cys Leu Met He Gin Ser Phe Ser Thr Ser Ala Leu 
15 10 15 

Val Leu Phe Tyr Gly 
20 

45 



5 



10 



(2) INFORMATION FOR SEQ ID NO: 406: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 

55 

Met Glu Glu Gly Gly Asn Leu Gly Gly Leu He Lys Met Val His Leu 
15 10 15 



Leu Val Leu Ser Gly Ala Trp Gly Met Gin Met Trp Val Thr Phe Val 
60 20 25 30 
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Ser Gly Phe Pro Ala Phe Pro Lys Pro Ser Pro Thr Tyr Leu Arg Thr 
35 40 45 

5 Ser Ala Glu Gin Thr Leu Pro Leu Leu Leu Pro His Leu His Gly Leu 
50 55 60 

Cys Leu His Gin Pro Leu His Leu Gly Phe Thr Ala Cys Leu Gly Ser 
65 70 75 80 

10 

Ala His He Leu Gly Gly Gin Pro Ala Leu Pro Ala Val Pro Glu Pro 
85 90 95 

Tyr Ala Gly His Cys Gin Arg Pro Leu Ala Gly Thr Pro His His Ser . 
15 100 105 110 

Cys His Val Gly Pro Ala Asn Arg Gly Arg Arg Ser Glu Ala Trp Val 
115 120 125 

20 Gly Arg Tyr Gin Ala Ala Asn Arg Phe Pro He Leu Asn Ala Xaa Cys 
130 135 140 

Glu Arg Arg Thr Pro Ser Thr Val Leu Ser Ala Arg He Ser Ser Ala 
145 150 155 160 

25 

Thr Met Gly Cys Pro Leu Phe Ala He Trp Ala Ala Ser Xaa 
165 170 



30 

(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 407: 

Met Ala Phe He Leu Leu Phe Tyr Cys Leu Met Thr Phe Leu Ser Leu 
40 1 5 10 15 

Glu Gin Asn Ser Ala Thr Val Glu Pro Ser Ser His Glu He Leu His 
20 25 30 

45 Leu Leu Gin Asn Cys Phe Glu Leu Leu Arg Thr Ser Thr Ser Gin Cys 
35 40 45 

Thr Glu Gly He Pro Cys Gin Arg Tyr Gin Asn Gly Leu His He Xaa 
50 55 60 

50 



55 

(2) INFORMATION FOR SEQ ID NO: 408: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 280 amino acids 
60 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 408: 

Met Glu Ala Val Val Asn Leu Tyr Gin Glu Val Met Lys His Ala Asp 
5 1 5 10 15 

Pro Arg He Gin Gly Tyr Pro Leu Met Gly Ser Pro Leu Leu Met Thr 
20 25 30 

10 Ser He Leu Leu Thr Tyr Val Tyr Phe Val Leu Ser Leu Gly Pro Arg 
35 40 45 



15 



30 



45 



He Met Ala Asn Arg Lys Pro Phe Gin Leu Arg Gly Phe Met He Val 
50 55 ^ 60 

Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr He Val Tyr Glu Phe 
65 70 75 80 



Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp Arg Cys Asp Pro Val 
20 85 90 95 

Asp Tyr Ser Asn Ser Pro Glu Ala Leu Arg Met Val Arg Val Ala Trp 
100 105 110 

25 Leu Phe Leu Phe Ser Lys Phe He Glu Leu Met Asp Thr Val He Phe 
115 120 125 



He Leu Arg Lys Lys Asp Gly Gin Val Thr Phe Leu His Val Phe His 
130 135 140 

His Ser Val Leu Pro Trp Ser Trp Trp Trp Gly Val Lys He Ala Pro 
145 150 155 160 



Gly Gly Met Gly Ser Phe His Ala Met He Asn Ser Ser Val His Val 
35 165 170 175 

He Met Tyr Leu Tyr Tyr Gly Leu Ser Ala Phe Gly Pro Val Ala Gin 
180 185 190 

40 Pro Tyr Leu Trp Trp Lys Lys His Met Thr Ala He Gin Leu He Gin 
195 200 205 



Phe Val Leu Val Ser Leu His He Ser Gin Tyr Tyr Phe Met Ser Ser 
210 215 220 

Cys Asn Tyr Gin Tyr Pro Val He He His Leu He Trp Met Tyr Gly 
225 230 235 240 



Thr He Phe Phe Met Leu Phe Ser Asn Phe Trp Tyr His Ser Tyr Thr 
50 245 250 255 

Lys Gly Lys Arg Leu Pro Arg Ala Leu Gin Gin Asn Gly Ala Pro Gly 
260 265 270 

55 He Ala Lys Val Lys Ala Asn Xaa 
275 280 



60 



(2) INFORMATION FOR SEQ ID NO: 409: 



I 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409: 

Met Xaa Leu Trp Pro Gin Thr Cys Ser Gly Lys Phe Asp Gly Thr Leu 
15 10 15 

10 

Ala Phe Ser lie His Xaa Leu Ala Val He Leu Gly Asp Gin Leu Thr 
20 ^ 25 30 

Ala Ala Asp Leu Val Pro He Phe Asn Gly Phe Leu Lys Asp Leu Asp 
15 35 40 45 

Glu Val Arg He Gly Val Leu Lys His Leu His Asp Phe Leu Lys Leu 
50 55 60 

20 Leu His He Asp Lys Arg Arg Glu Tyr Leu Tyr Gin Leu Gin Glu Phe 
65 70 75 80 

I^eu Val Thr Asp Asn Ser Arg Asn Trp Arg Phe Arg Ala Glu Leu Ala 

85 90 95 

25 

Glu Gin Leu He Leu Leu Leu Glu Leu Tyr Ser Pro Arg Asp Val Tyr 
100 105 110 

Asp Tyr Leu Arg Pro He Ala Leu Asn Leu Cys Ala Asp Lys Val Ser 
30 115 120 125 

Ser Val Arg Trp He Ser Tyr Lys Leu Val Ser Glu Met Val Lys Lys 
130 135 140 

35 Leu His Ala Ala Thr Pro Pro Thr Phe Gly Val Asp Leu He Asn Glu 
145 150 155 160 

Leu Val Glu Asn Phe Gly Arg Cys Pro Lys Trp Ser^Gly Arg Gin Ala 
165 170 175 

40 

Phe Val Phe Val Cys Gin Thr Val He Glu Asp Asp Cys Leu Pro Met 
180 185 190 

Asp Gin Phe Ala Val His Leu Met Pro His Leu Leu Thr Leu Ala Asn 
45 195 200 205 

Asp /org Val Pro Asn Val Arg Val Leu Leu Ala Lys Thr Leu Arg Gin 
210 215 220 

50 Thr Leu Leu Glu Lys Asp Tyr Phe Leu Ala Ser Ala Ser Cys His Gin 

225 230 235 240 

Glu Ala Val Glu Gin Thr He Met Ala Leu Gin Met Asp Arg Asp Ser 
245 250 255 

55 

Asp Val Lys Tyr Phe Ala Ser He His Pro Ala Ser Thr Lys He Ser 
260 265 270 



Glu Asp Ala Met Ser Thr Ala Ser Ser Thr Tyr Xaa 
60 275 280 
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(2) INFORMATIC3N FOR SEQ ID NO: 410: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 

Met Leu Phe Leu Phe Phe Val He He Phe Leu Phe Val Phe Leu He 
15 10 15 

15 Leu He He Gin Phe Ser Lys Pro Leu Thr Asn Pro His Pro Pro Ala 

20 25 30 

Gly Xaa Ser Asp Arg Arg Arg Arg Tyr Ser Ser Tyr Arg Ser His Asp 
35 40 45 

20 

His Tyr Gin Arg Gin Arg Val Leu Gin Lys Glu Arg Ala He Glu Glu 
50 55 60 

Arg Arg Val Val Phe He Gly Lys He Pro Gly Arg Met Thr Arg Ser 
25 65 70 75 80 

Glu Leu Lys Gin Arg Phe Ser Val Phe Gly Glu He Glu Glu Cys Thr 
85 90 95 

30 He His Phe Arg Val Gin Gly Asp Asn Tyr Gly Phe Val Thr Tyr Arg 
100 105 110 

Tyr Ala Glu Glu Ala Phe Ala Ala He Glu Ser Gly His Lys Leu Arg 
115 120 125 

35 

Gin Ala Asp Glu Gin Pro Phe Asp Leu Cys Phe Gly Gly Arg Arg Xaa 
130 135 140 

Xaa Cys Lys Arg Ser Tyr Ser Asp Leu Asp Ser Asn Arg Glu Asp Phe 
40 145 150 155 160 

Asp Pro Ala Pro Val Lys Ser Lys Phe Asp Ser Leu Asp Phe Asp Thr 
165 170 175 

45 Leu Leu Lys Gin Ala Gin Lys Asn Leu Arg Arg 
180 185 



50 (2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411: 

Met Lys Leu Pro Gly Lys Phe Arg Arg Ala His Gin Gly Asn Leu Glu 
15 10 15 

60 
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Ser Gin Leu Thr Ser Glu Ser Tyr Tyr Lys Glu Thr Leu Ser Val Pro 

20 25 30 

Thr Val Glu His lie He Gin Glu Leu Lys Asp He Phe Ser Glu Gin 
5 35 40 45 

His Leu Lys Ala Leu Lys Cys Leu Ser Leu Val Pro Ser Val Met Gly 
50 55 60 

10 Gin Leu Lys Phe Asn Thr Ser Glu Glu His His Ala Asp Met Tyr Arg 
65 70 75 80 



15 



Ser Asp Leu Pro Asn Pro Asp Thr Leu Ser Ala Glu Leu His Cys Trp 
85 90 95 . 

Arg He Lys Trp Lys His Arg Gly Lys Asp He Glu Leu Pro Ser Thr 
100 105 110 



He Tyr Glu Ala Leu His Leu Pro Asp He Lys Phe Phe Pro Asn Val 
20 115 120 125 

Tyr Ala Leu Leu Lys Val Leu Cys He Leu Pro Val Met Lys Val Glu 
130 135 140 

25 Asn Glu Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg 
145 150 155 160 



30 



Asn Thr Leu Thr Asp Gin Axg Ser Ser Asn Leu Ala Leu Leu Asn He 
165 170 175 

Asn Phe Asp He Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr He 
180 185 190 



35 



Lys Leu Tyr Thr Xaa Xaa Ser Xaa Leu Xaa Thr Xaa Xaa Ser Xaa Xaa 
195 200 205 



40 



Val Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
210 215 220 

Asp Xaa Xaa Xaa Arg Glu Lys Ala Val Arg Cys Met Xaa 
225 230 235 



45 (2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 
50 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

Met Lys Pro Met Ala Val Val Ala Ser Thr Val Leu Gly Leu Val Gin 
15 10 15 

55 

Asn Met Arg Ala Phe Gly Gly He Leu Val Val Val Tyr Tyr Val Phe 
20 25 30 

Ala He He Gly He Asn Leu Phe Arg Gly Val He Val Ala Leu Pro 
60 35 40 45 
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Gly Asn Ser Ser Leu Ala Pro Ala Asn Gly Ser Ala Pro Cys Gly Ser 
50 55 60 

5 Phe Glu Gin Leu Glu Tyr Trp Ala Asn Asn Phe Asp Asp Phe Ala Ala 
65 70 75 80 



10 



Ala Leu Val Thr Leu Trp Asn Leu Met Val Val Asn Asn Trp Gin Val 
85 90 95 

Phe Leu Asp Ala Tyr Arg Arg Tyr Ser Gly Pro Trp Ser Lys He Tyr 
100 105 110 



Phe Val Leu Trp Trp Leu Val Ser Ser Val He Trp Val Asn Leu Phe 
15 115 120 125 

Leu Ala Leu He Leu Glu Asn Phe Leu His Lys Trp Asp Pro Arg Ser 
130 135 140 

20 His Leu Gin Pro Leu Ala Gly Thr Pro Glu Ala Thr Tyr Gin Met Thr 
145 150 155 ISO 



25 



Val Glu Leu Leu Phe Arg Asp He Leu Glu Glu Pro Gly Glu Asp Glu 
165 170 175 

Leu Thr Glu Arg Leu Ser Gin His Pro His Leu Trp Leu Cys Arg Xaa 
180 185 190 



30 



(2) INFORMATION FOR SEO ID NO: 413- 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMC5TH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 413: 

Asn Val Val Val Val Ala Phe Gly Leu He Leu He He Glu Ser Leu 
15 10 15 

45 Gly Glu Gin Cys Pro 
20 



50 (2) INFORMATICS FOR SEQ ID NO: 414: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linecir 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414: 

Met Asn Trp Gly Leu Ser He Trp Leu His Tyr Tyr Glu Lys Lys Lys 
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Glu Gin Val Phe Leu Val He Leu Ala His Val Val Arg Arg Cys Ala 
20 25 30 

Ser Asp Gly He Leu Gin Phe Glu Ser Ser Leu Leu Lys Met Arg Arg 
5 35 40 45 

Ala Pro Xaa 
50 

10 

(2) INFORMATION FOR SEQ ID NO: 415; 

(i) SEQUENCE CHARACTERISTICS: 
^5 (A) LQJGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415: 

20 Met Leu He He Ser Leu Arg Pro Gin Phe Pro Ser Leu He Val Gin 
^5 10 15 

Leu Glu Cys Ser Val Leu Phe Leu Pro He Ser Leu Asn Leu Leu Leu 

25 



30 

(2) INFORMATION FOR SEQ ID NO: 416: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416: 

Met Val Lys Val Cys T^n Asp Ser Asp Arg Trp Ser Leu He Ser Leu 
40 1 5 10 15 

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu 
20 25 30 

45 Arg Arg Gin Phe Glu Phe Ser Val Asp Ser Phe Gin He Lys Leu Asp 
35 40 45 

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Thr 
50 55 60 

50 

Phe His Pro Thr He He Gly Glu Ser Val Tyr Gly Asp Phe Gin Glu 
65 70 75 80 

Ala Phe Asp His Leu Cys Asn Lys He He Ala Thr Arg Asn Pro Glu 
55 85 90 95 

Glu He Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg 
100 105 110 



60 



Gly Phe Arg Pro Ala Ser Asp Glu He Lys Thr Leu Gin Arg Tyr Met 
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115 120 125 

Cys Ser Arg Phe Phe lie Asp Phe Ser Asp lie Gly Glu Gin Gin Arg 
130 135 140 

5 

Lys Leu Glu Ser Tyr Leu Gin Asn His Phe Val Gly lie Gly Arg Pro 
145 150 155 160 

Gin Val Xaa 

10 



(2) INFORMATION FOR SEQ ID NO; 417: 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 417: 

Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gin lie Phe 
15 10 15 

25 Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg lie lie Leu Gly 
20 25 30 

Ala Asn Ala lie Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser 
35 40 45 

30 

Ala Ser Phe Trp Ala .Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr 
50 55 60 

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Ser 
35 65 70 75 80 

Glu Asp Gly Ala Leu Met Asp Gly Gly Met Asp Leu Asn Met Glu Gin 

85 90 95 

40 Gly Met Ala Glu His Leu Lys Asp Val lie Leu Leu Thr Ala He Val 
100 105 110 

Gin Val Leu Ser Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu 
115 120 125 

45 

Ala Pro Gly Arg Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro 
130 135 140 

Trp Phe Thr Ala Asp Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys 
50 145 150 155 160 

Arg Gin Arg Arg Gin Glu Arg Arg Gin Met Lys Arg Leu Xaa 
165 170 

55 

(2) INFORMATION FOR SEQ ID NO: 418: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LQ^GTH: 50 amino acids 
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<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 418: 

5 Met Glu Leu Pro Lys Gly Leu Gin Gly Val Gly Pro Val Ala Met Met 
15 10 15 



10 



Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gin Ala Leu Arg 
20 25 30 

Gin Ser Gin Gly Lys Ser Pro l^eu Leu Trp Lys Arg Thr Cys Cys Leu 
35 40 45 



Ala Xaa 
15 50 



20 



(2) INFORMATION FCMl SEQ ID NO: 419: 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 120 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
25 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419: 

Met Leu Gly Lys Gly Gly Gly Arg Ala Gly Leu Leu Arg Tyr T^g Leu 
15 10 15 

30 Leu Tyr Phe Thr Leu Val Val Gly Glu Gly Glu Pro Gly Glu Asn Lys 
20 25 30 

Val Thr lie Pro Phe Phe Glu Thr Gly Lys Lys He He Phe Cys Ser 
35 40 45 

35 

Val Lys Met Val Glu Asn Ser Asn Val Pro Ser His Lys Gly Pro Val 
50 55 60 

Pro Leu Arg Ser Glu Gin Trp Glu Leu Lys He Ser Glu Thr Leu Gly 
40 65 70 75 80 

Glu Gly Lys He Gly Phe Leu Leu He Gly Arg Cys Ser Ser Gly Xaa 
85 90 95 

45 Gly Gly Leu Cys Phe Cys Trp Asp Val Leu Cys Cys Met Tyr Ala Tyr 
100 105 110 

Met Asp Arg Ser Leu Leu Ser Leu 
115 120 

50 



{2) INFORMATION FOR SEQ ID NO: 420: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 

60 
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606 

Met Thr His Leu Leu Leu Thr Ala Thr Val Thr Pro Ser Glu Gin Asn 
15 10 15 

Ser Ser Arg Glu Pro Gly Trp Glu Thr Ala Met Ala Lys Asp lie Leu 
5 20 25 30 

Gly Glu Ala Gly Leu His Phe Asp Glu Leu Asn Lys Leu Arg Val Leu 
35 40 45 

10 Asp Pro Glu Val Thr Gin Gin Thr lie Glu Leu Lys Glu Glu Cys Lys 
50 55 60 



Asp Phe Val Asp Lys lie Gly 
65 70 

15 



Lys Ala He Gly Ala Arg Asn 
20 100 



Gin Phe Gin Lys He Val Gly Gly Leu 
75 80 

Asn Glu Lys Met 
95 

Leu Leu Lys Ser He Ala Lys Gin Arg 
105 110 



He Glu Leu Val Asp Gin Leu Ala Lys Glu Ala Glu 
85 90 



Glu Ala Gin Gin Gin Gin Leu Gin Ala Leu He Ala Glu Lys Lys Met 
115 120 125 

25 Gin Leu Glu Arg Tyr Arg Val Glu Tyr Glu Ala Leu Cys Lys Val Glu 
130 135 140 

Ala Glu Gin Asn Glu Phe He Asp Gin Phe He Phe Gin Lys Xaa 
145 150 155 

30 



(2) INFORMATION FOR SEQ ID NO: 421: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC^I: SEQ ID NO: 421: 

40 

Met Asn Val Gly Val Ala His Ser Glu Val Asn Pro Asn Thr Arg Val 
15 10 15 

Met Asn Ser Arg Gly Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu 
45 20 25 30 

Leu His He Val Leu Leu Ser He Pro Phe Phe Ser Val Pro Val Ala 
35 40 45 

50 Trp Thr Leu Thr Asn He He His Asn Leu Gly Met Tyr Val Phe Leu 
50 55 60 

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ala 
65 70 75 80 

55 

Arg Leu Leu Thr His Trp Glu Gin Leu Asp Tyr Gly Val Gin Phe Thr 
85 90 95 

Ser Ser Arg Lys Phe Phe Thr He Ser Pro He He Leu Tyr Phe Leu 
60 100 105 110 
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Ala Ser Phe Tyr Thr Lys Tyr Asp Pro Thr His Phe lie Leu Asn Thr 
115 120 125 

5 Ala Ser Lieu Leu Ser Val Leu lie Pro Lys Met Pro Gin Leu His Gly 
130 135 140 

Val Arg lie Phe Gly He Asn Lys Tyr Xaa 
145 150 

10 



(2) INFORMATION FOR SEQ ID NO: 422: 

15 ^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 

20 

Met Val Cys Gly Gly Phe Ala Cys Ser Lys Asn Cys Leu Cys Ala Leu 
15 10 15 

Asn Leu Leu Tyr Thr Leu Val Ser Leu Leu Leu He Gly He Ala Ala 
25 20 25 30 

Trp Gly He Gly Phe Gly Leu He Ser Ser Leu Arg Val Val Gly Val 
35 40 45 

30 Val He Ala Val Gly He Phe Leu Phe Leu He Ala Leu Val Gly Leu 
50 55 60 

He Gly Ala Val Lys His His Gin Val Leu Leu Phe Phe Tyr Met He 
65 70 75 80 

35 

He Leu Leu Leu Val Phe He Val Gin Phe Ser Val Ser Cys Ala Cys 
85 90 95 

Leu Ala Leu Asn Gin Glu Gin Gin Gly Gin Leu Leu Glu Val Gly Trp 
40 100 105 110 

Asn Asn Thr Ala Ser Ala Arg Asn Asp He Gin Arg Asn Leu Asn Cys 
115 120 125 

45 Cys Gly Phe Arg Ser Val Asn Pro Asn Asp Thr Cys Leu Ala Ser Cys 
130 135 140 

Val Lys Ser Asp His Ser Cys Ser Pro Cys Ala Pro He He Gly Glu 
145 150 155 160 

50 

Tyr Ala Gly Glu Val Leu Arg Phe Val Gly Gly He Gly Leu Phe Phe 
165 170 175 

Ser Phe Thr Glu He Leu Gly Val Trp Leu Thr Tyr Arg Tyr Arg Asn 
55 180 185 190 

Gin Lys Asp Pro Arg Ala Asn Pro Ser Ala Phe Leu 
195 200 

60 
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(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 67 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423: 

10 Met Leu Gin Ser lie lie Lys Asn lie Trp lie Pro Met Lys Pro Tyr 
15 10 15 



15 



25 



40 



50 



Tyr Thr Lys Val Tyr Gin Glu lie Trp lie Gly Met Gly Leu Met Gly 
20 25 30 

Phe lie Val Tyr Lys lie Arg Ala Ala Asp Lys Arg Ser Lys Ala Leu 
35 40 45 



Lys Ala Ser Ala Pro Ala Pro Gly His. His Asn Gin lie Tyr Leu Glu 
20 50 55 60 



Tyr Met Xaa 
65 



(2) INFORMATION FOR SEQ ID NO: 424: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»«: SEQ ID NO: 424: 

35 Met Leu Gly Val Ser Leu Phe Leu Leu Val Val Leu Tyr His Tyr Val 
15 10 15 



Ala Val Asn Asn Pro Lys Lys Gin Glu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 425: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LE^K5TH: 299 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 



Met Ala Ala Xaa Glu Pro Ala Val Leu Ala Leu Pro Asn Ser Gly Ala 
1 5 10 15 



Gly Gly Ala Gly Ala Pro Ser Gly Thr Val Pro Val Leu Phe Cys Phe 
55 20 25 30 

Ser Val Phe Ala Arg Pro Ser Ser Val Pro His Gly Ala Gly Tyr Glu 
35 40 45 

60 Leu Leu lie Gin Lys Phe Leu Ser Leu Tyr Gly Asp Gin He Asp Met 
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50 



55 



60 



His Arg Lys Phe Val Val Gin Leu Phe Ala Glu Glu Trp Gly Gin Tyr 
65 70 75 80 

Val Asp Leu Pro Lys Gly Phe Ala Val Ser Glu Arg Cys Lys Val Arg 
85 90 95 



10 



Leu Val Pro Leu Gin lie Gin Leu Thr Thr Leu Gly Asn Leu Thr Pro 
100 105 110 



Ser Ser Thr Val Phe Phe Cys Cys Asp Met Gin Glu Arg Phe Arg Pro 
115 120 125 

15 Ala lie Lys Tyr Phe Gly Asp lie lie Ser Val Gly Gin Arg Leu Leu 
130 135 140 



20 



Gin Gly Ala Arg He Leu Gly He Pro Val He Val Thr Glu Gin Tyr 
145 150 155 160 

Pro Lys Gly Leu Gly Ser Thr Val Gin Glu He Asp Leu Thr Gly Val 
165 170 175 



Lys Leu Val Leu Pro Lys Thr Lys Phe Ser Met Val Leu Pro Glu Val 
25 180 185 190 

Glu Ala Ala Leu Ala Glu He Pro Gly Val Arg Ser Val Val Leu Phe 
195 200 205 

30 Gly Val Glu Thr His Val Cys He Gin Gin Thr Ala Leu Glu Leu Val 
210 215 220 



35 



Gly Arg Gly Val Glu Val His He Val Ala Asp Ala Thr Ser Ser Arg 
225 230 235 240 

Ser Met Met Asp Arg Met Phe Ala lieu Glu Arg Leu Ala Xaa Xaa Gly 

245 250 255 



He He Val Thr Thr Ser Glu Ala Val Leu Leu Gin Leu Val Ala Asp 
40 260 265 270 

Lys Asp His Pro Lys Phe Lys Glu He Gin Asn Leu He Lys Ala Ser 
275 280 285 



45 Ala Pro Glu Ser Gly Leu Leu Ser Lys Val Xaa 
290 295 



50 (2) INFORMATION FOR SEQ ID NO: 426: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
55 {D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426: 

Met Arg Asp Leu Gly Thr Leu Leu Ser Pro Val Cys Ser 
15 10 

60 
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(2) INFORMATION FOR SEQ ID NO: 427: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427: 

10 

Met Phe Gly Cys Leu Val Ala Gly Arg Leu Val Gin Thr Ala Ala Gin 
15 10 15 

Gin Val Ala Glu Asp Lys Phe Val Phe Asp Leu Pro Asp Tyr Glu Ser 
15 20 25 30 

He Asn His Val Val Val Phe Met Leu Gly Thr He Pro Phe Pro Glu 
35 40 45 

20 Gly Met Gly Gly Ser Val Tyr Phe Ser Tyr Pro Asp Ser Asn Gly Met 
50 55 60 



25 



Pro Val Trp Gin Leu Leu Gly Phe Val' Thr Asn Gly Lys Pro Ser Ala 
65 70 75 80 

He Phe Lys He Ser Gly Leu Lys Ser Gly Glu Gly Ser Gin His Pro 
85 90 95 



Phe Gly Ala Met Asn He Val Arg Thr Pro Ser Val Ala Gin He Gly 
30 100 105 110 

He Ser Val Glu Leu Leu Asp Ser Met Ala Gin Gin Thr Pro Val Gly 
115 120 125 

35 Asn Ala Ala Val Ser Ser Val Asp Ser Phe Thr Gin Phe Thr Gin Lys 
130 135 140 



40 



Met Leu Asp Asn Phe Tyr Asn Phe Ala Ser Ser Phe Ala Val Ser Gin 
145 150 155 160 

Ala Gin Met Thr Pro Ser Pro Ser Glu Met Phe He Pro Ala Asn Val 
165 170 175 



Val Leu Lys Trp Tyr Glu Asn Phe Gin Arg Arg Leu Ala Gin Asn Pro 
45 180 185 190 



50 



Xaa Phe Trp Xaa Thr Xaa 
195 



<2) INFORMATION FOR SEQ ID NO: 428: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 



60 Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser 
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15 10 15 

Ala Gly Val Asn Phe lie Leu Ala Leu Pro Leu Leu Leu Leu Trp Lys 
20 25 30 

5 

Asn Arg Gly Gly Val Gly Arg Ser Val Met Ser Ala Val Glu Xaa 
35 40 45 



10 

(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 

Met Lys Lys Val Glu Glu Lys Arg Val Asp Val Asn Ser Ala Val Ala 
20 1 5 10 15 

Met Gly Glu Val He Leu Ala Val Cys His Pro Asp Cys He Thr Thr 
20 25 30 

25 He Lys His Trp He Thr He He Arg Ala Arg Phe Glu Glu Val Leu 

35 40 45 

Thr Trp Ala Lys Gin His Gin Gin Arg Leu Glu Thr Ala Leu Ser Glu 
50 55 60 

30 

Leu Val Ala Asn Ala Glu Leu Leu Glu Glu Leu Leu Ala Trp He Gin 
65 70 75 80 

Trp Ala Glu Thr Thr Leu He Gin Arg Asp Gin Glu Pro He Pro Gin 

35 85 90 95 

Asn He Asp Arg Val Lys Ala Leu He Ala Glu His Gin Thr Phe Met 
100 105 110 

40 Glu Glu Met Thr Arg Lys Gin Pro Asp Val Asp Arg Val Thr Lys Thr 
115 120 125 

Tyr Lys Arg Lys Asn He Glu Pro Thr His Ala Pro Phe He Glu Lys 
130 135 140 

45 

Ser Arg Ser Gly Gly Arg Lys Ser Leu Ser Gin Pro Thr Pro Pro Pro 
145 150 155 160 

Met Pro He Leu Ser Gin Ser Glu Ala Lys Asn Pro Arg He Asn Gin 
50 165 170 175 

Leu Ser Ala Arg Trp Gin Gin Val Trp Leu Leu Ala Leu Glu Arg Gin 

180 185 190 

55 Arg Lys Leu Asn Asp Ala Leu Asp Arg Leu Glu Glu Leu Lys Glu Phe 
195 200 205 

Ala Asn Phe Asp Phe Asp Val Trp Arg Lys Lys Tyr Met Arg Trp Met 
210 215 220 

60 
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Asn His Lys Lys Ser Arg Val Met Asp Phe Phe Arg Arg lie Asp Lys 
225 230 235 240 

Asp Gin Asp Gly Lys lie Thr Arg Gin Glu Phe He Asp Gly He Leu 
5 245 250 255 

Ala Ser Lys Phe Pro Thr Thr Lys Leu Glu Met Thr Ala Val Ala Asp 
260 265 270 

10 He Phe Asp Arg Asp Gly Asp Gly Tyr He Asp Tyr Tyr Glu Phe Val 
275 280 285 

Ala Ala Leu His Pro Asn Lys Asp Ala Tyr Arg Pro Thr Thr Asp Ala 
290 295 300 

15 

Asp Lys He Glu Asp Glu Val Thr Arg Gin Val Ala Gin Cys Lys Cys 
305 310 315 320 

Ala Lys Arg Phe Gin Val Glu Gin He Gly Glu Asn Lys Tyr Arg Phe 
20 325 330 335 

Phe Leu Gly Asn Gin Phe Gly Asp Ser Gin Gin Leu Arg Leu Val Arg 
340 345 350 

25 He Leu Arg Asn Arg Asp Gly Ser Arg Trp Trp Arg Met Asp Gly Leu 
355 360 365 

Gly Xaa 
370 

30 



(2) INFORMATION FOR SEQ ID NO: 430: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430: 

40 

Met Asn Val Lys Thr Phe Ser Xaa Asp His Met His Phe Leu Cys Cys 
15 10 15 

Leu Tyr Leu Arg Tyr Val Thr Phe Val Tyr Leu Asn Leu Phe 
45 20 25 30 



(2) INFORMATION FOR SEQ ID NO: 431: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 

Met Glu Pro His Leu Arg Cys Arg Val Thr Arg Val Arg Gly Ser Leu 
15 10 15 



60 Gly Asn Thr Gly Arg Trp Leu Leu 
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20 



5 (2) INFORMATICS FOR SBQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) IflJGTH: 53 amino acids 

(B) TYPE: amino acid 
10 {D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432: 



15 



25 



40 



55 



Met His Tyr Leu Val Leu Gly Gly Leu Gly Val Phe Leu Phe Phe Ser 
1 5 10 15 . 

Cys Phe Val Phe Leu Phe Phe Xaa Phe Ser Phe Ala Phe Phe Pro Phe 

20 25 30 



Tyr Leu Glu Gly Met Gly Gly Ser Gly Asn Arg Glu Val Gly Gly Gly 
20 35 40 45 



Phe Cys Leu Phe Phe 
50 



(2) INFORMATIC»I FOR SEQ ID NO: 433: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 176 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 

35 Met Val Ser Lys Ala Leu Leu Arg Leu Val Ser Ala Val Asn Arg Arg 
15 10 15 

Arg Met Lys Leu Leu Leu Gly He Ala Leu Leu Ala Tyr Val Ala Ser 
20 25 30 



Val Trp Gly Asn Phe Val Asn Met Arg Ser He Gin Glu Asn Gly Glu 
35 40 . 45 



Leu Lys He Glu Ser Lys He Glu Glu Met Val Glu Pro Leu Arg Glu 
45 50 55 60 

Lys He Arg Asp Leu Glu Lys Ser Phe Thr Gin Lys Tyr Pro Pro Val 
65 70 75 80 

50 Lys Phe Leu Ser Glu Lys Asp Arg Lys Arg He Leu He Thr Gly Gly 
85 90 95 



Ala Gly Phe Val Gly Ser His Leu Thr Asp Lys Leu Met Met Asp Gly 
100 105 110 

His Glu Val Thr Val Val Asp Asn Phe Phe Thr Gly Arg Lys Arg Asn 
115 120 125 



Val Glu His Trp He Gly His Glu Asn Phe Glu Leu He Asn His Asp 
60 130 135 140 
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Val Trp Ser Pro Ser Thr Ser Arg Leu Thr Arg Tyr Thr lie Trp His 
145 150 155 160 

5 Leu Gin Pro Pro Leu Gin Thr Thr Cys lie lie Leu Ser Arg His Xaa 
165 170 175 



10 



(2) INFORMATION FOR SEQ ID NO: 434: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434: 

20 

Met Leu Arg Cys Trp Pro Leu Phe Trp Leu Pro Leu Val Ser Pro Phe 
15 10 15 

Cys Ser Leu Phe Trp Leu Leu Val Glu Trp Phe Gly Thr Asn He Asp 
25 20 25 30 

Arg Glu Ser Tyr Asp Ala He Gly Gly Pro Ser Trp Met Thr Ala Ser 
35 40 45 

30 Ser Phe Cys Leu Ser Asn Ser Asn He Trp Ser Leu Glu He Ser Ser 
50 55 60 

Gly Ser Thr Ser Val Val His Ser Gin Gin Ala Met Asp 
65 70 75 

35 



(2) INFORMATION FOR SEQ ID NO: 435: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

45 

Met Arg Ser Cys Glu He Gin Leu Cys Val Trp Leu Leu Val Ser Ser 
15 10 15 

His Val Asp Met Val Leu Gly Gly Ser Pro Ser Thr Leu Tyr Met Met 
50 20 25 30 



55 

(2) INFORMATION FOR SEQ ID NO: 436: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 30 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 

5 Met Val Val Asn Ser Leu Cys Phe Leu Ser Leu Leu Leu Val lie Leu 
1 5 10 15 

Glu Leu Ser Thr Asp Ser Ser Ala Arg Leu Leu Tyr His Glu 
20 25 30 

10 



(2) INFORMATION FOR SEQ ID NO: 437: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
{D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437: 



20 



35 



Met Asp Lys Gin Lys His Leu Glu Val Arg Arg Ser Val Phe Lys He 
15 10 15 



Gin Gly Lys He Ala Phe Ser Leu Met Phe Val Leu Lys Asp Leu Ser 
25 20 25 30 

Pro Thr He Phe Ser His Ser He Leu Leu Leu Leu Pro His His Val 
35 40 45 

30 Leu Pro Cys Thr Pro Gin Met Val Arg Gly Val Thr Gin Val Leu Arg 
50 55 60 



Glu Phe Gly Asp Gin 
65 



(2) INFORMATION FOR SEQ ID NO: 438: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 

45 

Met Pro Leu Cys Phe Phe Ser Phe Leu Cys Cys Trp Val Leu Val Phe 
15 10 15 

Lys Leu He 

50 



55 



(2) INFORMATION FOR SEQ ID NO: 439: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
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Met Lys Phe Ser Leu Val Leu Leu lie Lys lie lie Ser Phe Glu Arg 
15 10 15 

5 Leu Leu He Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn He Trp Arg 
20 25 30 

Arg Val Met Val Asn Leu Asn He Leu Phe Xaa 
35 40 

10 



(2) INFORMATION FOR SEQ ID NO: 440: 

15 (i) SBC3UENCE CHARACTERISTICS: 

(A) LEI^EGTH: 33 eunino acids 

(B) TV^PE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»^: SEQ ID NO: 440: 



20 



30 



45 



Met Leu Leu Phe Pro Ser Leu Leu Phe Ala Ala Thr Tyr Asn Val Ala 
15 10 15 



Asn Pro Ser Arg Leu He Leu Tyr Met He Ser Ala Gly Ala Asp Ser 
25 20 25 30 



Gin 



(2) INFORMATIC»I FOR SEQ ID NO: 441: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 53 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441: 

40 Met Trp Gin Val Arg Gly Leu Pro Pro Val Pro Leu Leu Leu Thr Met 
15 10 15 

Ser Pro Pro Pro Cys Leu Ser Ser Pro Phe Pro Phe He Ser Val Pro 
20 25 30 



Leu Phe Glu Ala Val Pro He Ser Val Ser Asp Gin Pro Ser Pro Xaa 
35 40 45 



Leu Thr Thr Leu Leu 

50 50 



(2) INFORMATION FOR SEQ ID NO: 442: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) T0P0I/X5Y: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442: 
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Met He Thr Ser Val Leu Val Phe Leu He Phe Phe Phe Pro Tyr Leu 
15 10 15 

5 Ser Leu Val Thr Leu Leu Gin Ala Arg Asn Leu Trp Val He His Arg 

20 25 30 



10 



Ala Ala Leu Cys Glu Ser Gly Leu Phe His Trp Arg Lys Gly He Glu 
35 40 45 

Asn Gin Leu Glu Pro Met Tyr Phe Leu Pro His Gly Thr Leu Phe Leu 
50 55 60 



15 



(2) INFOHMATIC»« FOR SEQ ID NO; 443: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443: 

Met Leu Tyr Ser Cys Glu Pro Tyr Leu He He Leu Asn He Tyr Ser 
1 5 10 15 

30 Gin Lys Ala Phe Tyr Phe Tyr Phe Phe Glu Gly Ser Phe Ser Val Cys 
20 25 30 

Thr Leu 

35 



20 



25 



(2) INFORMATION FOR SEQ ID NO: 444: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444: 



45 



60 



Met Arg Gin Arg Gin Ala Ala Cys Gin Pro Pro Pro Ser Arg Asn Gly 
15 10 15 



Leu Ala Gin Glu Cys Pro Pro His He Pro Ser Ser Phe Phe Leu Val 
50 20 25 30 

Lys Leu Leu Phe He Pro Trp Leu Ala Ser Leu Leu Ser Ser Pro Leu 

35 40 45 

55 Asn Leu Leu Leu Leu Val Ser He Ser Trp Asp Leu Gly Leu Lys Leu 
50 55 60 



Asn Leu Gin Gin Cys Arg Gin His Gin Val Leu Gin Glu Lys Asn Thr 
65 70 75 80 
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Lys Lys Phe Asn Lys Lys hys Lys Lys 
85 



(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUE3^CE CHARACTERISTICS: 

(A) LENGTH: 350 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445: 

Met Asp Phe lie Thr Ser Thr Ala lie Leu Pro Leu Leu Phe Gly Cys 
15 1 5 10 15 

Leu Gly Val Phe Gly Leu Phe Arg Leu Leu Gin Trp Val Arg Gly Lys 

20 25 30 

20 Ala Tyr Leu Arg Asn Ala Val Val Val lie Thr Gly Ala Thr Ser Gly 
35 40 45 



25 



Leu Gly Lys Glu Cys Ala Lys Val Phe Tyr Ala Ala Gly Ala Lys Leu 
50 55 60 

Val Leu Cys Gly Arg Asn Gly Gly Ala Leu Glu Glu Leu He Arg Glu 
65 70 75 80 



Leu Thr Ala Ser His Ala Thr Lys Val Gin Thr His Lys Pro Tyr Leu 
30 85 90 95 

Val Thr Phe Asp Leu Thr Asp Ser Gly Ala He Val Ala Ala Ala Ala 
100 105 110 

35 Glu He Leu Gin Cys Phe Gly Tyr Val Asp He Leu Val Asn Asn Ala 
ns 120 125 



40 



Gly He Ser Tyr Arg Gly Thr He Met Asp Thr Thr Val Asp Val Asp 
130 135 140 

Lys Arg Val Met Glu Thr Asn Tyr Phe Gly Pro Val Ala Leu Thr Lys 
145 150 155 160 



Ala Leu Leu Pro Ser Met He Lys Arg Arg Gin Gly His He Val Ala 
45 165 170 175 

He Ser Ser He Gin Gly Lys Met Ser He Pro Phe Arg Ser Ala Tyr 
ISO 185 190 

50 Ala Ala Ser Lys His Ala Thr Gin Ala Phe Phe Asp Cys Leu P^rg Ala 
195 200 205 

Glu Met Glu Gin Tyr Glu He Glu Val Thr Val He Ser Pro Gly Tyr 
210 215 220 

55 

He His Thr Asn Leu Ser Val Asn Ala He Thr Ala Asp Gly Ser Arg 
225 230 235 240 



60 



Tyr Gly Val Met Asp Thr Thr Thr Ala Gin Gly Arg Ser Pro Val Glu 
245 250 255 



wo 98/39448 



619 



PCT/US98/04493 



Val Ala Gin Asp Val Leu Ala Ala Val Gly Lys Lys Lys Lys Asp Val 
260 265 270 

5 lie Leu Ala Asp Leu Leu Pro Ser Leu Ala Val Tyr Leu Arg Thr Leu 

275 280 285 



10 



20 



Ala Pro Gly Leu Phe Phe Ser Leu Met Pro Pro Gly Pro Glu Lys Ser 
290 295 300 

Gly Asn Pro Arg Thr Pro Ser Thr Leu Thr Ser Gin Gly Gin Gly Arg 
305 310 315 320 



Glu Ala Ala Leu Leu Gly Leu Leu Thr Leu Gin Gly Thr Val Ala Phe 
15 325 330 335 



Val Glu Thr Leu Met Glu lie Cys Leu Thr Ser Gly Lys Asp 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 446: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 49 amino acicJs 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

30 Met Val Phe Leu Pro Arg Gly Val Val Val Ser Gly Gly Ala Ala Cys 
15 10 15 

Leu Trp Leu Thr Phe lie Leu Glu Thr Glu Val Tyr Leu Asp Leu Ala 
20 25 30 

35 

Thr Glu Ala Arg Ala His Ser Arg Met Gly Leu Gly Leu Trp Pro Pro 
35 40 45 

Asn 

40 



45 



60 



(2) INFORMATION FOR SEQ ID NO: 447: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 

Met Ala Ser Ala Glu Leu Asp Tyr Thr lie Glu lie Pro Asp Gin Pro 
15 10 15 

55 Cys Trp Ser Gin Lys Asn Ser Pro Ser Pro Gly Gly Lys Glu Ala Glu 
20 25 30 



Thr Arg Gin Pro Val Val He Leu Leu Gly Trp Gly Gly Cys Lys Asp 
35 40 45 i 
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Lys Asn Leu Ala Lys Tyr Ser Ala lie Tyr His Lys Arg Gly Cys lie 
50 55 60 

Val He Arg Tyr Thr Ala Pro Trp His Met Val Phe Phe Ser Glu Ser 
5 65 70 75 80 

Leu Gly He Pro Ser Leu Arg Val Leu Ala Gin Lys Leu Leu Glu Leu 
85 90 95 

10 Leu Phe Asp Tyr Glu He Glu Lys Glu Pro Leu Leu Phe His Val Phe 
100 105 110 



15 



Ser Asn Gly Gly Val Met Leu Tyr Arg Tyr Val Leu Glu Leu Leu Gin 
115 120 125 

Thr Arg Arg Phe Cys Arg Leu Arg Val Val Gly Thr He Phe Asp Ser 
130 135 140 



Ala Pro Gly Asp Ser Asn Leu Val Gly Ala Leu Arg Ala Leu Ala Ala 
20 145 150 155 160 

He Leu Glu Arg Arg Ala Ala Met Leu Arg Leu Leu Leu Leu Val Ala 
165 170 175 

25 Phe Ala Leu Val Val Val Leu Phe His Val Leu Leu Ala Pro He Thr 
180 185 190 



30 



Ala Xaa Phe His Thr His Phe Tyr Asp Arg Leu Gin Asp Ala Gly Ser 

195 200 205 

Arg Trp Pro Glu Leu Tyr Leu Tyr Ser Arg Ala Asp Glu Val Val Leu 
210 215 220 



Ala Arg Asp He Glu Arg Met Val Glu Ala Arg Leu Ala Arg Arg Val 
35 225 230 235 240 

Leu Ala Arg Ser Val Asp Phe Val Ser Ser Ala His Val Ser His Leu 
245 250 255 

40 Arg Asp Tyr Pro Thr Tyr Tyr Thr Ser Leu Cys Val Asp Phe Met Arg 
260 265 270 



45 



Asn Cys Val Arg Cys Xaa 
275 



(2) INFORMATION FOR SEQ ID NO: 448: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448: 

55 

Met Ser Phe He Phe Asp Trp He Tyr Ser Gly Phe Ser Ser Val Leu 
15 10 15 

Gin Phe Leu Gly Leu Tyr Lys Lys Thr Gly Lys Leu Val Phe Leu Gly 
60 20 25 30 
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Leu Asp Asn Ala Gly Lys Thr Thr Leu Leu His Met Leu Lys Asp Asp 
35 40 45 

5 Arg Leu Gly Gin His Val Pro Thr Leu His Pro Thr Ser Glu Glu Leu 
50 55 60 

Thr lie Ala Gly Met Thr Phe Thr Thr Phe Asp Leu Gly Gly His Val 
65 70 75 80 

10 

Gin Ala Arg Arg Val Trp Lys Asn Tyr Leu Pro Ala He Asn Gly He 

85 90 95 

Val Phe Leu Val Asp Cys Ala Asp His Glu Arg Leu Leu Glu Ser Lys 
15 100 105 110 

Glu Glu Leu Asp Ser Leu Met Thr Asp Glu Thr He Ala Asn Val Pro 
115 120 125 

20 He Leu He Leu Gly Asn Lys He Asp Arg Pro Glu Ala He Ser Glu 
130 135 140 

Glu Arg Leu Arg Glu Met Phe Gly Leu Tyr Gly Gin Thr Thr Gly Lys 
145 150 155 160 

25 

Gly Ser He Ser Leu Lys Glu Leu Asn Ala Arg Pro Leu Glu Val Phe 
165 170 175 

Met Cys Ser Val Leu Lys Arg Gin Gly Tyr Gly Glu Gly Phe Arg Trp 
30 180 185 190 

Met Ala Gin Tyr He Asp Xaa 
195 

35 

(2) INFORMATION FOR SEQ ID NO: 449: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 258 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449: 

45 Met Thr Leu Ser Arg Phe Ala Tyr Asn Gly Lys Arg Cys Pro Ser Ser 
15 10 15 

Tyr Asn He Leu Asp Asn Ser Lys He He Ser Glu Glu Cys Arg Lys 
20 25 30 

50 

Glu Leu Thr Ala Leu Leu His His Tyr Tyr Pro He Glu He Asp Pro 
35 40 45 

His Arg Thr Val Lys Glu Lys Leu Pro His Met Val Glu Trp Trp Thr 
55 50 55 60 

Lys Ala His Asn Leu Leu Cys Gin Gin Lys He Gin Lys Phe Gin He 
65 70 75 80 



60 Ala Gin Val Val Arg Glu Ser Asn Ala Met Leu Arg Glu Gly Tyr Lys 
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85 



90 



95 



Thr Phe Phe Asn Thr Leu Tyr His Asn Asn He Pro Leu Phe lie Phe 

100 105 110 

5 

Ser Ala Gly He Gly Asp He Leu Glu Glu He He Arg Gin Met Lys 
H5 120 125 

Val Phe His Pro Asn He His He Val Ser Asn Tyr Met Asp Phe Asn 
10 130 135 140 

Glu Asp Gly Phe Leu Gin Gly Phe Lys Gly Gin Leu He His Thr Tyr 
145 150 155 160 

15 Asn Lys Asn Ser Ser Val Cys Glu Asn Xaa Gly Tyr Phe Gin Gin Leu 
165 170 175 



20 



Glu Gly Lys Thr Asn Val He Leu Leu Gly Asp Ser He Gly Asp Leu 
180 185 190 

Thr Met Ala Asp Gly Val Pro Gly Val Gin Asn He Leu Lys He Gly 
195 200 205 



Phe Leu Asn Asp Lys Val Glu Glu Arg Arg Xaa Arg Tyr Met Asp Ser 
25 210 215 220 

Tyr Asp He Val Leu Glu Lys Asp Glu Thr Leu hsp Val Val Asn Gly 
225 230 235 240 

30 Leu Leu Gin His He Leu Cys Gin Gly Val Gin Leu Glu Met Gin Gly 
245 250 255 

Pro Xaa 



35 



(2) INFORMATION FOR SEQ ID NO: 450: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 



45 



Met Ser His Val Leu Leu Cys Pro Ser Leu Ser Cys Ser Asn Leu Leu 
15 10 15 



Pro Pro Ser His Ser Leu Gly Thr Met Gly Ser Leu Ser Pro His Leu 
50 20 25 30 

Cys Gly His Thr Met Cys Pro Val Asn Pro Glu Leu Pro Leu Ser Ser 
35 40 45 

55 Arg Leu Thr Thr Asp Gin Pro Gin Pro Asp Ala Cys Ser Pro Thr Leu 
50 55 60 

Leu Thr Leu Pro Leu Pro Ser Ser Phe Leu Pro His Ser Lys Pro Thr 
65 70 75 80 

60 
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Phe Xaa His Pro Cys Ser Pro 
85 



5 

(2) INPORMATIOJ FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

Met Phe Ser He Asn Pro Leu Glu Asn Leu Lys Val Tyr He Ser Ser 
15 1 5 10 15 

Arg Pro Pro Leu Val Val Phe Met He Ser Val Xaa Pro Met Ala He 
20 25 30 

20 Ala Phe Leu Thr Leu Gly Tyr Phe Phe Lys He Lys Glu He Lys Ser 

35 40 45 



25 



Pro Glu Met Ala Glu Asp Trp Asn Thr Phe Leu Leu Arg Phe Asn Asp 
50 55 60 

Leu Asp Leu Cys Val Ser Glu Asn Glu Thr Leu Lys His Leu Thr Asn 
65 70 75 80 



Asp Thr Thr Thr Pro Glu Ser Thr Met Thr Ser Gly Gin Ala Arg Ala 
30 85 90 95 

Ser Thr Gin Ser Pro Gin Ala Leu Glu Asp Ser Gly Pro Val Asn He 
100 105 110 

35 Ser Val Ser He Thr Leu Thr Leu Asp Pro Leu Lys Pro Phe Gly Gly 
115 120 125 



40 



Tyr Ser Arg Asn Val Thr His Leu Tyr Ser Thr He Leu Gly His Gin 
130 135 140 

He Gly Leu Ser Gly Arg Glu Ala His Glu Glu He Asn He Thr Phe 
145 150 155 160 



Thr Leu Pro Thr Ala Trp Ser Ser Asp Asp Cys Ala Leu His Gly His 
45 165 170 175 

Cys Glu Gin Val Val Phe Thr Ala Cys Met Thr Leu Thr Ala Ser Pro 
180 185 190 

50 Gly Val Phe Pro Val Thr Val Gin Pro Pro His Cys Val Pro Asp Thr 
195 200 205 

Tyr Ser Asn Ala Thr Leu Trp Tyr Lys He Phe Thr Thr Ala Arg Asp 
210 215 220 

55 

Ala Asn Thr Lys Tyr Ala Gin Asp Tyr Asn Pro Phe Trp Cys Tyr Lys 
225 230 235 240 

Gly Ala He Gly Lys Val Tyr His Ala Leu Asn Pro Lys Leu Thr Val 
60 245 250 255 
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lie Val Pro Asp Asp Asp Arg Ser Leu lie Asn Leu His Leu Met His 

260 265 270 

5 Thr Ser Tyr Phe Leu Phe Val Met Val lie Thr Met Phe Cys Tyr Ala 
275 280 285 

Val lie Lys Gly Arg Pro Ser Lys Leu Arg Gin Ser Asn Pro Glu Phe 
290 295 300 

10 

Cys Pro Glu Lys Val Ala Leu Ala Glu Ala Xaa 
305 310 315 



15 

(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC^T: SEQ ID NO: 452: 

Met Pro Gly Leu Ser Leu Ala Leu Leu Pro Phe Gly Pro Gly Cys Thr 
25 1 5 10 15 

Glu Ala Leu His Ala Gly Cys Phe Pro Ala Phe Ala Ser Ala Thr Arg 
20 25 30 

30 Val Asn Gly Glu Ala Ala Leu Ser Pro Gly Leu Cys Asp Pro lie Ser 
35 40 45 

Val Pro Tyr Val 
50 

35 



(2) INFORMATION FOR SEQ ID NO: 453: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453: 



45 



Met Ala Val Gly Gin He Met Thr Phe Gly Ser Pro Val He Gly Cys 
15 10 15 



Gly Phe He Ser Gly Trp Asn Leu Val Ser Met Cys Val Glu Tyr Val 
50 20 25 30 

Leu Leu Trp Lys Val Tyr Gin Lys Thr Pro Ala Leu Ala Val Lys Ala 
35 40 45 

55 Gly Leu Lys Glu Glu Glu Thr Glu Leu Lys Gin Leu Asn Leu His Lys 

50 55 60 



60 



Asp Thr Glu Pro Lys Pro Leu Glu Gly Thr His Leu Met Gly Val Lys 
65 70 75 80 
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Asp Ser Asn He His Glu Leu Glu His Glu Gin Glu Pro Thr Cys Ala 
85 90 95 

Ser Gin Met Ala Glu Pro Phe Arg Thr Phe Arg Asp Gly Trp Val Ser 
100 105 110 



10 



Tyr Tyr Asn Gin Pro Val Phe Leu Ala Gly Met Gly Leu Ala Phe Leu 
115 120 125 

Tyr Met Thr Val Leu Gly Phe Asp Cys He Thr Thr Gly Tyr Ala Tyr 
130 135 140 



15 



Thr Gin Gly Leu Ser Gly Phe His Pro Gin Tyr Phe Asp Gly Ser He 
145 150 155 160 

Ser Tyr Asn Trp Asn Asn Gly Asn Cys Ser Phe Tyr Leu Ala Thr Ser 
165 170 175 



Lys Met Trp Phe Gly Ser Ala Gly He Ser Gly Leu Ala Gin Leu 
20 180 185 190 



25 



Ser Cys Leu He Leu Cys Val He Ser Val Phe Met Pro Gly Ser Pro 
195 200 205 

Leu Asp Leu Ser Val Ser Pro Phe Glu Asp He Arg Ser Arg Phe He 
210 215 220 



30 



Gin Gly Glu Ser He Thr Pro Thr Lys He Pro Glu He Thr Thr Glu 
225 230 235 240 

He Tyr Met Ser Asn Gly Ser Asn Ser Ala Asn He Val Pro Glu Thr 
245 250 255 



Ser Pro Glu Ser Val Pro He He Ser Val Ser Leu Leu Phe Ala Gly 
35 260 265 270 



40 



Val He Ala Ala Arg He Gly Leu Trp Ser Phe Asp Leu Thr Val Thr 
275 280 285 

Gin Leu Leu Gin Glu Asn Val He Glu Ser Glu Arg Gly He He Asn 
290 295 300 



45 



Gly Val Gin Asn Ser Met Asn Tyr Leu Leu Asp Leu Leu His Phe He 
305 310 315 320 

Met Val He Leu Ala Pro Asn Pro Glu Ala Phe Gly Leu Leu Val Leu 
325 330 335 



He Ser Val Ser Phe Val Ala Met Gly His He Met Tyr Phe Arg Phe 
50 340 345 350 



55 



Ala Gin Asn Thr Leu Gly Asn Lys Leu Phe Ala Cys Gly Pro Asp Ala 

355 360 365 

Lys Glu Val Arg Lys Glu Asn Gin Ala Asn Thr Ser Val Val Xaa 
370 375 380 



60 (2) INFORMATION FOR SEQ ID NO: 454: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 454: 



10 



Met Arg Ser lie Gly Asn Lys Asn Thr lie Leu Leu Gly Leu Gly Phe 
15 10 15 

Gin He Leu Gin Leu Ala Trp Tyr Gly Phe Gly Ser Glu Pro Trp Met 
20 25 30 



Met Trp Ala Ala Gly Ala Val Ala Ala Met Ser Ser He Thr Phe Pro 
15 35 40 45 

Ala Val Ser Ala Leu Val Ser Arg Thr Ala Asp Ala Asp Gin Gin Gly 
50 55 60 

20 Val Val Gin Gly Met He Thr Gly He Arg Gly Leu Cys Asn Gly Leu 
65 70 75 80 



25 



Gly Pro Ala Leu Tyr Gly Phe He Phe Tyr He Phe His Val Glu Leu 
85 90 95 

Lys Glu Leu Pro He Thr Gly Thr Asp Leu Gly Thr Asn Thr Ser Pro 

100 105 110 



Gin His His Phe Glu Gin Asn Ser He He Pro Gly Pro Pro Phe Leu 
30 115 120 125 

Phe Gly Ala Cys Ser Val Leu Leu Ala Leu Leu Val Ala Leu Phe He 
130 135 140 

35 Pro Glu His Thr Asn Leu Ser Leu Arg Ser Ser Ser Trp Arg Lys His 
145 150 155 160 



40 



Cys Gly Ser His Ser His Pro His Asn Thr Gin Ala Pro Gly Glu Ala 
165 170 175 

Lys Glu Pro heu Leu Gin Asp Thr Asn Val 
180 185 



45 

(2) INFORMATION FOR SEQ ID NO: 455: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 

Met Leu Gin Thr Ser Asn Tyr Ser Leu Val Leu Ser Leu Gin Phe Leu 
55 1 5 10 15 

Leu Leu Ser Tyr Asp Leu Phe Val Asn Ser Phe Ser Glu Leu Leu Gin 

20 25 30 



60 Lys Thr Pro Val He Gin Leu Val Leu Phe He He Gin Asp He Ala 
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35 40 45 

Val Leu Phe Asn lie lie He He Phe Leu Met Phe Phe Asn Thr Phe 

50 55 60 

5 

Val Phe Gin Ala Gly Leu Val Asn Leu Leu Phe His Lys Phe Lys Gly 
65 70 75 80 

Thr He He Leu Thr Ala Val Tyr Phe Ala Leu Ser He Ser Leu His 
10 85 90 95 

Val Trp Val Met Asn Leu Arg Trp Lys Asn Ser Asn Ser Phe He Trp 
100 105 110 

15 Thr Asp Gly Leu Gin Met Leu Phe Val Phe Gin Arg Leu Ala Ala Val 
115 120 125 

Leu Tyr Cys Tyr Phe Tyr Lys Arg Thr Ala Val Arg Leu Gly Asp Pro 
130 135 140 

20 

His Phe Tyr Gin Asp Ser Leu Trp Leu Arg Lys Glu Phe Met Gin Val 
145 150 155 160 

Arg Arg Xaa 

25 



30 



45 



(2) INFORMATION FOR SEQ ID NO: 456: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 

Met Arg He Gin Val Phe He Leu Leu Leu Gly Ala Gly Gly Thr Ser 
15 10 15 

40 Gin Phe Thr Lys Pro Pro Ser Leu Pro Leu Glu Pro Glu Pro Ala Val 
20 25 30 



Glu Ser Ser Pro Thr Glu Thr Ser Glu Gin He Arg Glu Lys 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 457: 

50 ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 

55 

Met Ser Tyr Leu Ala Phe Leu Tyr Met Thr Phe Asp Phe Cys Cys Leu 
15 10 15 



Tyr Phe Ser Thr Val Tyr Ala Pro Ser Phe Lys Tyr He Cys Val His 
60 20 25 30 
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Thr Asp Thr His He Cys Val Cys Val Cys He Tyr Leu Ser Ser Val 

35 40 45 

5 Val Ser Lys Ser Ser Ala Glu Ala Asp Gly Val Leu Gin Pro Arg Arg 

50 55 60 

His Pro Ala Ser Leu Leu He Val Phe Ala Thr Ser He Ser Glu Ser 

65 70 75 BO 



10 



Ser Leu Leu He Phe Ser Phe Gin Lys Thr Glu Ala Lys Leu He Val 
85 90 95 



Phe Ala Val Ser Leu Ala Ala Lys Xaa 
15 100 105 



20 



(2) INFORMATION FOR SEQ ID NO: 458: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458: 

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser 
15 10 15 

30 Leu Leu Pro Val He Leu Ala Val Phe Pro Asp Ser Ala Gin He Val 
20 25 30 



35 



Pro Leu Leu Lys Pro He Pro Arg Pro Gin Pro Glu Val He Phe Pro 

35 40 45 

Ser Ser Glu Leu Leu Glu Gin Leu Leu Ser Val Gin Phe Val Trp Gin 
50 55 60 



Ala His Thr Val Ala Xaa 
40 65 70 



45 



(2) INFORMATION FOR SEQ ID NO: 459: 



(i) SEQUENCE CHAEU^CTERISTICS : 

(A) LENGTH: 155 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 459: 

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe 
15 10 15 

55 Ala Leu Val Gly Leu Ala Lys Leu Ser Glu Glu He Ser Ala Pro Val 

20 25 30 

Ser Glu Arg Met Asn Ala Leu Phe Val Gin Phe Ala Glu Val Phe Pro 
35 40 45 

60 
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Leu Lys Val Phe Gly Tyr Gin Pro Asp Pro Leu Asn Tyr Gin lie Ala 
50 55 60 

Val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro 
5 65 70 75 80 

Pro Met Leu Gin Glu lie Ser Asn Leu Phe Leu lie Leu Leu Met Met 

85 90 95 

10 Gly Ala lie Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys 
100 105 110 



15 



He Pro Ala He Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly 
115 120- 125 

Gin Leu Leu Ala Gin Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys 

130 135 140 



Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa 
20 145 150 155 



(2) INFORMATION FOR SEQ ID NO: 460: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 

Met Lys Leu Gly Arg Ala Val Leu Gly Leu Leu Leu Leu Ala Pro Ser 
15 10 15 

35 Val Val Gin Ala Val Glu Pro He Ser Leu Gly Leu Ala Leu Ala Gly 
20 25 30 

Val Leu Thr Gly Tyr He Tyr Pro Arg Leu Tyr Cys Leu Phe Ala Glu 
35 40 45 

40 

Cys Cys Gly Gin Lys Arg Ser Leu Ser Arg Glu Ala Leu Gin Lys Asp 
50 55 60 

Leu 'Asp Asp Asn Leu Phe Gly Gin His Leu Ala Lys Lys He He Leu 
45 65 70 75 80 

Asn Ala Val Phe Gly Phe He Asn Asn Pro Lys Pro Lys Lys Pro Leu 

85 90 95 

50 Thr Leu Ser Leu His Gly Trp Thr Gly Thr Gly Lys Asn Phe Val Ser 
100 105 110 

Lys He He Ala Glu Asn He Tyr Glu Gly Gly Leu Asn Ser Asp Tyr 

55 ^'^^ 

Val His Leu Phe Val Ala Thr Leu His Phe Pro His Ala Ser Asn He 
130 135 140 



60 



Thr Leu Tyr Lys Asp Gin Leu Gin Leu Trp He Arg Gly Asn Val Ser 
145 150 155 160 
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Ala Cys Ala Arg Ser He Phe He Phe Asp Glu Met Asp Lys Met His 
165 170 175 

5 Ala Gly Leu He Asp Ala He Lys Pro Phe Leu Asp Tyr Tyr Asp Leu 
180 185 190 

Val Asp Gly Val Ser Tyr Gin Lys Ala Met Phe He Phe Leu Ser Asn 
195 200 205 

10 

Ala Gly Ala Glu Arg He Thr Asp Val Ala Leu Asp Phe Trp Arg Ser 
210 215 220 

Gly Lys Gin Arg Glu Asp He Lys Leu Lys Asp He Glu His Ala Leu 
15 225 230 235 240 

Ser Val Ser Val Phe Asn Asn Lys Asn Ser Gly Phe Trp His Ser Ser 

245 250 255 

20 Leu He Asp Arg Asn Leu He Asp Tyr Phe Val Pro Phe Leu Pro Leu 
260 265 270 

Glu Tyr Lys His Leu Lys Met Cys He Arg Val Glu Met Gin Ser Arg 
275 280 285 

25 

Gly Tyr Glu He Asp Glu Asp He Val Ser Arg Val Ala Glu Glu Met 

290 295 300 

Thr Phe Phe Pro Lys Glu Glu Arg Val Phe Ser Asp Lys Gly Cys Lys 
30 305 310 315 320 

Thr Val Phe Thr Lys Leu Asp Tyr Tyr Tyr Asp Asp 
325 330 

35 

(2) INFORMATION FOR SEQ ID NO: 461: 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 461: 

45 Met Leu Lys Cys He 
1 5 



50 (2) INFORMATIC»J FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 



60 



Met He Leu Thr Leu Leu Ser Val Val Ser Thr Met Ala Ser 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 463: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28S amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

10 

Met Lys Leu His Pro Pro Pro Pro Ser Pro Val Thr Gin Asp His Arg 
15 10 15 

Ser Lys Ser Ser His Ser Asn Trp Met Pro Arg Met Gly Ala Cys Ser 
15 20 25 30 

Met Ser Arg Thr Ser Ser Ser Gly Pro Pro Ser Leu Cys Lys Ser Thr 
35 40 45 

20 Ser Gly Arg Ser Cys Thr Arg Pro His Cys Trp Pro Ser Leu Pro Ala 
50 55 60 

Trp Val Ser Val Phe Thr Arg Thr Asn Thr Gly Ser Trp Cys Tyr Pro 
65 70 75 80 

25 

Ala Trp Gly Gly Ala Phe Ser Arg Pro Trp Met Ser Ala Gin Ser Met 
85 90 95 

Cys Cys Ala Glu Arg Ser Val Leu Gin Val Ala Cys Arg Leu Leu Asp 
30 100 105 110 

Ala Leu Glu Phe Leu His Glu Asn Glu Tyr Val His Gly Asn Val Thr 

115 120 125 

35 Ala Glu Asn lie Phe Val Asp Pro Glu Asp Gin Ser Gin Val Thr Leu 
130 135 140 



40 



Ala Gly Tyr Gly Phe Ala Phe Arg Tyr Cys Pro Ser Gly Lys His Val 
145 150 155 160 

Ala Tyr Val Glu Gly Ser Arg Ser Pro His Glu Gly Asp Leu Glu Phe 
165 170 175 



lie Ser Met Asp Leu His Lys Gly Cys Gly Pro Ser Arg Arg Xaa Asp 
45 180 185 190 

Leu Gin Ser Leu Gly Tyr Cys Met Leu Lys Trp Leu Tyr Gly Phe Leu 
195 200 205 

50 Pro Trp Thr Asn Cys Leu Pro Xaa Xaa Glu Asp lie Met Lys Gin Lys 
210 215 220 



55 



Gin Lys Phe Val Asp Lys Pro Gly Pro Phe Val Gly Pro Cys Gly His 
225 230 235 240 

Trp lie Arg Pro Ser Glu Thr Leu Gin Lys Tyr Leu Lys Val Val Met 
245 250 255 



Ala Leu Thr Tyr Glu Glu Lys Pro Pro Tyr Ala Met Leu Arg Asn Asn 
60 260 265 270 
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Leu Glu Ala Leu Leu Gin Asp Leu Arg Val Ser Pro Tyr 
275 280 285 

5 

(2) INFX>RMATIC»I FOR SEQ ID NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 464: 

15 Met Thr Ser Pro Pro Pro His Gin Gly Trp Glu Gin Arg Gly Cys Gly 
15 10 15 

Glu Ser Gin Val Pro Leu Ala Leu Ser Arg Val Phe Ser Thr Ser His 
20 25 30 

20 

Tyr Cys Leu Leu Leu Val Ala Asn Gin Ser lie Phe Phe Pro Cys Leu 
35 40 45 

Trp Ala Val Glu Arg Leu Leu Gly Val Arg Cys Thr Cys Pro Leu Ser 
25 50 55 60 

Trp Gly Lys Arg lie lie Ser Glu His Cys Ser Ala Gin Ser Ser Xaa 
65 70 75 80 

30 



35 (2) INFORMATIC»J FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 

Met His Thr Trp Tyr Asn Asp Arg Arg Gin Asn Cys His Cys Leu Leu 
15 10 15 

45 

Phe Phe Leu lie Tyr Leu Arg Lys lie Tyr Gin Val Val Pro His Val 
20 25 30 

Pro Leu Leu Val Lys Cys Arg Gly Arg Leu Lys Gly Val Asn lie 
50 35 40 45 



(2) INFORMATION FOR SEQ ID NO: 466: 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
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Met Glu Leu Val Leu Val Phe Leu Cys Ser Leu Leu Ala Pro Met Val 
15 10 15 

5 Leu Ala Ser Ala Ala Glu Lys Glu Lys Glu Met Asp Pro Phe His Tyr 
20 25 30 



10 



Asp Tyr Gin Thr Leu Arg He Gly Gly Leu Val Phe Ala Val Val Leu 
35 40 45 

Phe Ser Val Gly He Leu Leu He Leu Ser Arg Arg Cys Lys Cys Ser 
50 55 60 



Phe Asn Gin Lys Pro Arg Ala Pro Gly Asp Glu Glu Ala Gin Val Glu 
15 65 70 75 80 

Asn Leu He Thr Ala Asn Ala Thr Glu Pro Gin Lys Ala Glu Asn Xaa 
85 90 95 



20 



25 (2) INFORMATK^ FOR SEQ ID NO: 467: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L^GTH: 399 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467: 

Met Ala Ser Gly Ala Asp Ser Lys Gly Asp Asp Leu Ser Thr Ala He 
15 10 15 

35 

Leu Lys Gin Lys Asn Arg Pro Asn Arg Leu He Val Asp Glu Ala He 
20 25 30 

Asn Glu Asp Asn Ser Val Val Ser Leu Ser Gin Pro Lys Met Asp Glu 
40 35 40 45 

Leu Gin I>eu Phe Arg Gly Asp Thr Val Leu Leu Lys Gly Lys Lys Arg 
50 55 60 

45 Arg Glu Ala Val Cys He Val Leu Ser Asp Asp Thr Cys Ser Asp Glu 
65 70 75 80 

Lys He Arg Met Asn Arg Val Val Arg Asn Asn Leu Arg Val Arg Leu 
85 90 95 

50 

Gly Asp Val He Ser He Gin Pro Cys Pro Asp Val Lys Tyr Gly Lys 
100 105 110 

Arg He His Val Leu Pro He Asp Asp Thr Val Glu Gly He Thr Gly 
55 115 120 125 

Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr Phe Leu Glu Ala Tyr Arg 
130 135 140 

60 Pro He Arg Lys Gly Asp He Phe Leu Val Arg Gly Gly Met Arg Ala 
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145 150 155 160 

Val Glu Phe Lys Val Val Glu Thr Asp Pro Ser Pro Tyr Cys lie Val 
165 170 175 

5 

Ala Pro Asp Thr Val lie His Cys Glu Gly Glu Pro lie Lys Arg Glu 
180 185 190 

Asp Glu Glu Glu Ser Leu Asn Glu Val Gly Tyr Asp Asp He Gly Gly 
10 195 200 205 

Cys Arg Lys Gin Leu Ala Gin He Lys Glu Met Val Glu Leu Pro Leu 
210 215 220 

15 Arg His Pro Ala Leu Phe Lys Ala He Gly Val Lys Pro Pro Arg Gly 
225 230 235 240 



20 



He Leu Leu Tyr Gly Pro Pro Gly Thr Gly Lys Thr Leu He Ala Arg 
245 250 255 

Ala Val Ala Asn Glu Thr Gly Ala Phe Phe Phe Leu He Asn Gly Pro 
260 265 270 



Glu He Met Ser Lys Leu Ala Gly Glu Ser Glu Ser Asn Leu Arg Lys 
25 275 280 285 

Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro Ala He He Phe He Asp 
290 295 300 

30 Glu Leu Asp Ala He Ala Pro Lys Arg Glu Lys Thr His Gly Glu Val 
305 310 315 320 

Glu Arg Arg He Val Ser Gin Leu Leu Thr Leu Met Asp Gly Leu Lys 
325 330 335 

35 

Gin Arg Ala His Val He Val Met Ala Ala Thr Asn Arg Pro Asn Ser 
340 345 350 

He Asp Pro Ala Leu Arg Arg Phe Gly Arg Phe Asp Arg Glu Val Asp 
40 355 360 365 

He Gly He Pro Asp Ala Thr Gly Arg Leu Glu He Leu Gin He His 
370 375 380 

45 Thr Lys Asn Met Lys Leu Ala Asp Asp Val Asp Leu Glu Gin Xaa 
385 390 395 



50 (2) INFORMATION FOR SEQ ID NO: 468: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 amino acids 
<B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 



60 



Leu 
1 
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(2) INFORMATION FOR SEQ ID NO: 469: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 469: 

10 

Met Ala Ala Pro Lys Gly Ser Leu Trp Val Arg Thr Gin Leu Gly Leu 
15 10 15 

Pro Pro Leu Leu Leu Leu Thr Met Ala Leu Ala Gly Gly Ser Gly Thr 
15 20 25 30 

Ala Ser Ala Glu Ala Phe Asp Ser Val Leu Gly Asp Thr Ala Ser Cys 
35 40 45 

20 His Arg Ala Cys Gin Leu Thr Tyr Pro Leu His Thr Tyr Pro Lys Glu 
50 55 60 

Glu Glu Leu Tyr Ala Cys Gin Arg Gly Cys Arg Leu Phe Ser lie Cys 
65 70 75 80 

25 

Gin Phe Val Asp Asp Gly lie Asp Leu Asn Arg Thr Lys Leu Glu Cys 
85 90 95 

Glu Ser Ala Cys Thr Glu Ala Tyr Ser Gin Ser Asp Glu Gin Tyr Ala 
30 100 105 110 

Cys His Leu Gly Cys Gin Asn Gin Leu Pro Phe Ala Glu Leu Arg Gin 

115 120 125 

35 Glu Gin Leu Met Ser Leu Met E>ro Lys Met His Leu Leu Phe Pro Leu 
130 135 140 

Thr Leu Val Arg Ser Phe Trp Ser Asp Met Met Asp Ser Ala Gin Ser 
145 150 155 160 

40 

Phe He Thr Ser Ser Trp Thr Phe Tyr Leu Gin Ala Asp Asp Gly Lys 

165 170 175 

He Val He Phe Xaa Ser Lys Pro Arg Asn Pro Arg Tyr Ala Pro His 
45 180 185 190 

Leu Glu Pro Gly Ala Leu Pro Asn Leu Xaa Xaa Xaa Ser Leu Ser Lys 
195 200 205 

50 Met Ser Xaa Xaa Ser Xaa Met Arg Asn Ser Gin Ala His Arg Asn Phe 

210 215 220 

Leu Glu Asp Gly Glu Ser Asp Gly Phe Leu Arg Cys Leu Ser Leu Asn 
225 230 235 240 

55 

Ser Gly Trp He Leu Thr Thr Thr Leu Val Leu Ser Val Met Val Leu 
245 250 255 



Leu Trp He Cys Cys Ala Thr Cys Cys Tyr Thr Leu Leu Asp Ala Val 
60 260 265 270 
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Xaa 



5 

(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470: 

15 Met Met Val Leu Ser Leu Gly He He Leu Ala Ser Ala Ser Phe Ser 

1 5 * 10 15 

Pro Asn Phe Thr Gin Val Thr Ser Thr Leu Leu Asn Ser Ala Tyr Pro 
20 25 30 

20 

Phe He Gly Pro Phe Phe Phe He He Ser Gly Ser Leu Ser He Ala 
35 40 45 

Thr Glu Lys Arg Leu Thr Lys Leu Leu Val His Ser Ser Leu Val Gly 
25 50 55 60 

Ser He Leu Ser Ala Leu Ser Ala Leu Val Gly Phe He He Leu Ser 
65 70 75 80 

30 Val Lys Gin Ala Thr Leu Asn Pro Ala Ser Leu Gin Cys Glu Leu Asp 
85 90 95 

Lys Asn Asn He Pro Thr Arg Ser Tyr Val Ser Tyr Phe Tyr His Asp 
100 105 110 

35 

Ser Leu Tyr Thr Thr Asp Cys Tyr Thr Ala Lys Ala Ser Leu Ala Gly 
115 120 125 

Xaa Leu Ser Leu Met Leu He Cys Thr Leu Leu Glu Phe Cys Leu Ala 
40 130 135 140 

Val Leu Thr Ala Val Leu Arg Trp Lys Gin Ala Tyr Ser Asp Phe Pro 
145 150 155 160 

45 Gly Ser Val Leu Phe Leu Pro His Ser Tyr He Gly Asn Ser Gly Met 
165 170 175 

Ser Ser Lys Met Thr His Asp Cys Gly Tyr Glu Glu Leu Leu Thr Ser 
180 185 190 

50 



55 

(2) INFORMATION FOR SEQ ID NO: 471: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 234 amino acids 
60 (B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 

Met Arg Lys Thr Arg Leu Trp Gly Leu Leu Trp Met Leu Phe Val Ser 
5 1 5 10 15 

Glu Leu Arg Ala Ala Thr Lys Leu Thr Glu Glu Lys Tyr Glu Leu Lys 
20 25 30 

10 Glu Gly Gin Thr Leu Asp Val Lys Cys Asp Tyr Thr Leu Glu Lys Phe 
35 40 45 



15 



30 



35 



45 



50 



55 



Ala Ser Ser Gin Lys Ala Trp Gin He He Arg Asp Gly Glu Met Pro 
50 55 60 

Lys Thr Leu Ala Cys Thr Glu Arg Pro Ser Lys Asn Ser His Pro Val 
65 70 75 80 



Gin Val Gly Arg He He Leu Glu Asp Tyr His Asp His Gly Leu Leu 
20 85 90 95 

Arg Val Arg Met Val Asn Leu Gin Val Glu Asp Ser Gly Leu Tyr Gin 
100 105 110 

25 Cys Val He Tyr Gin Pro Pro Lys Glu Pro His Met Leu Phe Asp Arg 
115 120 125 



He Arg Leu Val Val Thr Lys Gly Phe Ser Gly Thr Pro Gly Ser Asn 
130 135 140 

Glu Asn Ser Thr Gin Asn Val Tyr Lys He Pro Pro Thr Thr Thr Lys 
145 150 155 160 

Ala Leu Cys Pro Leu Tyr Thr Ser Pro Arg Thr Val Thr Gin Ala Pro 
165 170 175 

Pro Lys Ser Thr Ala Asp Val Ser Thr Pro Asp Ser Glu He Asn Leu 
180 185 190 

40 Thr Asn Val Thr Asp He He Arg Val Pro Val Phe Asn He Val He 
195 200 205 



Leu Leu Ala Gly Gly Phe Leu Ser Lys Ser Leu Val Phe Ser Val Leu 
210 215 220 

Phe Ala Val Thr Leu Arg Ser Phe Val Pro 
225 230 



(2) INFORMATION FOR SEQ ID NO: 472: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472: 

Met Leu His He Leu Pro Leu Lys Ser Tyr Asp Phe Pro His Phe Ser 
60 1 5 10 15 
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Leu Met Gly Arg Tyr Arg Cys Ala Ser Leu Leu Phe Cys Phe Leu Leu 
20 25 30 

5 Leu Phe Phe Phe Phe Cys Ser Val Leu Trp Thr Phe Ser Asp Met His 

35 40 45 



10 



20 



Arg Ser Gly Glu Asp Gly Pro Trp Thr Pro Cys Val His His Leu Ala 
50 55 60 

Ala Ser Leu lie Ser Tyr Gly Gin Pro Gly Phe He Cys He Ser Leu 
65 70 75 80 



Phe Ser Pro Val Leu Phe lie Glu Asn Pro Arg His Tyr Ala Asn Ala 
15 85 90 95 



Thr Val Thr Thr Leu Gly Asp Trp Xaa 
100 105 



(2) INFORMATIC»I FOR SEQ ID NO: 473: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

30 Met Val Phe Leu Lys Tyr Arg Phe Leu Phe Phe Leu Val Phe Leu Ala 
15 10 15 

Asn Cys He Tyr Ser Leu His Tyr Lys Pro Ser Leu Met Tyr Pro Lys 
20 25 30 

35 



45 



40 

(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474: 

Met Ala Leu Ser Arg Gly Leu Pro Arg Glu I^eu Ala Glu Ala Val Ala 
50 1 5 10 15 

Gly Gly Arg Val Leu Val Val Gly Ala Gly Gly He Gly Cys Glu Leu 

20 25 30 



55 



60 



Leu Lys Asn Leu Val Leu Thr Gly Phe Ser His He Asp Leu He Asp 
35 40 45 

Leu Asp Thr He Asp Val Ser Asn Leu Asn Arg Gin Phe Leu Phe Gin 
50 55 60 
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10 



Lys Lys His Val Gly Arg Ser Lys Ala Gin Val Ala Lys Glu Ser Val 
70 75 80 

Leu Gin Phe Tyr Pro Lys Ala Asn lie Val Ala Tyr His Asp Ser He 
85 90 95 

Met Asn Pro Asp Tyr Asn Val Glu Phe Phe Arg Gin Phe He Leu Val 

105 110 

Met Asn Ala Leu Asp Asn Arg Ala Ala Arg Asn His Val Asn Arg Met 

120 125 



15 



20 



25 



Cys Leu Ala Ala Asp Val Pro Leu He Glu Ser Gly Thr Ala Gly Tyr 

135 140 

Leu Gly Gin Val Thr Thr He Lys Lys Gly Val Thr Glu Cys IVr Glu 
150 155 160 

Cys His Pro Lys Pro Thr Gin Arg Thr Phe Pro Gly Cys Thr He Arg 
165 170 175 

Asn Thr Pro Ser Glu Pro He His Cys He Val Trp Ala Lys Tyr Leu 
180 185 190 

Phe Asn Gin Leu Phe Gly Glu Glu Asp Ala Asp Gin Glu Val Ser Pro 
195 200 205 



30 



Asp Arg Ala Asp Pro Glu Ala Ala Trp Glu Pro Thr Glu Ala Glu Ala 
210 215 220 

Arg Ala Arg Ala Ser Asn Glu Asp Gly Asp He Lys Arg He Ser Thr 
225 230 



235 



240 



35 



Lys Glu Trp Ala Lys Ser Thr Gly Tyr Asp Pro Val Lys Leu Phe Thr 
245 250 255 



40 



Lys Leu Phe Lys Asp Asp He Arg Tyr Leu Leu Thr Met Asp Lys Leu 
260 265 270 

Trp Arg Lys Arg Lys Pro Pro Val Pro Leu Asp Trp Ala Glu Val Gin 
275 280 285 



45 



50 



Ser Gin Gly Glu Glu Thr Asn Ala Ser Asp Gin Gin Asn Glu Pro Gin 
290 295 300 

Leu Gly Leu Lys Asp Gin Gin Val Leu Asp Val Lys Ser Tyr Ala Arg 
310 315 320 

Leu Phe Ser Lys Ser He Glu Thr Leu Arg Val His Leu Ala Glu Lys 
325 330 335 



55 



Gly Asp Gly Ala Glu Leu He Trp Asp Lys Asp Asp Pro Ser Ala Met 

340 345 350 

Asp Phe Val Thr Ser Ala Ala Asn Leu Arg Met His He Phe Ser Met 
355 360 365 



60 



Asn Met Lys Ser Arg Phe Asp He Lys Ser Met Ala Gly Asn He He 
370 375 380 
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Pro Ala He Ala Thr Thr Asn Ala Val He Ala Gly Leu He Val Leu 
385 390 395 400 

Glu Gly Leu Lys He Leu Ser Gly Lys He Asp Gin Cys Arg Thr He 
5 405 410 415 

Phe Leu Asn Lys Gin Pro Asn Pro Arg Lys Lys Leu Leu Val Pro Cys 
420 425 430 

10 Ala Leu Asp Pro Pro Asn Pro Asn Cys Tyr Val Cys Ala Ser Lys Pro 
435 440 445 



15 



Glu Val Thr Val Arg Leu Asn Val His Lys Val Thr Val Leu Thr Leu 
450 455 460 

Gin Asp Lys He Val Lys Glu Lys Phe Ala Met Val Ala Pro Asp Val 
465 470 475 480 



Gin He Glu Asp Gly Lys Gly Thr He Leu He Ser Ser Glu Glu Gly 
20 485 490 



495 



Glu Thr Glu Ala Asn Asn His Lys Lys Leu Ser Glu Phe Gly He Arg 
500 505 510 

25 Asn Gly Ser Arg Leu Gin Ala Asp Asp Phe Leu Gin Asp Tyr Thr Leu 
515 520 525 



30 



Leu He Asn He Leu His Ser Glu Asp Leu Gly Lys Asp Val Glu Phe 
530 535 540 

Glu Val Val Gly Asp Ala Pro Glu Lys Val Gly Xaa Lys Gin Ala -Glu 
545 550 555 560 



Asp Ala Ala Lys Ser He Thr Asn Gly Gin Xaa 
35 565 570 



40 



55 



(2) INFORMATION FOR SEQ ID NO: 475: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

^5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 

Met Gin Val Val Thr Cys Leu Thr Arg Asp Ser Tyr Leu Thr His Cys 
15 10 15 

50 Phe Leu Gin His Leu Met Val Val Leu Ser Ser Leu Glu Arg Thr Pro 
20 25 30 



Ser Pro Glu Pro Val Asp Lys Asp Phe Tyr Ser Glu Phe Gly Asn Lys 
35 40 45 

Thr Thr Gly Lys Met Glu Asn Tyr Glu Leu He His Ser Ser Arg Val 
50 55 60 



Lys Phe Thr Tyr Pro Ser Glu Glu Glu He Gly Asp Leu Thr Phe Thr 
60 65 70 75 80 



wo 98/39448 



PCTAJS98/04493 



641 



10 



Val Ala Gin Lys Met Ala Glu Pro Glu Lys Ala Pro Ala Leu Ser lie 
85 90 95 

Leu Leu Tyr Val Gin Ala Phe Gin Val Gly Met Pro Pro Pro Gly Cys 
100 105 110 

Cys Arg Gly Pro Leu Arg Pro Lys Thr Leu Leu Leu Thr Ser Ser Glu 
lis 120 125 

He Phe Leu Leu Asp Glu Asp Cys Val His Tyr Pro Leu Pro Glu Phe 
130 135 140 



Ala Lys Glu Pro Pro Gin Arg Asp Arg Tyr Arg Leu Asp Asp Gly Arg 
15 145 150 155 160 



20 



Arg Val Arg Asp Leu Asp Arg Val Leu Met Gly Tyr Gin Thr Tyr Pro 
165 170 175 

Gin Pro Ser Pro Ser Ser Ser Met Thr Cys Lys Val Met Thr Ser Trp 
180 185 190 



25 



Ala Val Ser Pro Trp Thr Thr Leu Gly Arg Cys Gin Val Ala Arg Leu 

195 200 205 

Glu Pro Ala Arg Ala Val Lys Ser Ser Gly Arg Cys Leu Ser Pro Val 
210 215 220 



Leu Arg Ala Glu Arg Ser Ser Ser Arg Cys Trp Leu Ala Ser Gly Arg 
30 225 230 235 240 



35 



Pro Cys Val Ala Val Ser Cys Leu Ser Ser Ser Pro Ala Ser Pro Gly 

245 250 255 

His Ser Gin Pro Val Val Ser Ser Leu Thr Pro Thr Gly Ala Gly Gin 
260 265 270 



40 



Gin Ala Phe Val Phe Ser Lys Asn Val Leu Ser Ser Leu Trp Tyr Leu 
275 280 285 

Asn Leu Thr Val Leu Ala Glu Asn Val Asn Met Cys Val Cys Cys Val 
290 295 300 



Asn Ser Phe Ser Cys Trp Glu Xaa 
45 305 310 



<2) INFORMATION FOR SEQ ID NO: 476: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476: 

Met Ala Gin His His Leu Trp lie Leu Leu Leu Cys Leu Gin Thr Trp 
15 10 15 

60 Pro Glu Ala Ala Gly Lys Asp Ser Glu He Phe Thr Val Asn Gly He 



50 



55 



wo 98/39448 



642 



PCTAJS98/04493 



20 



25 



30 



Leu Gly Glu Ser Val Thr Phe Pro Val Asn He Gin Glu Pro Arg Gin 
35 40 45 

5 

Val Lys He He Ala Trp Thr Ser Lys Thr Ser Val Ala Tyr Val Thr 
50 55 60 

Pro Gly Asp Ser Glu Thr Ala Pro Val Val Thr Val Thr His Arg Asn 
10 65 70 75 80 

Tyr Tyr Glu Arg He His Ala Leu Gly Pro Asn Tyr Asn Leu Val He 
85 90 95 



15 



Ser Asp Leu Arg Met Glu Asp Ala Gly Asp Tyr Lys Ala Asp He Asn 
100 105 110 



20 



Thr Gin Ala Asp Pro Tyr Thr Thr Thr Lys Arg Tyr Asn Leu Gin He 
115 120 125 

Tyr Arg Arg Leu Gly Lys Pro Lys He Thr Gin Ser Leu Met Ala Ser 
130 135 140 



Val Asn Ser Thr Cys Asn Val Thr Leu Thr Cys Ser Val Glu Lys Glu 
25 145 150 155 160 



30 



Glu Lys Asn Val Thr Tyr Asn Trp Ser Pro Leu Gly Glu Glu Gly Asn 
165 170 175 

Val Leu Gin He Phe Gin Thr Pro Glu Asp Gin Glu Leu Thr Tyr Thr 
180 185 190 



35 



Cys Thr Ala Gin Asn Pro Val Ser Asn Asn Ser Asp Ser He Ser Ala 
195 200 205 

Arg Gin Leu Cys Ala Asp He Ala Met Gly Phe Arg Thr His His Thr 
210 215 220 



Gly Leu Leu Ser Val Leu Ala Met Phe Phe Leu Leu Val Leu He Leu 
40 225 230 235 240 



45 



Ser Ser Val Phe Leu Phe Arg Leu Phe Lys Arg Arg Gin Asp Ala Ala 
245 250 255 

Ser Lys Lys Thr He Tyr Thr Tyr He Met Ala Ser Arg Asn Thr Gin 
260 265 270 



50 



Pro Ala Glu Ser Arg He Tyr Asp Glu He Leu Gin Ser Lys Val l.eu 

275 280 285 

Pro Ser Lys Glu Glu Pro Val Asn Thr Val Tyr Ser Glu Val Gin Phe 
290 295 300 



Ala Asp Lys Met Gly Lys Ala Ser Thr Gin Asp Ser Lys Pro Pro Gly 
55 305 310 315 320 

Thr Ser Ser Tyr Glu He Val He Xaa 
325 



60 
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(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

<D) TOPOLOGy: linear 
(xi) SEQUENCE DESCRIPTIC2N : SEQ ID NO: 477: 

10 Met Lys Leu Gin Cys Val Ser Leu Trp Leu Leu Gly Thr lie Leu He 

15 10 15 



15 



Leu Cys Ser Val Asp Asn His Gly Leu Arg Arg Cys Leu He Ser Thr 
20 25 30 

Asp Met His His He Glu Glu Ser Phe Gin Glu He Lys Arg Ala He 
35 40 45 



Gin Ala Lys Asp Thr Phe Pro Asn Val Thr He Leu Ser Thr Leu Glu 
20 50 55 60 

Thr Leu Gin He He Lys Pro Leu Asp Val Cys Cys Val Thr Lys Asn 
65 70 75 80 

25 Leu Leu Ala Phe Tyr Val Asp Arg Val Phe Lys Asp His Gin Glu Pro 
85 90 95 



30 



Asn Pro Lys He Leu Arg Lys He Ser Ser He Ala Asn Ser Phe Leu 
100 105 110 

Tyr Met Gin Lys Thr Leu Arg Gin Cys Gin Glu Gin Arg Gin Cys His 
US 120 125 



Cys Arg Gin Glu Ala Thr Asn Ala Thr Arg Val He His Asp Asn Tyr 
35 130 135 140 

Asp Gin Leu Glu Val His Ala Ala Ala He Lys Ser Leu Gly Glu Leu 
145 150 155 160 

40 Asp Val Phe Leu Ala Trp He Asn Lys Asn His Glu Val Met Ser Ser 
165 170 175 

Ala Xaa 

45 



(2) INFORMATION FOR SEQ ID NO: 478: 

50 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478: 



55 



Asp Thr Ala He Arg Val Ala Leu Ala Val Ala Val Leu Lys Thr Val 
15 10 15 



He Leu Gly Leu Leu Cys Leu Leu Leu Cys Gly Gly Gly Glu Gly Lys 
60 20 25 30 
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Val Ala Gly Arg Gin Ala Val Thr Ser Asp Gin Gin Ser Val Gly Arg 
35 40 45 

5 Arg Asp Val Tyr 
50 



10 (2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOIOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 



20 



30 



Met Gin Lys Lys Asn Ser Leu Phe Phe Phe Phe Ala Phe Tyr Tyr Glu 
15 10 IS 

Asn Lys Thr Asn Ala Pro Gly Glu Gly Ser Met lie Thr Arg Asn lie 
20 25 30 



Lys Glu Tyr Phe Leu Pro Phe Leu Phe Cys Cys Val Glu Ala Ser lie 
25 35 40 45 



Ala lie Asn Lys Leu Asn Tyr Leu His Trp Thr His Phe Gin 
50 55 60 



<2) INFORMATION FOR SEQ ID NO: 480: 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480: 

40 Met Pro Gly Leu Ser Leu lie Leu Thr Val Thr Leu Leu Ala Val Ser 
15 10 15 



45 



Asp Ser Ala Ala Thr Cys lie Val Ala Lys Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 481: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481: 

55 

Met Ser Gly Pro Asp Val Glu Thr Pro Ser Ala He Gin He Cys Arg 
15 10 15 



He Met Arg Pro Asp Asp Ala Asn Val Ala Gly Asn Val His Gly Gly 
60 20 25 30 
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Thr lie Leu Lys Met lie Glu Glu Ala Gly Ala lie lie Ser Thr Arg 
35 40 45 

His Cys Asn Ser Gin Asn Gly Glu Arg Cys Val Ala Ala Leu Ala Arg 
50 55 60 

Val Glu Arg Thr Asp Phe Leu Ser Pro Met Cys He Gly Glu Val Ala 
65 70 75 80 

His Val Ser Ala Glu He Thr Tyr Thr Ser Lys His Ser Val Glu Val 
85 90 95 



Gin Val Asn Val Met Ser Glu Asn He Leu Thr Gly Ala Lys Lys Leu 
15 100 105 110 



20 



Thr Asn Lys Ala Thr Leu Trp Tyr Val Pro Leu Ser Leu Lys Asn Val 
115 120 125 

Asp Lys Val Leu Glu Val Pro Pro Val Val Tyr Ser Arg Xaa Glu Gin 
130 135 140 



25 



Glu Glu Glu Gly Arg Lys Arg Tyr Glu Ala Gin Lys Leu Glu Arg Met 
145 150 155 160 

Glu Thr Lys Trp Arg Asn Gly Asp He Val Gin Pro Val Leu Asn Pro 
165 170 175 



Glu Pro Asn Thr Val Ser Tyr Ser Gin Ser Ser Leu He His Leu Val 
30 180 185 190 



35 



Gly Pro Ser Asp Cys Thr Leu His Gly Phe Val His Gly Gly Val Thr 
195 200 205 

Met Lys Leu Met Asp Glu Val Ala Gly He Val Ala Ala Arg His Cys 
210 215 220 



40 



Lys Thr Asn He Val Thr Ala Ser Val Asp Ala He Asn Phe His Asp 
225 230 235 240 

Lys He Arg Lys Gly Cys Val He Thr He Ser Gly Arg Met Thr Phe 

245 250 255 



Thr Ser Asn Lys Ser Met Glu He Glu Val Leu Val Asp Ala Asp Pro 
45 260 265 270 



50 



Val Val Asp Ser Ser Gin Lys Arg Tyr Arg Ala Ala Ser Ala Phe Phe 
275 280 285 

Thr Tyr Val Ser Leu Ser Gin Glu Gly Arg Ser Leu Pro Val Pro Gin 
290 295 300 



55 



Leu Val Pro Glu Thr Glu Asp Glu Lys Lys Arg Phe Glu Glu Gly Lys 
305 310 315 320 

Gly Arg Tyr Leu Gin Met Lys Ala Lys Xaa Gin Gly His Ala Xaa Xaa 
325 330 335 



60 



Gin Pro Xaa 



wo 98/39448 



646 



PCT/US98/04493 



(2) INFORMATION FOR SEQ ID NO: 482: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 

Met Leu Asn Ser Asn lie Asn Asp Lieu Leu Met Val Thr Tyr Leu Ala 
15 10 15 

15 Asn Leu Thr Gin Ser Gin He Ala Leu Asn Glu Lys Leu Val Asn Leu 
20 25 30 



20 



(2) INFORMATION FOR SEQ ID NO: 483: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483: 

30 

Met Arg Glu Thr Ser He Arg Val Leu Leu Met Leu Pro Ala Leu Glu 
15 10 15 

Ser Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr Arg He Gly 
35 20 25 30 

Cys Phe Lys Thr He Thr Cys Trp Pro Thr Ser Leu Thr Gin Arg Xaa 
35 40 45 

40 



45 (2) INFORMATI(a« FOR SEQ ID NO: 484: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
50 {D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484: 

Met Tyr Met Tyr Ser Leu Asn Val Phe Leu Ser Phe He Phe Leu Ala 
15 10 15 

55 

Leu Val Phe Lys Cys Val His Val Cys Gin Gly Ala Asn Ala Phe Leu 
20 25 30 

Phe Leu Lys Leu Val Phe 
60 35 
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(2) INFORMATION FOR SEQ ID NO: 485: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) I;ENGTH: 61 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 
10 (xi) SEQUENCE DESfCRIPTION : SEQ ID NO: 485: 

Met Gly Leu Arg Leu lie Cys Leu Glu Leu Thr Met Val Lys Ala Leu 

15 10 15 

15 Val Cys Glu Met Phe Leu Phe Phe Leu Met Thr Gin Lys Leu He Trp 
20 25 30 



20 



Gin Glu Cys Thr Glu Lys Phe Ala Lys Leu Leu Val Gin Leu He Ser 
35 40 45 

Leu Val Phe Ala Trp Glu Phe Phe Ser Glu Asp Thr Pro 
50 55 60 



25 

(2) INFORMATION FOR SEQ ID NO: 486: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486: 

Met Leu Ala Ala Arg Leu Val Cys Leu Arg Thr Leu Pro Ser Arg Val 
35 1 5 10 15 

Phe His Pro Ala Phe Thr Lys Ala Ser Pro Val Val Lys Asn Ser He 
20 25 30 

40 Thr Lys Asn Gin Trp Leu Leu Thr Pro Ser Arg Glu Tyr Ala Thr Lys 
35 40 45 

Thr Arg He Gly He Arg Arg Gly Arg Thr Gly Gin Glu Leu Lys Glu 

50 55 60 

45 

Ala Ala Leu Glu Pro Ser Met Glu Lys He Phe Lys He Asp Gin Met 
65 70 75 80 

Gly Arg Trp Phe Val Ala Gly Gly Ala Ala Val Gly Leu Gly Ala Leu 
50 85 90 95 

Cys Tyr Tyr Gly Leu Gly Leu Ser Asn Glu He Gly Ala He Glu Lys 
100 105 110 

55 Ala Val He Trp Pro Gin Tyr Val Lys Asp Arg He His Ser Thr Tyr 
115 120 125 

Met Tyr Leu Ala Gly Ser He Gly Leu Thr Ala Leu Ser Ala He Ala 
130 135 140 

60 
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lie Ser Arg Thr Pro Val Leu Met Asn Phe Met Met Arg Gly Ser Trp 
145 150 155 160 

Val Thr He Gly Val Thr Phe Ala Ala Met Val Gly Ala Gly Met Leu 
5 165 170 175 

Val Arg Ser He Pro Tyr Asp Gin Ser Pro Gly Pro Lys His Leu Ala 
180 185 190 

10 Trp Leu Leu His Ser Gly Val Met Gly Ala Val Val Ala Pro Leu Thr 
195 200 205 



15 



30 



40 



He Leu Gly Gly Pro Leu Leu lie Arg Ala Ala Trp Tyr Thr Ala Gly 
210 215 220 

He Val Gly Gly Leu Ser Thr Val Ala Met Cys Ala Pro Ser Glu Lys 
225 230 235 240 



Phe Leu Asn Met Gly Ala Pro Leu Gly Val Gly Leu Gly Leu Val Phe 
20 245 250 255 

Val Ser Ser Leu Gly Ser Met Phe Leu Pro Pro Thr Thr Val Ala Gly 
260 265 270 

25 Ala Thr Leu Tyr Ser Val Ala Met Tyr Gly Gly Leu Val Leu Phe Ser 
275 280 285 



Met Phe Leu Leu Tyr Asp Thr Gin Lys Val He Lys Arg Ala Glu Val 
290 295 300 

Ser Pro Met Tyr Gly Val Gin Lys Tyr Asp Pro He Asn Ser. Met Leu 
305 310 315 320 



Ser He Tyr Met Asp Thr I^eu Asn He Phe Met Arg Val Ala Thr Met 
35 325 330 335 



Leu Ala Thr Gly Gly Asn Arg Lys Lys Xaa 
340 345 



(2) INFORMATION FOR SEQ ID NO: 487: 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 

50 Met Glu Glu Val X^eu Leu Leu Gly Leu Lys Asp Arg Glu Gly Tyr Thr 
15 10 15 

Ser Phe Trp Asn Asp Cys He Ser Ser Gly Leu Arg Gly Cys Met Leu 
20 25 30 

55 

He Glu Leu Ala Leu Arg Gly Arg Leu Gin Leu Glu Ala Cys Gly Met 
35 40 45 



Arg Arg Lys Ser Leu Leu Thr Arg Lys Val He Cys Lys Ser Asp Ala 
60 50 55 60 
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Pro Thr Gly Asp Val I^eu Leu Asp Glu Ala Leu Lys His Val Lys Glu 
65 70 75 80 

5 Thr Gin Pro Pro Glu Thr Val Gin Asn Trp lie Glu Leu Leu Ser Gly 

85 90 95 

Glu Thr Trp Asn Pro Leu Lys Leu His Tyr Gin Leu Arg Asn Val Arg 
100 105 110 

10 

Glu Arg Leu Ala Lys Asn Leu Val Glu Lys Gly Val Leu Thr Thr Glu 
115 120 125 

Lys Gin Asn Phe Leu Leu Phe Asp Met Thr Thr His Pro Leu Thr TVsn 
15 130 135 140 

Asn Asn lie Lys Gin Arg L»eu lie Lys Lys Val Gin Glu Ala Val Leu 
145 150 155 160 

20 Asp Lys Trp Val Asn Asp Pro His Arg Met Asp Arg Arg Leu Leu Ala 
165 170 175 



25 



Leu He Tyr Leu Ala His Ala Ser Asp Val Leu Glu Asn Ala Phe Ala 

180 185 190 

Pro Leu Leu Asp Glu Gin Tyr Asp Leu Ala Thr Lys Arg Val Arg Gin 
195 200 205 



Leu Leu Asp Leu Asp Pro Glu Val Glu Cys Leu Lys Ala Asn Thr Asn 
30 210 215 220 



35 



Glu Val Leu Trp Ala Val Val Ala Ala Phe Thr Lys Xaa 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 488: 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

45 Met Ala Gin Arg Met Val Trp Val Asp Leu Glu Met Thr Gly Leu Asp 

15 10 15 

lie Glu Lys Asp Gin lie lie Glu Met Ala Cys Leu lie Thr Asp Ser 
20 25 30 

50 

Asp Leu Asn lie Leu Ala Glu Gly Pro Asn Leu lie lie Lys Gin Pro 
35 40 45 

Asp Glu Leu Leu Asp Ser Met Ser Asp Trp Cys Lys Glu His His Gly 
55 50 55 60 

Lys Ser Gly Leu Thr Lys Ala Val Lys Glu Ser Thr He Thr Leu Gin 
65 70 75 80 



60 Gin Ala Glu 



Tyr Glu Phe Leu Ser Phe Val Arg Gin Gin Thr Pro Pro 
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85 



90 



95 



Gly Leu Cys Pro Leu Ala Gly Asn Ser Val His Glu Asp Lys Lys Phe 
100 105 110 

5 

Leu Asp Lys Tyr Met Pro Gin Phe Met Lys His Leu His Tyr Arg lie 
115 120 125 

He Asp Val Ser Thr Val Lys Glu Leu Cys Arg Arg Trp Tyr Pro Glu 
10 130 135 140 

Glu Tyr Glu Phe Ala Pro Lys Lys Ala Ala Ser His Arg Ala Leu Asp 
145 150 155 160 

15 Asp He Ser Glu Ser He Lys Glu Leu Gin Phe Tyr Arg Asn Asn He 
165 170 175 



20 



Phe Lys Lys Lys He Asp Glu Lys Lys Arg Lys He He Glu Asn Gly 
180 185 190 

Glu Asn Glu Lys Thr Val Ser Xaa 
195 200 



25 

(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 

Met Ala Thr Thr Ala Ala Pro Ala Gly Gly Ala Arg Asn Gly Ala Gly 
35 1 5 10 15 

Pro Glu Trp Gly Gly Phe Glu Glu Asn He Gin Gly Gly Gly Ser Ala 
20 25 30 

40 Val He Asp Met Glu Asn Met Asp Asp Thr Ser Gly Ser Ser Phe Glu 
35 40 45 

Asp Met Gly Glu Leu His Gin Arg Leu Arg Glu Glu Glu Val Asp Ala 
50 55 60 

45 

Asp Ala Ala Asp Ala Ala Ala Ala Glu Glu Glu Asp Gly Glu Phe Leu 
65 70 75 80 

Gly Met Lys Gly Phe Lys Gly Gin Leu Ser Arg Gin Val Ala Asp Gin 
50 85 90 95 

Met Trp Gin Ala Gly Lys Arg Gin Ala Ser Arg Ala Phe Ser Leu Tyr 
100 105 110 

55 Ala Asn He Asp He Leu Arg Pro Tyr Phe Asp Val Glu Pro Ala Gin 
115 120 125 



60 



Val Arg Thr Gly Leu Leu Glu Ser Met He Pro He Lys Met Val Asn 
130 135 140 
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Phe Pro Gin Lys lie Ala Gly Glu Leu Tyr Gly Pro Leu Met Leu Val 
145 150 155 160 

Phe Thr Leu Val Ala lie Leu Leu His Gly Met Lys Thr Ser Asp Thr 
165 170 175 

He He Arg Glu Gly Thr Leu Met Gly Thr Ala He Gly Thr Cys Phe 
180 185 190 

Gly Tyr Trp Leu Gly Val Ser Ser Phe He Tyr Phe Leu Ala Tyr Leu 
195 200 205 

Cys Asn Ala Gin He Thr Met Leu Gin Met Leu Ala Leu Leu Gly Tyr 
210 215 220 

Gly Leu Phe Gly His Cys He Val Leu Phe He Thr Tyr Asn He His 
225 230 235 240 

Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val Gly Gly Leu Ser 
245 250 255 

Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr Val Gly Pro Thr 
260 265 270 

Gin Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu His Met Leu Phe 
275 280 285 

Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Val Glu Gly He Leu 
290 295 300 

Asp Thr Leu Glu Gly Pro Asn He Pro Pro He Gin Arg Val Pro Arg 
305 310 315 320 

Asp He Pro Ala Met Leu Pro Ala Ala Arg Leu Pro Thr Thr Val Leu 
325 330 335 

Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gin Ser His Xaa 
340 345 350 



(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 265 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490; 



Met Arg Gly Ser Arg Gly Gly Trp 
1 5 

Glu Ser Gly Thr Ser Gly Gly Gly 
20 

Thr Phe Tyr Ser Glu Val Lys Gin 
35 40 



Ala Gly Glu Met Ala Ala Ser Gly 
10 15 

Gly Ser Thr Glu Glu Ala Phe Met 
25 30 

He Glu Lys Arg Asp Ser Val Leu 

45 



Thr Ser Lys Asn Gin He Glu Arg Leu Thr Arg Pro Gly Ser Ser Tyr 
50 55 60 
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Phe Asn Leu Asn Pro Phe Glu Val Leu Gin lie Asp Pro Glu Val Thr 

65 70 75 80 

5 Asp Glu Glu lie Lys Lys Arg Phe Arg Gin Leu Ser lie Leu Val His 

85 90 95 



10 



Pro Asp Lys Asn Gin Asp Asp Ala Asp Arg Ala Gin Lys Ala Phe Glu 
100 105 110 

Ala Val Asp Lys Ala Tyr Lys Leu Leu Leu Asp Gin Glu Gin Lys Lys 
115 120 125 



Arg Ala Leu Asp Val He Gin Ala Gly Lys Glu Tyr Val Glu His Thr 
15 130 135 140 

Val Lys Glu Arg Lys Lys Gin Leu Lys Lys Glu Gly Lys Pro Thr He 
145 150 155 160 

20 Val Glu Glu Asp Asp Pro Glu Leu Phe Lys Gin Ala Val Tyr Lys Gin 
165 170 175 



25 



Thr Met Lys Leu Phe Ala Glu Leu Glu He Lys Arg Lys Glu Arg Glu 
180 185 190 

Ala Lys Glu Met His Glu Arg Lys Arg Gin Arg Glu Glu Glu He Glu 
195 200 205 



Ala Gin Glu Lys Ala Lys Arg Glu Arg Glu Trp Gin Lys Asn Phe Glu 
30 210 215 220 

Glu Ser Arg Asp Gly Arg Val Asp Ser Trp Arg Asn Phe Gin Ala Asn 
225 230 235 240 

35 Thr Lys Gly Lys Lys Glu Lys Lys Asn Arg Thr Phe Leu Arg Pro Pro 
245 250 255 

Lys Val Lys Met Glu Gin Arg Glu Xaa 
260 265 

40 



(2) INFORMATION FOR SEQ ID NO: 491: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491: 

50 

Asp Ser Met Pro Thr Cys Pro Leu Xaa Ala Ser Leu Glu Cys Gly Pro 
16 10 15 

Leu Leu Pro Val Arg Leu Cys Cys Leu 
55 20 25 



(2) INFORMATION FOR SEQ ID NO: 492: 

60 



W0 98A39448 



653 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENCSTH: 159 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492: 



Met Asn Glu Tyr Arg Val Pro Glu 
1 5 

Lys Ser Leu Ser Phe Leu Phe Glu 
20 

Tyr lie Tyr Ala Val Thr Pro Leu 
35 40 



Leu Asn Val Gin Asn Gly Val Leu 
10 15 

Tyr lie Gly Glu Met Gly Lys Asp 

25 30 

Leu Glu Asp Ala Leu Met Asp Arg 
45 



Asp Leu Val His Arg Gin Thr Ala Ser Ala Val Val Gin His Met Ser 
50 55 60 

Leu Gly Val Tyr Gly Phe Gly Cys Glu Asp Ser Leu Asn His Leu Leu 

65 70 75 80 

Asn Tyr Val Trp Pro Asn Val Phe Glu Thr Ser Pro His Val lie Gin 
85 90 95 

Ala Val Met Gly Ala Leu Glu Gly Leu Arg Val Ala lie Gly Pro Cys 
100 105 110 

Arg Met Leu Gin Tyr Cys Leu Gin Gly Leu Phe His Pro Ala Arg Lys 
115 120 125 

Val Arg Asp Val Tyr Trp Lys lie Tyr Asn Ser lie Tyr lie Gly Ser 
130 135 140 

Gin Asp Ala Leu lie Ala His Tyr Pro Arg lie Tyr Gin Arg Xaa 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 

Met lie Ser Asp Asn Ser Ala Glu Asn lie Ala Leu Val Thr Ser Met 
15 10 15 

Tyr Asp Gly Leu Leu Gin Ala Gly Ala Arg Leu Cys Pro Thr Val Gin 

20 25 30 

Leu Glu Asp lie Arg Asn Leu Gin Asp Leu Thr Pro Leu Lys Leu Ala 
35 40 45 

Ala Lys Glu Gly Lys lie Glu lie Phe Arg His lie Leu Gin Arg Glu 
50 55 60 

Phe Ser Gly Leu Ser His Leu Ser Arg Lys Phe Thr Glu Trp Cys Tyr 
65 70 75 80 
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Gly Pro Val Arg Val Ser Leu Tyr Asp Leu Ala Ser Val Asp Ser Cys 
85 90 95 

Glu Glu Asn Ser Val Leu Glu lie lie Ala Phe His Cys Lys Ser Pro 
100 105 110 

His Arg His Arg Met Val Val Leu Glu Pro Leu Asn Lys Leu Leu Gin 
115 120 125 

Ala Lys Trp Asp Leu Leu He Pro Lys Phe Phe Leu Asn Phe Leu Cys 
130 135 140 

Asn Leu He Tyr Met Phe He Phe Thr Ala Val Ala Tyr His Gin Pro 
145 150 155 160 

Thr Leu Lys Lys Gin Ala Ala Pro His Leu Lys Ala Glu Val Gly Asn 
165 170 175 

Ser Met Leu Leu Thr Gly His He Leu He Leu Leu Gly Gly He Tyr 
180 185 190 

Leu Leu Val Gly Gin Leu Trp Tyr Phe Trp Arg Arg His Val Phe He 
195 200 205 

Trp He Ser Phe He Asp Ser Tyr Phe Glu He Leu Phe Leu Phe Gin 
210 215 220 

Ala Leu Leu Thr Val Val Ser Gin Val Leu Cys Phe Leu Xaa He Glu 
225 230 235 240 

Trp Tyr Leu Pro Leu Leu Val Ser Ala Leu Val Leu Gly Trp Leu Asn 
245 250 255 

Leu Leu Tyr Tyr Thr Arg Gly Phe Gin His Thr Gly He Tyr Ser Val 
260 265 270 

Met He Gin Lys Pro Trp Xaa 
275 



(2) It3FOPMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 

Met He Arg Cys Gly Leu Ala Cys Glu Arg Cys Arg Trp He Leu Pro 
15 10 15 

Leu Leu Leu Leu Ser Ala He Ala Phe Asp He He Ala Leu Ala Gly 
20 25 30 

Arg Gly Trp Leu Gin Ser Ser Asp His Gly Gin Thr Ser Ser Leu Trp 
35 40 45 

Trp Lys Cys Ser Gin Glu Gly Gly Gly Ser Gly Ser Tyr Glu Glu Gly 
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50 55 60 

Cys Gin Ser Leu Met Glu Tyr Ala Trp Gly Arg Ala Ala Ala Ala Met 
65 70 75 80 

Leu Phe Cys Gly Phe lie lie Leu Val lie Cys Phe lie Leu Ser Phe 
85 90 95 

Phe Ala Leu Cys Gly Pro Gin Met Leu Val Phe Leu Arg Val lie Gly 
100 105 110 

Gly Leu Leu Ala Leu Ala Ala Val Phe Gin He He Ser Leu Val He 

115 120 125 

Tyr Pro Val Lys Tyr Thr Gin Thr Phe Thr Leu His Ala Asn Xaa Ala 
130 135 140 

Val Thr Tyr He Tyr Asn Trp Ala Tyr Gly Phe Gly Trp Ala Ala Thr 
145 150 155 160 

He He Leu He Gly Cys Ala Phe Phe Phe Cys Cys Leu Pro Asn Tyr 

165 170 175 

Glu Asp Asp Leu Leu Gly Asn Ala Lys Pro Arg Tyr Phe Tyr Thr Ser 
180 185 190 

Ala 



(2) INFORMATICaJ FOR SEQ ID NO: 495: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495: 

Met Ala Ala Gly Asp Gin Val Phe Ser Gly Ala Gly His Val Xaa Glu 
15 10 15 

His Val Ala Gly Gly Arg His Ala Trp Leu Leu Thr Trp Gin Ser Ala 

. 20 25 30 

Cys Pro Ala Asn Arg Leu Ser Leu Val Pro Leu Val Pro Ser Ala Ser 
35 40 45 

Met Thr Arg Leu Met Arg Xaa Arg Thr Ala Ser Gly Ser Ser Val He 
50 55 60 

Leu Trp Met Ala Pro Ala Ala Ala Pro Thr Pro Ala Arg Ala Pro Glu 
65 70 75 80 

Ala Ala Pro Thr Pro Ala Arg Ala Pro Ala Ala Ala Arg Thr Pro Ala 
85 90 95 

Arg Gly Pro Thr Trp Thr Ser Pro Pro Thr Arg Val Leu Leu Gly Thr 
100 105 110 
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Xaa Pro Gly Pro Ser Pro Trp Arg Ser Pro Ala Arg Arg Pro Ala Gin 
115 120 125 

L.eu Pro Pro Pro Asp Ser Asp Leu Cys Ser Gly Pro Leu Leu Pro Gly 
5 130 135 140 

Pro Phe Ser Pro Pro Ala Cys His Thr Ala Pro Asn Ser Val Leu He 
145 150 155 160 

10 Gin Ser Leu Phe Cys Lys Ser Glu Leu Trp Trp Arg Gin Met Arg Ser 
165 170 175 



15 



He Thr Trp Val Pro Ser Pro Lys Ala Gly Tip Arg Trp Thr Lys Gly 
180 185 190 

Arg Lys Gin Ala Ser Pro His Arg He Leu Phe His Xaa 
195 200 205 



20 

{2) INFORMATIOT FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 

Met Ala Leu Thr Leu Leu Pro Ser Val Ser Arg Leu Pro Gly Glu Arg 
30 1 5 10 15 

Met Ala Ala Ser Gly Leu Pro Tyr Val Leu His His Lys Ser Ser Leu 
20 25 30 

35 Met Lys Val He Phe Phe Pro Tyr Pro Val Leu Pro Leu Pro Ala Pro 
35 40 45 

Asn Gly Thr Trp Val Pro Arg Leu Val Leu Gly Leu Gly Ser Gly Asp 
50 55 60 

40 

Gin Val His Tyr Leu Pro He Ser Ser Ser He Val Asn Tyr Gly Thr 
65 70 75 80 

Ser Val Ser Gly Lys Ser Trp Val Phe Leu Val Tyr Pro Leu His Pro 
45 85 90 95 

Thr Pro Thr Trp Ser Thr Arg Cys Phe Gin Val Trp Asp Leu Leu Ser 
100 105 110 

50 Val Glu Leu Pro Asp Lys Gly Glu Gly Asn Thr Arg Arg Ala Ser Gly 
115 120 125 

Val Pro Gly Leu Ser Gin Leu Pro Thr Ser His Lys Pro He Lys Gin 
130 135 140 

55 

Glu Tyr Xaa 
145 



60 
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(2) INFORMATION FOR SBQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICW: SBQ ID NO: 497: 

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val 
15 10 15 

Gly Pro Cys Trp Pro Ser Thr Gly Asn Cys Cys Leu Cys Glu Val Gly 
20 25 30 

Ala Ala Leu Pro Pro Arg Gly Pro Ser Leu Ser Asp Cys Leu Gly Leu 
35 40 45 

Pro Pro Trp Thr Pro Trp Gly Pro Ala Trp Thr Leu Ala Gin Ser Xaa 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498: 

Met Ser Thr Gly Ala Leu Asn Thr Ser Pro Pro Ala Ser Asn Arg Leu 

15 10 15 

Glu Ser Thr Leu Asn Glu Tyr Leu He Gin Pro Gin Leu His Cys Ser 
20 25 30 

Ser Val Gin Arg Leu Thr Leu Lys Trp Gly Cys Ser Ser Leu Gin Arg 
35 40 45 

Asp Gly Gin Ala Val Pro Trp Gly Leu Trp Gin Arg Ala Tyr Pro Ser 
50 55 60 

Leu Leu Pro Thr Leu Pro Ser Asp Leu Leu Arg Pro His Ala Val Thr 
65 70 75 80 

Pro Ser Val Ser Val Ser Val His Thr Cys Glu Ser Ser Xaa 
85 90 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499: 
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Met Phe Leu lie Phe Val Tyr Phe Leu Lys Xaa Leu Phe Ser Ser Ser 
15 10 15 

Leu Pro Phe Leu Trp Leu 
20 



(2) INFORMATICS^ FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 500: 

Arg Gly Gly Leu Cys Pro Leu Leu Val Pro Gly Pro Leu Ala Arg Gin 
15 10 15 

Glu Pro Ser Pro Ser Leu Gin Gly Cys Ser Glu Ser Pro Val Gly Met 
20 25 30 

Asp 



(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 

Met Gin Phe Leu Leu Thr Ala Phe Leu Leu Val Pro Leu Leu Ala Leu 
15 10 15 

Cys Asp Val Pro lie Ser Leu Gly Phe Ser Pro Ser 
20 25 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 

Pro Gly Lys Pro Gin Ala Cys Pro Glu Leu Thr Ser Val Leu Pro 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 503: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503: 

Asn Lys Ser Leu Xaa Ser Cys Leu Phe Val Leu His Phe Val Leu His 
15 10 15 

Cys Xaa Phe 



(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504: 

Met Glu Lys Thr His Arg Leu Arg lie Arg Asn Pro Cys Leu Gin Phe 
15 10 15 

Ser lie Leu Asn Leu Phe Leu Leu Lys Met He Val Ser 
20 25 



(2) INFORMATION FOR SEQ ID NO: 505: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 

Met Val Asp He Ser Lys Met His Met He Leu Tyr Asp Leu Gin Gin 

15 10 15 

Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gin He Asp Thr Leu 
20 25 30 

Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly 
35 40 45 

Pro Ser Ser Phe Gin Asn Pro Ala Ser Ser Pro Ser Ser Trp Thr His 
50 55 60 

Glu Glu Glu Pro Gly Tyr Phe Pro Gin Tyr Xaa 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 cunino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

Leu Pro Leu Ala Glu Leu Lys Asn Trp Val 
15 10 



(2) INFORMATION FOR SEQ ID NO: 507: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 

15 

Met Leu Trp Phe Gly Gly Cys Ser Ala Val Asn Ala Thr Gly His Leu 
1 5 10 15 

Ser Asp Thr Leu Trp Leu lie Pro lie Thr Phe Leu Thr lie Gly Tyr 
20 20 25 30 

Gly Asp Val Val Pro Gly Thr Met Trp Gly Lys He Val Cys Leu, Cys 
35 40 45 

25 Thr Gly Val Met Gly Val Cys Cys Thr Ala Leu Leu Val Ala Val Val 
50 55 60 

Ala Arg Lys Leu Glu Phe Asn Lys Ala Glu Lys His Val His Asn Phe 
65 70 75 80 

30 

Met Met Asp He Gin Tyr Thr Lys Glu Met Lys Glu Ser Ala Ala Arg 

85 90 95 

Val Leu Gin Glu Ala Trp Met Phe Tyr Lys His Thr Arg Arg Lys Glu 
35 100 105 110 

Ser His Ala Ala Arg Arg His Gin Arg Xaa Leu Leu Ala Ala He Asn 
115 120 125 

40 Ala Phe Arg Gin Val Arg Leu Lys His Arg Lys Leu Arg Glu Gin Val 
130 135 140 

Asn Ser Met Val Asp He Ser Lys Met His Met He Leu Tyr Asp Leu 
145 150 155 160 

45 

Gin Gin Asn Leu Ser Ser Ser His Arg Ala Leu Glu Lys Gin He Asp 
165 170 175 

Thr Leu Ala Gly Lys Leu Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala 
50 180 185 190 

Leu Gly Pro Arg Gin Leu Pro Glu Pro Ser Gin Gin Ser Lys Xaa 
195 200 205 

55 

(2) INFORMATION FOR SEQ ID NO: 508: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGfY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508: 

Met Trp Arg Cys Arg Gly Lys Leu Ser Phe Pro Leu Phe Ala Val Val 
1 5 10 15 

He Val Ser Cys Arg Lys Asp Gly Pro Asp Ala Ala Ala Ala Pro Ala 
20 25 30 

Val Xaa Lys Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 

Met Ala Leu Val Ala Leu Phe Thr Gin Leu Met Arg Xaa Leu Gly Arg 
15 10 15 

Cys Pro Gin 



(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 

Met Thr Phe Pro Phe Glu Lys Glu Asn Ser Cys Phe Gin Cys Leu Leu 
15 10 15 

Phe Asp Ser Trp Arg Glu Gin Thr Arg Thr Asn He Gin Pro Gin Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 



Met His Leu Leu Asp Phe Phe Arg Asp Leu Val Leu Leu Val Leu Leu 
15 10 15 
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Ala Leu Leu Asp Ser Phe Trp Leu Glu Val Gin Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 

Met Cys Leu He His Phe He Lys He He Leu Val Phe He Leu Lys 
15 10 15 

Leu Trp Leu Tyr Ser Gin Lys Cys Pro Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 513: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

Met He His Val His Glu Trp Asn Asp Gin Met Leu Met Val Tyr He 

15 10 15 

Phe Leu Tyr Pro Val Ser He Thr Phe Leu Asn Leu Cys Ser Leu Thr 
20 25 30 

Cys 



(2) INFORMATION FOR SEQ ID NO: 514: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 

Leu Asn Glu Ser Tyr Val Ser Arg Ala Gly Gly Trp Phe Ser Met Phe 
15 10 15 

Xaa Leu He Phe Phe Leu Leu Ala Leu Gly Ser Xaa Leu Cys Leu Leu 

20 25 30 

Leu Cys Leu Pro Ser Phe Asn Lys Thr Arg Arg Lys Gin Lys Pro 
35 40 45 
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(2) INFORMATION FOR SEQ ID NO: 515: 

<!) SE3C?UKNCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515: 

Ser Ser Lys Thr Pro Leu Pro Ser Glu Arg Arg Trp He Ser Gly Ser 
15 10 15 

Ser Leu Met Ala Pro Arg Pro Trp Leu Leu Gly He Ala Leu Leu Gly 
20 25 30 

Leu Trp Ala lieu Glu Pro Ala Leu Gly His Trp 
35 40 



(2) INFORMATION FOR SEQ ID NO: 516: 

(i) SEQUEtJCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516: 

Leu Asn Trp 
1 



(2) INFORMATION FOR SEQ ID NO: 517: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LQ^GTH: 174 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 

Phe Ala Phe Cys Ala Glu Leu Met He Gin Asn Trp Thr Leu Gly Ala 
15 10 15 

Val Asp Ser Gin Met Asp Asp Met Asp Met Asp Leu Asp Lys Glu Phe 
20 25 30 

Leu Gin Asp Leu Lys Glu Leu Lys Val Leu Val Ala Asp Lys Asp Leu 
35 40 45 

Leu Asp Leu His Lys Ser Leu Val Cys Thr Ala Leu Arg Gly Lys Leu 
50 55 60 

Gly Val Phe Ser Glu Met Glu Ala Asn Phe Lys Asn Leu Ser Arg Gly 
65 70 75 80 

Leu Val Asn Val Ala Ala Lys Leu Thr His Asn Lys Asp Val Arg Asp 
85 90 95 

Leu Phe Val Asp Leu Val Glu Lys Phe Val Glu Pro Cys Arg Ser Asp 
100 105 110 
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His Trp Pro Leu Ser Asp Val Arg 
115 120 

5 Ser Val His Ser Leu Asp Gly Phe 
130 135 

Thr Trp Ala Pro Ser Ala Ala Ala 
145 150 

10 

Val Pro Pro Asn Ala Pro Pro Thr 
165 



Phe Phe Leu Asn Gin Tyr Ser Ala 
125 

Arg His Gin Ala Ser Gly Thr Ala 
140 

Ser Cys Ala Cys lie Met Thr Glu 
155 160 

Leu Thr lie Lys Leu Leu 
170 



15 

(2) INFORMATION FOR SEQ ID NO; 518: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 

Met Trp Lys Asn Leu Gly Ser Gly Ser Val Phe Val Thr Trp Phe Ser 
25 1 5 10 15 

Leu Val Met He Leu Ser Gly He Gly Pro Leu Gly Asp Ala Glu Asp 
20 25 30 

30 Ser He Ser Asp Val Ser His Arg Leu Arg Pro 
35 40 



35 {2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 



Phe Gin Phe Pro Leu Leu Thr He Ala Leu Gin Phe Leu 
15 10 

45 



<2) INFORMATION FOR SEQ ID NO: 520: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 

55 

Met His Tyr Val He Val Leu Ser Leu Phe Val Val Leu Glu Lys Lys 
15 10 15 



Asn Lys Met Gly Ser Asp Gly Cys Leu Arg Lys Asn Gly Ser 
60 20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 andno acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 

Met Ser Arg Ser lie Val Leu Arg Gly Ser Leu Phe Leu Phe Phe Ser 
15 10 15 

His Tyr Thr Leu Lys Leu Leu Ser Val lie Lys Gin Thr Asn Arg Lys 
20 25 30 

lie Val Trp Glu Lys Pro Cys He Arg Leu Phe Tyr Xaa Val Leu 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 

Met Pro Leu Pro Val Leu Leu Cys Leu Thr Leu Pro Met Pro Leu Pro 
15 10 15 

Ser Ala Thr Ala Arg Gly Gly Asn Arg Thr 
20 25 



(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523: 

Ser Ser He Pro Val Ser He Leu He Gly Met Lys Leu He Leu Tyr 
15 10 15 

Leu Leu He Thr Glu Ser Gly Ser His Glu Lys Lys Ser Phe Tyr Pro 
20 25 30 

Ser Phe Lys Tyr Met Phe Lys He He He Tyr Val Ser Ala Tyr Cys 
35 40 45 

Arg Thr Ala Leu Arg Ala Thr Val Ser His 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

<B) TYPE: amino acid 

(D) T0POLCX3Y: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524: 

Asn Arg Thr Leu Leu Phe Leu He Leu Phe Val Leu Phe Gly Leu Gly 
15 10 15 

Tyr Gly Phe 



(2) INFORMATION FOR SEQ ID NO: 525: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525: 

Met Phe Leu Leu Val Leu Ser Val Phe Cys Asp Phe Met Cys Ser He 
15 10 15 

Ala Pro Arg Cys His Ala Leu Ser Leu Val Ser Leu Arg Ala Gin His 
20 25 30 

Leu Ser Leu Phe He Thr Cys His 
35 40 



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 



Met Leu Leu Phe He Leu Leu Thr 
1 5 

Val Ser Ser Trp Lys Thr Phe Leu 
20 

Arg Glu His Pro Glu Gly Ser Arg 
35 40 

Pro Gly Ala His Cys Leu His Cys 
50 55 



Leu Ser Ser Gly Cys Arg Leu Leu 
10 15 

Pro His Phe Ser Leu Pro Gly Pro 
25 30 

Thr Trp Phe Phe Arg Tyr Trp Glu 
45 

Ala 



(2) INFORMATION FOR SEQ ID NO: 527: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527: 

5 

Ala Arg Leu Leu Leu Phe Leu Ser Ser Val His Pro Ser lie Met Pro 
15 10 15 

Ser Cys Asn Gin Leu 
10 20 



15 



30 



(2) INFORMATION FOR SEQ ID NO: 528: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528: 

Met Ser Leu Thr Ser Ser Leu Thr Phe Leu Ser His lie Leu Leu Leu 
15 10 15 

25 Pro Gin Lys Leu Gin Phe Leu Ser Trp Met Glu Arg Gin Gin Arg Cys 
20 25 30 



Thr Gly Val Ala Lys Tyr Ala 
35 



(2) INFORMATION FOR SEQ ID NO: 529: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»^: SEQ ID NO: 529: 

40 

Met Val Leu Arg Leu lie Gin Leu lie Phe Leu He Phe Phe He His 

15 10 15 

He He He Leu Leu He Pro Gly Ser Arg Pro Cys Gly Ser Trp Val 
45 20 25 30 

Asn Asp Arg Xaa Leu Gly Leu Arg Asp Val Thr His Leu He Tyr Leu 
35 40 45 

50 His Trp Val His Gly His Leu Pro Trp Cys His Pro Tyr He Gin Val 

50 55 60 

Glu Phe Ser Ala Leu He Glu Ser Thr Ala Gin Leu Gly Leu Pro Phe 
65 70 75 80 

55 

Ser Trp Val Arg Val He His Pro Phe Leu Val Leu Pro Cys Leu Tyr 
85 90 95 

Ser Pro Gly Leu Lys Asn Gly He Phe Leu Phe Leu Leu Arg Ala Met 
60 100 105 110 
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Pro Gly Gly Met Phe Pro Gly Asn Leu Glu Ala Phe Arg Val Pro Val 
115 120 125 



(2) INFORMATIC»r FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGy: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 

Met Gly Ser Ser Val Leu Pro Phe Cys Val Cys Val Thr Ser Pro Ser 
15 10 15 

Leu Gly Gly Arg Cys He Gin Gly Arg Phe Ala Ser His Ser Lys Phe 
20 25 30 

Trp Gly Phe Gly Xaa Lys Thr Ala Ser Phe Gly Ala Val Gly Glu Thr 
35 40 45 

Pro Pro Asp Gin Glu Pro Gin Lys Glu Thr Glu Pro Ala Thr Ser Ser 
50 55 60 

His Ala Arg Pro Trp Ala Arg Val He Gly Leu Arg He Trp Pro Gin 
65 70 75 80 

Pro Asn 



(2) INFORMATION FOR SEQ ID NO: 531*: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

Met Leu Leu Ser Val Ala He Phe He Leu Leu Thr Leu Val Tyr Ala 
15 10 15 

Tyr Trp Thr Met 
20 



{2> INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
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Asn cys Glu He Leu Glu Tyr Cys Tyr Tyr Leu Thr Gin Leu Lys He 
^5 10 15 



5 Ser Met Gly Lys Tyr Leu Ser He Pro Thr Val Leu Leu Lys He He 
20 25 30 



10 



Arg Cys Ser He Thr Ala Val Ser Asp Ser Ser Thr Ser Trp Ala He 
35 40 45 

Lys Ala Gin Leu Lys He Glu Asn Lys Asp Leu Asp Asn Lys Thr Ala 
50 55 60 



Lys Gly Gly Gly Gin Glu Ala Leu Thr Cys Thr 
15 65 70 75 



20 



(2) INFORMATION FOR SEQ ID NO: 533: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 

Met Phe Leu Met Arg Met His Leu Cys Phe Cys Lys Tyr Cys Cys Ser 

15 10 15 

30 Phe He Val Thr Pro Thr Ser Thr Ser Asn Thr Xaa Ser Tyr Leu Trp 
20 25 30 

Pro Trp He Ser Ala Ser Met Ala Gly Arg Gly Ser Xaa Trp Ala Cys 

35 

Thr Leu Asn Ala Val Thr Arg Glu Gly Leu Pro Glu 
50 55 60 



40 

(2) INFORMATION FOR SEQ ID NO: 534: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQK3TH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 534: 

Met Ser Leu Leu Asn Thr His Thr Leu Cys Phe Val Leu Phe Cys Phe 
^5 10 15 

Thr Leu Ser He Asn Gin Glu Lys Leu Ala Asn His Leu Ala Phe Arg 
20 25 30 

55 He Leu Phe Phe He Val Phe 
35 



45 



50 



(2) INFORMATION FOR SEQ ID NO: 535: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 



Met Leu 
1 

10 



(2) INFORMATION FOR SEQ ID NO: 536: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 
<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536: 

20 

Met Asp Gin Phe Lys He Phe Tyr Phe Leu Lys Ala Phe Phe Ala Cys 
15 10 16 

Cys Asn Val Gin Asp Pro Ser Pro Phe Met Gly Glu Thr Gly Ser Tyr 
25 20 25 30 . . 

Leu Asn He Gly 
35 

30 

(2) INFORMATION FOR SEQ ID NO: 537: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 537: 

40 Met Phe Asp Phe Leu Ser Tyr Phe Lys Asp Leu Leu Ser Cys 
15 10 



45 (2) INFORMATION FOR SEQ ID NO: 538: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
50 (D) T0POLCX5Y: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538: 

Met Gly Phe Gly Phe Val Leu Asn He Phe Ser Phe Phe Leu Xaa Pro 

55 ' 

Pro Leu 



60 
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(2) INFORMATICS FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539: 

Leu Leu Leu Trp Thr Leu Leu Ala Xaa Tyr Xaa 
15 10 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540: 

Met Ala Ala Gin Lys Asp Gin Gin Lys Asp Ala Glu Ala Glu Gly Leu 
15 10 15 

Ser Gly Thr Thr Leu Leu Pro Lys Leu lie Pro Ser Gly Ala Gly Arg 
20 25 30 

Glu Trp Leu Glu Arg Arg Arg Ala Thr lie Arg Pro Trp Ser Thr Phe 
35 40 45 

Val Asp Gin Gin Arg Phe Ser Arg Pro Arg Asn Leu Gly Glu Leu Cys 
50 55 60 

Gin Arg Leu Val Arg Asn Val Glu Tyr Tyr Gin Ser Asn Tyr Val Phe 
65 70 75 80 

Val Phe Leu Gly Leu lie Leu Tyr Cys Val Val Thr Ser Pro Met Leu 
85 90 95 

Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Xaa 
100 105 



(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LB^GTH: 106 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541: 

Phe Val Phe Leu Gly Leu lie Leu Tyr Cys Val Val Thr Ser Pro Met 
15 10 15 

Leu Leu Val Ala Leu Ala Val Phe Phe Gly Ala Cys Tyr lie Leu Tyr 
20 25 30 

Leu Arg Thr Leu Glu Ser Lys Leu Val Leu Phe Gly Arg Glu Val Ser 
35 40 45 
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Pro Ala His Gin Tyr Ala Leu Ala Gly Gly lie Ser Phe Pro Phe Phe 
50 55 60 

5 Trp Leu Ala Gly Ala Gly Ser Ala Val Phe Trp Val Leu Gly Ala Thr 
65 70 75 80 

Leu Val Val lie Gly Ser His Ala Ala Phe His Gin lie Glu Ala Val 
85 90 95 

10 

Asp Gly Glu Glu Leu Gin Met Glu Pro Val 
100 105 



15 

(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 136 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY*: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

Met Asp Arg Phe Thr Val Ala Gly Val Leu Pro Asp lie Glu Gin Phe 
25 1 5 10 15 

Phe Asn lie Gly Asp Ser Ser Ser Gly Leu lie Gin Thr Val Phe lie 
20 25 30 

30 Ser Ser Tyr Met Val Leu Ala Pro Val Phe Gly Tyr Leu Gly Asp Arg 
35 40 45 

Tyr Asn Arg Lys Tyr Leu Met Cys Gly Gly lie Ala Phe Trp Ser Leu 

50 55 60 

35 

Val Thr Leu Gly Ser Ser Phe lie Pro Gly Glu His Phe Trp Leu Leu 
65 70 75 80 

Leu Leu Thr Arg Gly Leu Val Gly Val Gly Glu Ala Ser Tyr Ser Thr 
40 85 90 95 

lie Ala Pro Thr Leu lie Ala Asp Leu Phe Val Ala Asp Gin Arg Thr 
100 105 110 

45 Gly Cys Ser Ala Ser Ser Thr Leu Pro Phe Arg Trp Ala Val Val Trp 
115 120 125 

Ala Thr Leu Gin Ala Pro Lys Xaa 
130 135 

50 



(2) INFORMATION FOR SEQ ID NO: 543: 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543: 

60 
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Met Ala Gly Asp Trp His Trp Ala Leu Arg Val Thr Pro Gly Leu Gly 
15 10 15 

Val Val Ala Val Leu Leu Leu Phe Leu Val Val Arg Glu Pro Pro Arg 
5 20 25 30 

Gly Ala Val Glu Arg His Ser Asp Leu Pro Pro Leu Asn Pro Thr Ser 
35 40 45 

10 Trp Trp Ala Asp Leu Arg Ala Leu Ala Arg Asn Pro Ser Phe Val Leu 

50 55 60 

Ser Ser Leu Gly Phe Thr Ala Val Ala Phe Val Thr Gly Ser Leu Ala 
65 70 75 80 

15 

Leu Trp Ala Pro Ala Phe Leu Leu Arg Ser Arg Val Val Leu Gly Glu 
85 90 95 

Thr Pro Pro Cys Leu Pro Gly Asp Ser Cys Ser Ser Ser Asp Ser Leu 
20 100 105 110 

He Phe Gly Leu He Thr Cys Leu Thr Gly Val Leu Gly Val Gly Leu 
115 120 125 

25 Gly Val Glu He Ser Arg Arg Xaa Arg His Ser Asn Pro Arg Ala Asp 
130 135 140 

Pro Leu Val Cys Ala Thr Gly Leu Leu Gly Ser Ala Pro Phe Leu Phe 
145 150 155 160 

30 

Leu Ser Leu Ala Cys Ala Arg Gly Ser He Val Ala Thr Tyr He Phe 
165 170 175 

He Phe He Gly Glu Thr Leu Leu Ser Met Asn Trp Ala He Val Ala 
35 180 185 190 

Asp He Leu Leu Tyr Val Val He Pro Thr Arg Arg Ser Thr Ala Glu 
195 200 205 

40 Ala Phe Gin He Val Leu Ser His Leu Leu Gly Asp Ala Gly Ser Pro 
210 215 220 

Tyr Leu He Gly Leu He Ser Asp Arg Leu Arg Arg Asn Trp Pro Pro 

225 230 235 240 

45 

Ser Phe Leu Ser Glu Phe Arg Ala Leu Gin Phe Ser Leu Met Leu Cys 
245 250 255 

Ala Phe Val Gly Ala Leu Gly Gly Ala Leu Ser Trp Ala Pro Xaa Ser 
50 260 265 270 

Ser Leu Arg Pro Thr Ala Gly Gly His Ser Cys Thr Cys Arg Ala Cys 
275 280 285 

55 Cys Thr Lys Gin Gly Pro Gin Thr Thr Gly Leu Trp Cys Pro Ser Gly 
290 295 300 

Ala Ala Pro Pro Ala Cys Pro Trp Pro Val Cys Ser Ser Glu Arg Leu 
305 310 315 320 

60 
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Pro Leu Thr Tyr Leu His lie Cys His Ser Xaa Pro Trp Ala His Pro 
325 330 335 

Thr Lys Gly Leu Gly Leu Thr Pro Trp Pro Gly Pro Ala Ser Arg Gly 
5 340 345 350 

Thr Leu Gly Arg Val Pro Ala Pro Arg His Tyr Xaa Gly Ser Ser Gly 
355 360 365 

10 Glu Glu Val Gly Val Gin Glu Gly Asp Pro Ser Pro Gin Gly Xaa Pro 
370 375 380 

Gin Gly Leu Gly Ala He Cys Asn Gly He Lys Phe Val Ala Arg Pro 
385 390 395 400 

15 

Gin Val Pro Ala Leu Val Phe Leu Trp Val Ala Ser Asp Leu Ala Pro 
405 410 415 

Arg Leu His Pro Arg Ala Pro Glu 
20 420 



25 



40 



(2) INFORMATIC»l FOR SEQ ID NO: 544: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

Met Phe Arg Phe Val He Cys Leu Phe Leu Trp Leu Val Leu Cys Arg 
15 10 15 

35 Asp Ser Thr Ser Ala Ser Arg He Ala Leu Tyr Tyr Arg He Val Phe 
20 25 30 



Leu He His Gin Cys Ser Ser 
35 



(2) INFORMATION FOR SEQ' ID NO: 545: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 

50 

Met Leu Pro Trp Xaa Ala Gin Leu Leu Asp Arg Thr He Gly Pro Leu 
15 10 15 

Tyr Leu Leu Phe Val Gin Phe Ser Pro Ala Phe Ser Arg Thr Ser Pro 
55 20 25 30 

Trp Arg Ser Pro Lys Asn Phe Arg Arg Leu Tyr Pro Pro Cys Thr Thr 
35 40 45 

60 Ser Gly Cys Ala Ala Arg Trp Leu Phe Ser 
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50 55 



5 (2) INFOPMATION FOR SBQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 cimino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 



15 



20 



25 



Met Gly Leu Ser Val Leu Leu Pro Leu Cys Leu Leu Gly Pro Gly Arg 
15 10 15 

Phe Thr Ser Gly Gin Lys Pro Leu Asp Thr Pro Gly Leu Gly Val Pro 
20 25 30 

Phe 



(2) INFORMATION FOR SEQ ID NO: 547: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 

Met Ala Lys Pro Gin Val Val Val Ala Pro Val Leu Met Ser Lys Leu 
15 10 15 

35 Ser Val Asn Ala Pro Glu Phe Tyr Pro Ser Gly Tyr Ser Ser Ser Tyr 
20 25 30 

Thr Glu Ser Tyr Glu Asp Gly Cys Glu Asp Tyr E>ro Thr Leu Ser Glu 
35 40 45 

40 

Tyr Val Gin Asp Phe Leu Asn His Leu Thr Glu Gin Pro Gly Ser Phe 
50 55 60 

Glu Thr Glu lie Glu Gin Phe Ala Glu Thr Leu Asn Gly Cys Val Thr 
45 65 70 75 80 

Thr Asp Asp Ala Leu Gin Glu Leu Val Glu Leu He Tyr Gin Gin Ala 
85 90 95 

50 Thr Ser He Pro Asn Phe Ser Tyr Met Gly Ala Arg Leu Cys Asn Tyr 
100 105 110 

Leu Ser His His Leu Thr He Ser Pro Gin Ser Gly Asn Phe Arg Gin 
115 120 125 

55 

Leu Leu Leu Gin Arg Cys Arg Thr Glu Tyr Glu Val Lys Asp Gin Ala 
130 135 140 

Ala Lys Gly Asp Glu Val Thr Arg Lys Arg Phe His Ala Phe Val Leu 
60 145 150 155 160 
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Phe Leu Gly Glu Leu Tyr Leu Asn Leu Glu lie Lys Gly Thr Asn Gly 
165 170 175 

Gin Val Thr Arg Ala Asp lie Leu Gin Val Gly Leu Arg Glu Leu Leu 
180 185 190 

Asn Ala Leu Phe Ser Asn Pro Met Asp Asp Asn Leu He Cys Ala Val 
195 200 205 

Lys Leu Leu Lys Leu Thr Gly Ser Val Leu Glu Asp Ala Trp Lys Glu 
210 215 220 

Lys Gly Lys Met Asp Met Glu Glu He He Gin Arg He Glu Asn Val 
225 230 235 240 

Val Leu Asp Ala Asn Cys Ser Arg Asp Val Lys Gin Met Leu Leu Lys 
245 250 255 

Leu Val Glu Leu Arg Ser Ser Asn Trp Gly Arg Val His Ala Thr Ser 
260 265 270 

Thr Tyr Arg Glu Ala Thr Pro Glu Asn Asp Pro Asn Tyr Phe Met Asn 

275 280 285 

Glu Pro Thr Phe Tyr Thr Ser Asp Gly Val Pro Phe Thr Ala Ala Asp 
290 295 300 

Pro Asp Tyr Gin Glu Lys Tyr Gin Glu Leu Leu Glu Arg Glu Asp Phe 
305 310 315 320 

Phe Pro Asp Tyr Glu Glu Asn Gly Thr Asp Leu Ser Gly Ala Gly Asp 

325 330 335 

Pro Tyr Leu Asp Asp He Asp Asp Glu Met Asp Pro Glu He Glu Glu 
340 345 350 

Ala Tyr Glu Lys Phe Cys Leu Glu Ser Glu Arg Lys Arg Lys Gin 
355 360 365 



(2) INFORMATION FOR SEQ ID NO: 548: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC^: SEQ ID NO: 548: 

Met Leu Arg Leu Asp He He Asn Ser Leu Val Thr Thr Val Phe Met 
15 10 15 

Leu He Val Ser Val Leu Ala Leu He Pro Glu Thr Thr Thr Leu Thr 

20 25 30 

Val Gly Gly Gly Val Phe Ala Leu Val Thr Ala Val Cys Cys Leu Ala 
35 40 45 

Asp Gly Ala Leu He Tyr Arg Lys Leu Leu Phe Asn Pro Ser Gly Pro 
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55 



60 



Tyr Gin Lys Lys Pro Val His Glu Lys Lys Glu Val Leu 
65 70 75 



(2) INFORMATION FOR SEQ ID NO: 549: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 

Met Leu Lys Gin Val Met Phe Val Phe Ser Gly Met Gly Pro Arg Ser 
15 10 15 

His Cys Trp Gly Leu Pro Leu Ala Cys Gly Thr Phe Val Gin Gly His 
20 25 30 

Gin Ala Asp Ser Ser His Leu Leu Pro Leu Lys His Gin Gly Ala 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 

Met Leu Leu Ser Leu Ala Ala Phe Ser Val He Ser Val Val Ser Tyr 
15 10 15 

Leu He Leu Ala Leu Leu Ser Val Thr He Ser Phe Arg He Tyr Lys 
20 25 30 

Ser Val He Gin Ala Val Gin Lys Ser Glu Glu Gly His Pro Phe Lys 
35 40 45 

Ala Tyr Leu Asp Val Asp He Thr Leu Ser Ser Glu Ala Phe His Asn 
50 55 60 

Tyr Met Asn Ala Ala Met Val His He Asn Arg Ala Leu Lys Leu He 
65 70 75 80 

He Arg Leu Phe Leu Val Glu Asp Leu Val Asp Ser Leu Lys Leu Ala 
85 90 95 

Val Phe Met Trp Leu Met Thr Tyr Val Gly Ala Val Phe Asn Gly He 
100 105 110 

Thr Leu Leu He Leu Ala Glu Leu Leu He Phe Ser Val Pro He Val 
115 120 125 



Tyr Glu Lys Tyr Lys Thr Gin He Asp His Tyr Val Gly He Ala Arg 
130 135 140 
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Asp Gin Thr Lys Ser lie Val Glu Lys He Gin Ala Lys Leu Pro Gly 
145 150 155 160 

He Ala Lys Lys Lys Ala Glu Xaa 
165 



<2) INFOHMATIC»J FOR SEQ ID NO; 551: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 124 amino acids 

{B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551: 

Ser Val Pro Phe His Leu Leu Val Val Leu Arg Ser Arg Ala Val Arg 
15 10 15 

Ala Arg Arg Arg Arg Glu Pro Arg Ser Leu Pro Arg Pro Gly Asp Glu 
20 25 30 

Glu Leu Gin Leu Leu Leu Cys Gly Ala Arg Ser Asp Phe Leu Glu Arg 
35 40 45 

Cys Glu Glu Asp Trp Val Cys Leu Trp His His Ala Asp His Ala Ala 
50 55 60 

Phe Pro Gly Ser Phe Gin Cys His Gin Cys Gly Phe Leu Pro His Pro 
65 70 75 80 

Gly Ser Ser Leu Cys His His Gin Leu Gin Asp Leu Gin Val Arg His 
85 90 95 

Pro Ser Cys Thr Glu Val Arg Arg Arg Pro Ser He Gin Ser Leu Pro 

100 105 110 

Gly Arg Arg His Tyr Ser Val Leu Arg Ser Phe Pro 
115 120 



(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 177 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 

Met Val His Leu Leu Val Leu Ser Gly Ala Trp Gly Met Gin Met Trp 
15 10 15 

Val Thr Phe Val Ser Gly Phe Leu Leu Phe Arg Ser Leu Pro Arg His 
20 25 30 

Thr Phe Gly Leu Val Gin Ser Lys Leu Phe Pro Phe Tyr Phe His He 
35 40 45 
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Ser Met Gly Cys Ala Phe lie Asn Leu Cys lie Leu Ala Ser Gin His 
50 55 60 

Ala Trp Ala Gin Leu Thr Phe Trp Glu Ala Ser Gin Leu Tyr Leu Leu 
65 70 75 80 

Phe Leu Ser Leu Thr Leu Ala Thr Val Asn Ala Arg Trp Leu Glu Pro 
85 90 95 

Arg Thr Thr Ala Ala Met Trp Ala Leu Gin Thr Val Glu Lys Glu Arg 
100 105 110 

Gly Leu Gly Gly Glu Val Pro Gly Ser His Gin Gly Pro Asp Pro Tyr 
115 120 125 

Arg Gin Leu Arg Glu Lys Asp Pro Lys Tyr Ser Ala Leu Arg Gin Asn 
130 135 140 

Phe Phe Arg Tyr His Gly Leu Ser Ser Leu Cys Asn Leu Gly Cys Val 
145 150 155 160 

Leu Ser Asn Gly Leu Cys Leu Ala Gly Leu Ala Leu Glu lie Arg Ser 
165 170 175 

Leu 



(2) INFORMATION FOR SBQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC*!: SEQ ID NO: 553: 



Met Ala Phe He Leu Leu Phe Tyr 

1 5 

Glu Gin Asn Ser Ala Thr Val Glu 
20 

Leu Leu Gin Asn Cys Phe Glu I^eu 
35 40 

Thr Glu Gly He Pro Cys Ala Lys 
50 55 

Thr Trp Gin Thr Leu Lys Asn Ser 
65 70 



Cys Leu Met Thr Phe Leu Ser Leu 
10 15 

Pro Ser Ser His Glu He Leu His 
25 30 

Leu Arg Thr Ser Thr Ser Gin Cys 
45 

He Pro Glu Trp Val Thr His Leu 
60 



(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



5 
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680 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 554: 

Val Leu Arg lie lie Cys Leu Trp Pro Cys Gly Thr Thr Leu Pro Leu 
15 10 15 

Val Glu Lys Ala His Asp Ser His Ser Ala Asp Pro Val Cys Pro Gly 
20 25 30 



Leu Thr Ala His Leu Pro Val Leu Leu Tyr Val Gin Leu 
10 35 40 45 



15 



(2) INFORMATION FOR SEQ ID NO: 555: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555: 

Met Lys His Ala Asp Pro Arg lie Gin Gly Tyr Pro Leu Met Gly Ser 
15 10 15 

25 Pro Leu Leu Met Thr Ser lie Leu Leu Thr Tyr Val Tyr Phe Val Leu 
20 25 30 

Ser Leu Gly Pro Arg lie Met Ala Asn Arg Lys Pro Phe Gin Leu Arg 
35 40 45 



30 



Gly Phe Met lie Val Tyr Asn Phe Ser Leu Val Ala Leu Ser Leu Tyr 
50 55 60 



lie Val Tyr Glu Phe Leu Met Ser Gly Trp Leu Ser Thr Tyr Thr Trp 
35 65 70 75 80 

Arg Cys Asp Pro Gin Asp Cys Thr Leu Gly Gin Cys Pro Ser Val Pro 
85 90 95 

40 Ser Pro Xaa Thr Pro Val Thr Lys Ala Tyr Val Val Arg Thr Glu Gin 
100 105 110 

Gly Thr Gly Pro Pro Leu Pro Thr Ala Ala Leu Gin Gly Pro Arg Leu 

115 120 125 

45 

Trp Phe Leu Thr His Phe Pro Arg Ala Ala Pro Gly Met Trp Pro His 
130 135 140 

Cys Cys Leu Pro Leu Gin Ser Trp Gly Leu Lys Gly Leu Tyr Ser Tyr 
50 145 150 155 160 

Phe Pro Leu Pro Ala Leu Lys Leu Gly Arg Gly Ala Leu Arg Ala Gly 
165 170 175 

55 Pro Thr Lys Gly Leu Val Ala Phe Phe Leu Thr Gin Lys Arg Ser Ala 
180 185 190 

lie Met Ser Leu Trp Thr Gin Ser His Ser Ser Thr Pro His Thr Glu 
195 200 205 

60 
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Ala Val Ala Ser Gly Pro Lys Val Arg Val Gly Gly Gly Leu Gly He 
210 215 220 

Gin Pro Val Glu Ala Ala Tyr Ser Thr Cys Val Leu He Lys Ser Asp 
225 230 235 240 

Arg Gly Asn His Glu Lys Lys Lys Lys Lys Lys 
245 250 



(2) II^OPMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 556: 

Gly Leu Ala Gly Leu Cys Gly Gin Leu Ser Ser Pro Ala Leu Cys Val 
15 10 15 

Asn Arg Leu 



(2) INFORMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557: 

Met He Thr Glu Lys Trp Gly Leu Asn Met Glu Tyr Cys Arg Gly Gin 
15 10 15 

Ala Tyr He Xaa Ser Ser Gly Phe Ser Ser Lys Met Lys Val Val Ala 
20 25 30 

Ser Arg Leu Leu Glu Lys Tyr Pro Gin Ala He Tyr Thr Leu Cys Ser 
35 40 45 

Ser Cys Ala Leu Asn Met Trp Leu Ala Lys Ser Val Pro Val Met Gly 
50 55 60 

Val Ser Val Ala Leu Gly Thr He Glu Glu Val Cys Ser Phe Phe His 
65 70 75 80 

Arg Ser Pro Gin Leu Leu Leu Glu Leu Asp Asn Val He Ser Val Leu 
85 90 95 

Phe Gin Asn Ser Lys Glu Arg Gly Lys Glu Leu Lys Glu He Cys His 
100 105 110 

Ser Gin Trp Thr Gly Arg His Asp Ala Phe Glu He Leu Val Glu Leu 
115 120 125 



Leu Gin Ala Leu Val Leu Cys Leu Asp Gly He Asn Ser Asp Thr Asn 
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130 135 140 

lie Arg Trp Asn Asn Tyr lie Ala Gly Arg Ala Phe Val Leu Cys Ser 
145 150 155 160 

5 

Ala Val Ser Asp Phe Asp Phe He Val Thr He Val Val Leu Lys Asn 
165 170 175 

Val Leu Ser Phe Thr Arg Ala Phe Gly Lys Asn Leu Gin Gly Gin Thr 
10 180 185 190 

Ser Asp Val Phe Phe Ala Ala Gly Ser Leu Thr Ala Val Leu His Ser 
195 200 205 

15 Leu Asn Glu Val He Gly Lys Tyr Xaa 
210 215 



20 (2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 

Leu Leu Lys Val Leu Cys He Leu Pro Val Met Lys Val Glu Asn Glu 
15 10 15 

30 

Arg Tyr Glu Asn Gly Arg Lys Arg Leu Lys Ala Tyr Leu Arg Asn Thr 
20 25 30 

peu Thr Asp Gin Arg Ser Ser Asn Leu Ala Leu Leu Asn He Asn Phe 
35 35 40 45 

Asp He Lys His Asp Leu Asp Leu Met Val Asp Thr Tyr He Lys Leu 
50 55 60 

40 Tyr Thr Ser Lys Ser Glu Leu Pro Thr Asp Asn Ser Glu Thr Val Glu 
65 70 75 80 

Asn Thr 

45 



(2) INFORMATIC»I FOR SEQ ID NO: 559: 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

55 

Met Val Leu He Leu Leu Asn Leu Leu Leu Gly Gin Phe Ser Cys Met 
15 10 15 



Ser Pro Ala Ser His His Cys His Pro Leu Pro Thr Glu Met Pro Cys 
60 20 25 30 
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Ser Ser Asp Trp Gly Phe Asp Ser 
35 40 

Asp Ala Leu Leu Pro Lys Pro Ser 
50 55 

Cys His Cys Gin Gly Leu Tyr Asn 
65 70 

Ala Glu Gly Pro Ala Ser Leu Arg 
85 



His Thr Val Tyr Pro Ser Cys Val 
45 

Ala Asn Ser Phe Pro Asn Gly Ser 
60 

Gin Gin Gin Gin Asn Leu His Ala 
75 80 

Cys Asn Lys Tyr Val Ser Thr 
90 95 



(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acicSs 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560: 

Met lie Pro Ala Tyr Ser Lys Asn Arg Ala Tyr Ala He Phe Phe He 
15 10 15 

Val Phe Thr Val He Gly Asp Ala Pro Gly Ala Val Leu Ser Cys Ala 

20 25 30 

Gly His Pro Cys Val Gly Phe Ala Ala Val Leu Val Ala Pro Leu Thr 
35 40 45 

Val Ala Val Ser Ser Xaa 
50 



(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino aci(3s 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIOJ: SEQ ID NO: 561: 

Met Glu Val Pro Pro Pro Ala Pro Arg Ser Phe Leu Cys Arg Ala Leu 
15 10 15 

Cys Leu Phe Pro Arg Val Phe Ala Ala Glu Ala Val Thr Ala Asp Ser 

20 25 30 

Glu Val Leu Glu Glu Arg Gin Lys Arg Leu Pro Tyr Val Pro Glu Pro 
35 40 45 

Tyr Tyr Pro Glu Ser Gly Trp Asp Arg Leu Arg Glu Leu Phe Gly Lys 
50 55 60 

Asp Thr Val Asn Thr Ser Leu Asn Val Tyr Arg Asn Lys Asp Ala Leu 
65 70 75 80 
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Ser His Phe Val lie Ala Gly Ala Val Thr Gly Ser Leu Phe T^g He 
85 90 95 

Asn Val Gly Leu Arg Gly Trp Trp Leu Val Ala Xaa 
100 105 



(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 

Met Asn Trp Gly Leu Ser He Trp Leu His Tyr Tyr Glu Lys Lys Lys 
1 5 10 • 15 

Glu Gin Val Phe Leu Val He Leu Ala His Val Val Arg Arg Cys Ala 
20 25 30 

Ser Asp Gly He Leu Gin Phe Glu Ser Ser. Leu, Leu Lys Met Arg Arg 
35 40 45 

Ala Pro 
50 



(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563: 

Met Val Lys Val Cys Asn Asp Ser T^p Arg Trp Ser Leu He Ser Leu 
1 5 10 15 

Ser Asn Asn Ser Gly Lys Asn Val Glu Leu Lys Phe Val Asp Ser Leu 
20 25 30 

Arg Arg Gin Phe Glu Phe Ser Val Asp Ser Phe Gin He Lys Leu Asp 
35 40 45 

Ser Leu Leu Leu Phe Tyr Glu Cys Ser Glu Asn Pro Met Thr Glu Thr 
50 55 60 

Phe His Pro Thr He He Gly Glu Ser Val Tyr Gly Asp Phe Gin Glu 
65 70 75 80 

Ala Phe Asp His Leu Cys Asn Lys He He Ala Thr Arg Asn Pro Glu 
85 90 95 

Glu He Arg Gly Gly Gly Leu Leu Lys Tyr Cys Asn Leu Leu Val Arg 
100 105 110 

Gly Phe Arg Pro Ala Ser Asp Glu He Lys Thr Leu Gin Arg Tyr Met 
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115 120 125 

Cys Ser Arg Phe Phe lie Asp Phe Ser Asp lie Gly Glu Gin Gin Arg 
130 135 140 

5 

Lys Leu Glu Ser Tyr Leu Gin Asn His Phe Val Gly Leu Glu Asp Arg 
145 150 155 160 

Lys Tyr Glu Tyr Leu Met Thr Leu His Gly Val Val Asn Glu Ser Thr 
10 165 170 175 

Val Cys Leu Met Gly His Glu Arg Arg Gin Thr Leu Asn Leu lie Thr 
180 185 190 

15 Met Leu Ala lie Arg Val Leu Ala Asp Gin Asn Val lie Pro Asn Val 
195 200 205 

Ala Asn Val Thr Cys Tyr Tyr Gin Pro Ala Pro Tyr Val Ala Asp Ala 
210 215 220 

20 

Asn Phe Ser Asn Tyr Tyr lie Ala Gin Val Gin Pro Val Phe Thr Cys 
225 230 235 240 

Gin Gin Gin Thr Tyr Ser Thr Trp Leu Pro Cys Asn Xaa 
25 245 250 



(2) INFORMATION FOR SEQ ID NO: 564: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 564: 

Met Ser Phe Leu Met Trp Leu Met Ser Leu Ala lie Thr Ser Gin Pro 
15 10 15 

40 Pro Met 



45 (2) INFORMATICaa FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 
50 <D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

Met Ala Pro Lys Gly Lys Val Gly Thr Arg Gly Lys Lys Gin lie Phe 
15 10 15 

55 

Glu Glu Asn Arg Glu Thr Leu Lys Phe Tyr Leu Arg lie lie Leu Gly 

20 25 30 

Ala Asn Ala He Tyr Cys Leu Val Thr Leu Val Phe Phe Tyr Ser Ser 
60 35 40 45 
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Ala Ser Phe Trp Ala Trp Leu Ala Leu Gly Phe Ser Leu Ala Val Tyr 
50 55 60 

Gly Ala Ser Tyr His Ser Met Ser Ser Met Ala Arg Ala Ala Phe Phe 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOIiOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 



His Leu Lys Asp Val He Leu Leu Thr Ala lie Val Gin Val Leu Ser 
15 10 15 

Cys Phe Ser Leu Tyr Val Trp Ser Phe Trp Leu Leu Ala Pro Gly Arg 
20 25 30 

Ala Leu Tyr Leu Leu Trp Val Asn Val Leu Gly Pro Trp Phe Thr Ala 
35 40 45 

Asp Ser Gly Thr Pro Ala Pro Glu His Asn Glu Lys Arg Gin Arg Arg 
50 55 60 

Gin Glu Arg Arg Gin Met Lys Arg Leu 
65 70 



(2) INFORMATION FOR SEQ ID NO: 567: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: aitiino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTICW: SEQ ID NO: 567: 

Met Asp Cys Pro Ala Leu Pro Pro Gly Trp Lys Lys Glu Glu Val He 
15 10 15 

Arg Lys Ser Gly Leu Ser Ala Gly Lys Ser Asp Val Tyr Tyr Phe Ser 
20 25 30 

Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gin Leu Ala Arg Tyr Leu 
35 40 45 

Gly Asn Thr Val Asp Leu Ser Ser Phe Asp Phe Arg Thr Gly Lys Met 
50 55 60 

Met Pro Ser Lys Leu Gin Lys Asn Lys Gin Arg Leu Arg Asn Asp Pro 
65 70 75 80 
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Leu Asn Gin Asn Lys Gly Lys Pro Asp Leu Asn Thr Thr Leu Pro lie 
85 90 95 

Arg Gin Thr Ala Ser lie Phe Lys Gin Pro Val Thr Lys Val Thr Asn 
100 105 110 

His Pro Ser Asn Lys Val Lys Ser Asp Pro Gin Arg Met Asn Glu Gin 
115 120 125 

Pro Arg Gin Leu Phe Trp Glu Lys Arg Leu Gin Gly Leu Ser Ala Ser 
130 135 140 

Asp Val Thr Glu Gin He He Lys Thr Met Glu Leu Pro Lys Gly Leu 
145 150 155 160 

Gin Gly Val Gly Pro Gly Ser Asn Asp Glu Thr Leu Leu Ser Ala Val 
165 170 175 

Ala Ser Ala Leu His Thr Ser Ser Ala Pro He Thr Gly Gin Val Ser 
180 185 190 

Ala Ala Val Glu Lys Asn Pro Ala Val Trp Leu Asn Thr Ser Gin Pro 
195 200 205 

Leu Cys Lys Ala Phe He Val Thr Asp Glu Asp He Arg Lys Gin Glu 
210 215 220 

Glu Arg Val Gin Gin Val Arg Lys Lys Leu Glu Glu Ala Leu Met Ala 
225 230 235 240 

Asp He Leu Ser Arg Ala Ala Asp Thr Glu Glu Met Asp He Glu Met 
245 250 255 

Asp Ser Gly Asp Glu Ala Xaa 
260 



(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568: 

Met Met Arg Pro Phe Tyr Leu Leu Leu Pro Val Leu Cys Thr Gin Ala 
1 - 5 10 15 

Leu Arg Gin Ser Gin Gly Lys Ser Pro Leu Leu Trp Lys Arg Thr Leu 

20 25 30 

Leu Phe Gly Leu Thr His Leu Asn Pro Ser Ala Lys Leu Leu Leu Ser 
35 40 45 

Gin Met Lys Thr Ser Gly Asn Arg Lys Ser Glu Tyr Ser Lys Tyr Ala 
50 55 60 



Arg Asn Trp Lys Lys His 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

{D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569: 

Met Pro Val Thr Ser Lys Arg Thr Leu Phe Phe Pro Asp Pro Cys Ser 

15 10 15 

Tyr Asp Thr Pro Pro Pro Asp Cys His Cys His Ser Phe Arg Ala Glu 
20 25 30 

Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 

Met Asn Ser Arg Gly Met Txp Leu Thr Tyr Ala Leu Gly Val Gly Leu 
15 10 15 

Leu His lie Val Leu Leu Ser He Pro Phe Phe Ser Val Pro Val Ala 

20 26 30 

Trp Thr Leu Thr Asn He He His Asn Leu Gly Met Tyr Val Phe Leu 
35 40 45 

His Ala Val Lys Gly Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ser 
50 55 60 

Lys Ala Pro Asn Ser Leu Gly Thr Thr Gly Leu Trp Ser Thr Val Tyr 

65 70 75 80 

He Phe Thr Glu Val Phe His Asn Phe Ser Asn Asn Ser He Phe Ser 
85 90 95 

Gly Lys Phe Leu Tyr Glu Val Xaa 
100 



(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 
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Met Trp Leu Thr Tyr Ala Leu Gly Val Gly Leu Leu His lie Val Leu 
15 10 15 

5 Leu Ser lie Pro Phe Phe Ser Val Pro Val Ala Trp Thr Leu Thr Asn 
20 25 30 

lie He His Asn Leu Gly Met Tyr Val Phe Leu His Ala Val Lys Gly 
35 40 45 

10 

Thr Pro Phe Glu Thr Pro Asp Gin Gly Lys Ala Arg Leu Leu Thr His 
50 55 60 

Trp Glu Gin Leu Asp Tyr Gly Val Gin Phe Thr Ser Ser Arg Lys Phe 
15 65 70 75 80 

Phe Thr He Ser Pro He He Leu Tyr Phe Leu Ala Ser Phe Tyr Thr 
85 90 95 

20 Lys Tyr Asp Pro Thr His Phe He Leu Asn Thr Ala Ser Leu Leu Ser 
100 105 110 

Val Leu He Pro Lys Met Pro Gin Leu His Gly Val Arg He Phe Gly 
115 120 125 

25 

He Asn Lys Tyr 
130 



30 

(2) I^DFORMATION FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LE^X?rH: 32 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»J : SEQ ID NO: 572: 

Met Asn Lys Trp He Cys Glu Met His Cys Tyr Leu Val Leu Leu Ser 
40 1 5 10 15 

Val Cys Ser Pro Ser Ala Leu Arg Arg Val Arg His Thr Leu Ser Arg 
20 25 30 

45 



50 (2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
55 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 

Met Pro Val Leu Ser Leu Leu Cys Thr Leu He Val Ser Phe Gin Ser 
15 10 15 

60 
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Ala Asp Ser Cys Glu Val Phe Leu Asn Cys Ser Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 574: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574: 

Met Lys Val Ser Thr Met Leu Trp Phe Leu Cys Trp Glu Gin Ser His 
15 1 5 10 15 

Phe Leu Arg Glu Trp Glu Asp Leu Ser Thr Phe Leu He Leu He Gin 
20 25 30 

20 Met Glu Cys Gin Tyr Gly Asn Ser 
35 40 



25 (2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 



35 



40 



55 



Met Gly Leu Pro Leu Met Ala Leu Met Trp Ser Thr Leu Pro Ala Ser 
15 10 15 

Ala Gly Val Asn Phe He Leu Ala Leu Pro Leu Leu Xaa Leu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 576: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 

Met Lys Arg Gly Cys Leu Gly Leu Leu Phe Phe Ser Cys Cys Ser Ser 
50 1 5 10 15 

Ala Pro Thr Met Leu Leu Cys Asp Tyr Leu Asn Trp Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 577: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 amino acids 
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(B) TYPE: amino acid 
<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

5 Met Lys Leu Leu Leu Gly He Ala Leu Leu Ala Tyr Val Ala Ser Val 
15 10 15 

Trp Gly Asn Phe Val Asn Met Arg Ser He Gin Glu Asn Gly Glu Leu 
20 25 30 

10 

Lys He Glu Ser Lys lie Glu Glu Met Val Glu Pro Leu Arg Glu Lys 
35 40 45 

He Arg Asp Leu Glu Lys Ser Phe Thr Gin Lys Tyr Pro Pro Val Lys 
15 50 55 60 

Phe Leu Ser Glu Lys Asp Arg Lys Arg He Leu Xaa Asn Arg Arg Arg 
65 70 75 80 

20 Xaa Val Arg Gly Leu Pro Ser Xaa Leu Thr Asn Ser 
85 90 



25 (2) INFORMATION FOR SBQ ID NO: 578: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578: 



35 



Met Lys Phe Ser Leu Val Leu Leu He Lys He He Ser Phe Glu Arg 
15 10 15 

Leu Leu He Phe Leu Phe Pro Leu Ser Phe Leu Pro Asn He Trp Arg 

20 25 30 



Arg Val Met Val Asn Leu Asn He Leu Phe 
40 35 40 



45 



(2) INFORMATION FOR SEQ ID NO: 579: 



{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

Leu Ala Gin Glu Cys Pro Pro His He Pro Ser Ser Phe Phe Leu Val 
15 10 15 

55 Lys Leu Leu Phe He Pro Trp Leu Ala Ser Leu Leu Pro Pro Leu Ser 
20 25 30 

Thr Phe Thr Ser Asp Phe Tyr Phe Met Glu Phe Gly He Glu Val Lys 
35 40 45 

60 
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Leu Gin Gin Cys Arg Gin His Gin Val Leu Gin Glu Lys Asn Thr Lys 
50 55 60 
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Lys Phe Asn Lys Lys Lys 
5 65 70 



(2) INFORMATION FOR SEQ ID NO: 580: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 

Met Leu Arg Leu Leu Leu Leu Val Ala Phe Ala Leu Val Val Val Leu 

1 5 10 15 

20 Phe His Val Leu Leu Ala Pro He Thr Ala Leu Phe His. Thr His Phe 
20 25 30 

Tyr Asp Arg Leu Gin Asp Ala Gly Ser Arg Trp Pro Glu Leu Tyr Leu 
35 40 45 

25 

Tyr Ser Arg Ala Asp Glu Val Val Leu Ala Arg Asp He Glu Arg Met 
50 55 60 

Val Glu Ala Arg Leu Ala Arg Arg Val Leu Ala Arg Ser Val Asp Phe 
30 65 70 75 80 

Val Ser Ser Ala His Val Ser His Leu Arg Asp Tyr Pro Thr Tyr Tyr 
85 90 95 

35 Thr Ser Leu Cys Val Asp Phe Met Arg Asn Cys Val Arg Cys 
100 105 110 



40 (2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LHTCTH: 30 amino acids 
(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581: 



50 



55 



Met Phe Lys Leu Glu Glu Cys Gly Lys Thr Thr Phe Leu Leu Ser Met 
1 5 * 10 15 

Ala Leu Tyr Phe Trp Trp He Val Gin Thr Thr Lys Gly Cys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 582: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 
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<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582: 

Met Glu Ser Asp Ala Leu Leu Leu Thr lie Phe Trp He He Ala Arg 
15 10 15 

Ser Ser Val Arg Ser Val Gly Lys Ser Ser Gin Arg Ser Phe Thr Thr 
20 25 30 

He Thr Gin Leu Arg Ser Thr His Thr Gly Pro Ser Arg Arg Ser Tyr 
35 40 45 

Leu He Trp Trp Asn Gly Gly Pro Lys Arg Thr He Ser Tyr Val Ser 
50 55 60 

Arg Arg Phe Arg Ser Phe Arg 
65 70 



(2) INFORMATION FOR SBQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583: 

Val Gly Leu Phe Gin Pro Lys Thr Phe Gin Val Pro Val Thr Asp Leu 
15 10 15 

Tyr He Phe He Lys He Tyr Ser Glu He Gly Pro He Met His Val 
20 • 25 30 

Leu Cys Pro Gly Tyr Ser Gin Ser Pro Ser Thr Pro Pro Trp Thr 
35 40 45 



(2) INFORMATICS FOR SEQ ID NO: 584: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

<B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584: 

Met Trp Phe Gly Ser Asp Arg Ser Asp Leu Arg He Gly Thr Ala Phe 
15 10 15 

Leu Phe Asp Leu Val Cys Asp Leu Cys He His Ala Trp Lys Pro Pro 
20 25 30 

Gly Leu Val Arg Phe Ser Phe 
35 



(2) INFORMATION FOR SEQ ID NO: 585: 



25 



30 



50 



55 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO: 585: 



Met Leu Asn Thr Ala Ser Leu Asn Leu Pro Trp Lys Val Gin Leu Phe 
1 5 10 15 

10 Ala His Ala 



15 (2) INFORMATICaJ FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 



Met Ser Ala Cys Leu Leu Leu Phe Leu Ala Phe Ser Trp Lys Arg Lys 
15 10 15 

Gly Leu Trp Ser Gly Pro Gly 
20 



(2) INFORMATION FOR SEQ ID NO: 587: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 
35 (B) TYPE: amino acid 

<D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 587: 

Met Leu Pro Pro Phe Ser Leu Val Tyr Thr His Phe Leu Val Ala Ser 
40 1 5 10 15 

Leu Leu Pro Val lie Leu Ala Val Phe Pro Asp Ser Ala Gin He Val 
20 25 30 

45 Pro Leu Leu Lys Pro He Pro Arg Pro Gin Pro Glu Val He Phe Pro 
35 40 45 



Ser Ser Glu Leu Leu Glu Gin Leu Leu Ser Val Gin Phe Val Trp Gin 
50 55 60 

Ala His Thr Val Ala 
65 



(2) INFORMATION FOR SEQ ID NO: 588: 



60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 
<B) TYPE: amino acid 
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(D) TOPOIiOGY: linear 
(xi) SEQUENCE IffiSCRIPTKM) : SBQ ID NO: 588: 

Met Gly Pro Pro Met Leu Gin Glu He Ser Asn Leu Phe Leu lie Leu 
5 1 5 10 

Leu Met Met Gly Ala He Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu 
20 25 30 

10 ser Thr Cys lie Pro Ala He Val Cys Leu Gly Phe Leu Leu Leu Leu 
35 40 « 



Asn val Gly Gin Leu Leu Ala Gin Thr Lys Lys Val Val Arg Pro Thr 
50 55 60 

Arg Lys Lys Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys 
65 70 •'S 



20 

(2) INFORMATICai FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEWJTH: 155 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 

Met Ala Leu Leu Leu Ser Val Leu Arg Val Leu Leu Gly Gly Phe Phe 
30 1 5 10 1^ 

Ala Leu val Gly Leu Ala Lys Leu Ser Glu Glu He Ser Ala Pro Val 
20 25 30 

35 Ser Glu Arg Met Asn Ala Leu Phe Val Gin Phe Ala Glu Val Phe Pro 
35 40 45 

Leu Lys val Phe Gly Tyr Gin Pro Asp Pro Leu Asn Tyr Gin lie Ala 
50 55 60 



40 



val Gly Phe Leu Glu Leu Leu Ala Gly Leu Leu Leu Val Met Gly Pro 
65 70 



75 80 



Pro Met Leu Gin Glu He Ser Asn Leu Phe Leu He Leu Leu Met Met 
45 85 90 95 

Gly Ala He Phe Thr Leu Ala Ala Leu Lys Glu Ser Leu Ser Thr Cys 
100 105 110 

50 He Pro Ala He Val Cys Leu Gly Phe Leu Leu Leu Leu Asn Val Gly 
115 120 125 

Gin Leu Leu Ala Gin Thr Lys Lys Val Val Arg Pro Thr Arg Lys Lys 
130 135 140 



55 



Thr Leu Ser Thr Phe Lys Glu Ser Trp Lys Xaa 
145 150 155 



60 
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(2) UOFORMATIQN FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 

Met Pro Glu Thr Arg LpCU Gly His Arg Gin Gin Phe Ala Val Phe His 
10 1 5 10 15 



15 



30 



35 



55 



Leu Xaa Pro Val Pro Pro Cys Gly 
20 



<2) INFORMATION FOR SEQ ID NO: 591: 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 

25 Met Leu Thr Phe Leu Phe Ser Ala Cys Ala Thr Cys Leu Gly Lys Leu 
15 10 15 

Ala Ser Pro Leu Ala Pro Val Gly Pro Gin Gin Arg Gly Xaa Pro Pro 
20 25 30 



Gly Pro Pro Leu Leu Ser 
35 



(2) INFORMATION FOR SEQ ID NO: 592: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592: 

Met Asp Pro Phe His Tyr Asp Tyr Gin Thr Leu Arg He Gly Gly Leu 
45 1 5 10 15 

Val Phe Ala Val Val Leu Phe Ser Val Gly He Leu Leu He Leu Ser 
20 25 30 

50 Arg Arg Cys Lys Cys Ser Phe Asn Gin Lys Pro Arg Ala Pro Gly Asp 
35 40 45 



Glu Glu Ala Gin Val Glu Asn Leu He Thr Ala Asn Ala Thr Glu Pro 
50 55 60 

Gin Lys Ala Glu Asn 
65 



60 
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(2) INFORMATIC»I FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 

Asn Leu Arg Val Arg Leu Gly Asp Val lie Ser He Gin Pro Cys Pro 
10 1 5 10 IS 

Asp Val Lys Tyr Gly Lys Arg He His Val Leu Pro He Asp Asp Thr 
20 25 30 

15 Val Glu Gly He Thr Gly Asn Leu Phe Glu Val Tyr Leu Lys Pro Tyr 
35 40 45 

Phe Leu Glu Ala Tyr Arg Pro He Arg Lys Gly Asp He Phe Leu Val 
50 55 60 

20 

Aro Gly Gly Met Arg Ala Val Glu Phe Lys Val Val Glu Thr Asp Pro 
65 70 76 80 

Ser Pro Tyr Cys He Val Ala Pro Asp Thr Val He His Cys Glu Gly 
25 85 90 95 

Glu Pro He Lys Arg Glu Asp Glu Glu Glu Ser Leu Asn Glu Val Gly 
100 105 110 

30 Tyr Asp Asp He Gly Gly Cys Arg Lys Gin Leu Ala Gin He Lys Glu 
115 120 125 

Met Val Glu Leu Pro Leu Arg His Pro Ala Leu Phe Lys Ala He Gly 
130 135 140 

35 

val Lys Pro Pro Arg Gly He Leu Leu Tyr Gly Pro Pro Gly Thr Gly 
145 150 155 160 

Lys Thr Leu He Ala Arg Ala Val Ala Asn Glu Thr Gly Ala Phe Phe 
40 165 170 175 

Phe Leu He Asn Gly Pro Glu He Met Ser Lys Leu Ala Gly Glu Ser 
180 185 190 

45 Glu ser Asn Leu Arg Lys Ala Phe Glu Glu Ala Glu Lys Asn Ala Pro 
195 200 205 

Ala He He Phe He Asp Glu Leu Asp Ala He Ala Pro Lys Arg Glu 
210 215 220 

Lys Thr His Gly Glu Val Glu Arg Arg He Val Ser Gin Leu Leu Thr 
225 230 235 240 

Leu Met Asp Gly Leu Lys Gin Arg Ala His Val He Val Met Ala Ala 
55 245 250 255 

Thr Asn Arg Pro Asn Ser He Asp Pro Ala Leu Arg Arg Phe Gly Arg 
260 265 270 

60 Phe Asp Arg Glu Val Asp He Gly He Pro Asp Ala Thr Gly Arg Leu 
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275 280 285 

Glu He Leu Gin He His Thr Lys Asn Met Lys Leu Ala Asp Asp Val 
290 295 300 

Asp Leu Glu Gin 
305 



10 



25 



40 



(2) INFORMATION FOR SEQ ID NO: 594: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENOTH: 22 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 

Met Gin He Lys Leu Leu Lys Ser Val Lys Thr Val Phe Ala He Thr 
20 1 5 10 15 

Leu Leu Val Leu Phe Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 595: 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: . 595: 

35 Met Phe Pro Lys Phe Cys Pro He Leu Ser Leu Val Asp Phe He Ser 
15 10 



His Arg Asp Lys Pro Glu Thr Glu 
20 



(2) INFORMATION FOR SEQ ID NO: 596: 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596: 

Met Leu He Glu Cys Ala Trp Gin Leu Met Phe Leu Leu Leu Lys Val 
15 10 15 



Glu Gin Leu Gly He Leu Asp Lys 
55 20 



(2) INFORMATION FOR SEQ ID NO: 597: 

60 
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(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
5 (xi) SEQUENCE DESCRIPTION: SBQ ID NO: 597: 

Met 
1 

10 

(2) INFORMATION FOR SEQ ID NO: 598: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE WBSCRIPTIQN: SEQ ID NO: 598: 

20 Met Cys He Met Ser Ala Leu Val 
1 5 



25 (2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LQ^GTU: 25 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599: 



35 



yiet Phe Leu Val Trp Phe Phe Trp Gly Leu He Ser Ala Leu Ser Asn 
^5 10 15 



Val His Thr Pro Ser Arg Leu Pro Ala 
20 25 



40 



45 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SBQUEN(3: DESCRIPTIC»^: SBQ ID NO: 600: 

Met Xaa Gly Leu Ser Leu He Leu Thr Val Thr Leu Leu Ala Val Ser 
^5 10 15 

Asp Ser Ala Ala Thr Cys He Val Ala Lys Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 601: 



60 



(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 



5 Met Trp Thr Arg Ser Ser Arg Cys Leu Leu Leu Cys lie Pro Gly Xaa 
15 10 15 



10 



Ser Arg Arg Arg Arg Ala Gly Ser Gly Met Lys Pro Arg Ser Trp Ser 
20 25 30 

Ala Trp Arg Pro Ser Gly Gly Thr Gly Thr Ser Ser Ser Gin Ser Ser 
35 40 45 



Thr Gin Ser Arg Thr Leu Ser Ala Thr Ala Ser Pro Ala 
15 50 55 60 



20 



(2) INFORMATION FOR SEQ ID NO: 602: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602: 

Met Arg Glu Thr Ser lie Arg Val Leu Leu Met Leu Pro Ala Leu Glu 
15 10 15 

30 ger Thr Ser Gly Leu Ser Ala Phe Met Gly Leu Gly Thr 
20 25 



35 (2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603: 

Met Pro Pro Lys Gin Glu Leu Gly Ser Gly Val Gly Glu Leu Ala Lys 
15 10 15 



45 



Asn Ser Lys Arg Gin His Trp Asn His Arg Trp Lys Lys Tyr Leu Lys 
20 25 30 



L^ lie Arg Trp Glu Asp Gly Leu Leu Leu Glu Gly Leu Leu Leu Val 
50 35 40 45 

Leu Glu His Cys Ala Thr Met Ala Trp Asp Cys Leu Met Arg Leu Glu 
50 55 60 

55 Leu Leu Lys Arg Leu 
65 



60 (2) INFORMATICaa FOR SEQ ID NO: 604: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 

Lys He Val Tyr He Leu Gly Asn Pro Leu Lys Phe Asn Ser Arg Val 
15 10 15 

He His His Leu Val Leu Leu Gin 



(2) INFORMATION FOR SEQ ID NO: 605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

Met Asn Leu His Gin Arg Arg Leu Leu Leu He Gly His Leu Met Thr 
1 5 10 15 

Leu Val Lys Ala Ser Lys Ser Phe Ser Phe Thr Glu He Thr Ser Ser 
20 25 30 

Arg Lys Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 606: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»J: SEQ ID NO: 606: 

Leu Leu Gly Tyr Gly Leu Phe Gly His Cys He Val Leu Phe He Thr 
15 10 15 

Tyr Asn He His Leu His Ala Leu Phe Tyr Leu Phe Trp Leu Leu Val 

20 25 30 

Gly Gly Leu Ser Thr Leu Arg Met Val Ala Val Leu Val Ser Arg Thr 
35 40 45 

Val Gly Pro Thr Gin Arg Leu Leu Leu Cys Gly Thr Leu Ala Ala Leu 
50 55 60 

His Met Leu Phe Leu Leu Tyr Leu His Phe Ala Tyr His Lys Val Xaa 
65 70 75 80 

Glu Gly He Leu Asp Thr Leu Glu Gly Pro Asn He Pro Pro He Gin 
85 90 95 
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Arg Val Pro Arg Asp He Pro Ala Met Leu Pro Ala Ala Arg Leu Pro 
100 105 110 

Thr Thr Val Leu Asn Ala Thr Ala Lys Ala Val Ala Val Thr Leu Gin 
115 120 125 

Ser His • 
130 



(2) INFORMATION FOR SEQ ID NO: 607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

Met Leu Val He Phe Leu Phe Thr Ser Leu Leu Lys He Pro Ser Ser 
15 10 15 

Val Pro Gly Leu He Asn Val 
20 



(2) INFORMATION FOR SEQ ID NO: 60B: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 

Glu Leu Asp Tyr He Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 609: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: 

Met Ala Pro Pro Gly Trp Gin Xaa Xaa Xaa Xaa Xaa Trp Leu Ala Cys 

15 10 15 

Pro Asp Arg Gly Glu Leu Ser Ser Arg Ser Pro Pro Cys Arg Leu Ala 
20 25 30 

Arg Trp Ala Glu Gly Asp Arg Glu Thr Arg Thr Cys Leu Leu Glu Leu 
35 40 45 



Ser Ala Gin Ser Trp Gly Gly Arg Phe Arg Arg Ser Ser Ala Val Ser 
50 55 60 
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Ala Gly Ser Pro Ser Arg Leu His Phe Leu Pro Gin Pro Leu Leu Leu 
65 70 75 80 

Arg Ser Ser Gly He Pro Ala Ala Ala Thr Pro Trp Pro Gin Pro Ala 
85 90 95 

Gly Leu Pro Val Arg Pro Thr Pro Thr Arg Thr Gly Glu Glu Asp Arg 
100 105 110 

Thr Leu Asp He Ser He Cys Thr Glu Val Leu Ala Gly Thr Glu Gin 

115 120 125 

Pro Pro Pro Pro Arg Met Thr Ser Pro Ser Ser Ser Pro Val Phe Arg 
130 135 140 

Leu Glu Thr Leu Asp Gly Gly Gin Glu Asp Gly Ser Glu Ala Asp Arg 
145 150 155 160 

Gly Lys Leu Asp Phe Gly Ser Gly Leu Pro Pro Met Glu Ser Gin Phe 

165 170 175 

Gin Gly Glu Asp Arg Lys Phe Ala Pro Ser Asp Lys Ser Gin Pro Pro 
180 185 190 

Thr Thr Glu Arg Glu Gin Val Pro Val Ser Arg He Gin Thr Asp Leu 
195 200 205 

Thr Glu He Gly Ser Ser Met focg Ser Pro Gly Val Ser Pro Arg He 
210 215 220 

Trp Leu Asp Phe Gin Ser Thr Xaa 
225 230 



(2) INFORMATIC»J FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 

Met Val Leu Leu Leu Leu Leu Ala Tyr Val Leu Leu Thr Tyr He Leu 
1 5 10 15 

Leu Leu Asn Met Leu He Ala Leu Met Xaa Arg Asp Arg Gin Gin Cys 
20 25 30 

Arg His 



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 

Met Val Phe Glu Gly Phe Ser Ser Ala Phe Cys Leu Ser Ser Thr Ala 
15 10 15 

Pro Thr Ser His Pro 
20 



(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 

Gly Lys Lys Asn Gin Leu I*eu Val lie 
1 5 



(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

Met Val Trp Val Leu Trp Ser Ala Pro Ser Leu Ala Pro Pro Trp Val 
15 10 15 

Gly Pro Cys Trp Pro Ser Thr Gly Asn Cys Cys Leu Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614: 

Met Ala Lys Arg Ser Pro Gly Gly Cys Gly Ser Gly Leu lie Leu Leu 
15 10 15 

Cys Cys Gin Pro Cys Arg Pro Thr Ser Ser Ala Pro Met Arg 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 615: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 

lie Thr lie Ala lie Gin Met lie Cys Leu Val Asn Xaa Glu Leu Tyr 
15 10 15 

Pro Thr Phe Val Arg Asn Xaa Gly Val Met Val Cys Ser Ser Leu Cys 
20 25 30 

Asp lie Gly Gly He He Thr Pro Phe He Val Phe Arg Leu Arg Glu 
35 40 45 

Val Trp Gin Ala Leu Pro Leu He Leu Phe Ala Val Leu Gly Leu Leu 
50 55 60 

Ala Ala Gly Val Thr Leu Leu Leu Pro Glu Thr Lys Gly Val Ala Leu 
65 70 75 80 

Pro Glu Thr Met Lys Asp Ala Glu Asn Leu Gly Picg Lys Ala Lys Pro 

85 90 95 

Lys Glu Asn Thr He Tyr Leu Lys Val Gin Thr Ser Glu Pro Ser Gly 
100 105 110 

Thr 



(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 

Thr Met Lys Asp Ala Glu Asn Leu Gly Arg Lys Ala Lys Pro Lys Glu 
1 5 . 10 15 

Asn Thr 



(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 

Pro Arg Val Arg Asn Ser Pro Glu Asp Leu Gly Leu Ser Leu Thr Gly 
15 10 15 

Asp Ser Cys Lys Leu 
20 
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(2) INFORMATION FOR SEQ ID NO: 618: 

<i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 52 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTIC»I: SEQ ID NO: 618: 

10 Gin Ala Asp Asp Leu Gin Ala Thr Val Ala Ala Leu Cys Val Leu Arg 
15 10 15 

Gly Gly Gly Pro Trp Ala Gly Ser Trp Leu Ser Pro Lys Thr Pro Gly 

20 25 30 

15 

Ala Met Gly Gly Asp Leu Val Leu Gly Leu Gly Ala Leu Arg Arg Arg 
35 40 45 

Lys Arg Leu Leu 
20 50 



(2) INFORMATION FOR SEQ ID NO: 619: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 

Glu Gin Glu Lys Ser Leu Ala Gly Trp Ala Leu Val Leu Ala Xaa Xaa 
15 10 15 

35 Gly lie Gly Leu Met Val Leu His Ala Glu Met Leu Trp Phe Gly Gly 
20 25 30 

Cys Ser Ala Val Asn Ala Thr Gly His Leu Ser Asp Thr Leu Trp Leu 
35 40 45 

40 . - 

He Pro He Thr Phe Leu Thr He Gly Tyr Gly Asp Val Val Pro Gly 
50 55 60 

Thr Met Trp Gly Lys He Val Cys Leu Cys Thr Gly Val Met Gly Val 
45 65 70 75 80 

Cys Cys Thr Ala Leu Leu Val Ala Val Val Ala Arg Lys Leu Glu Phe 
85 90 95 

50 Asn Lys Ala Glu Lys His Val His Asn Phe Met Met Asp He Gin Tyr 
100 105 110 

Thr Lys Glu Met Lys Glu Ser Ala Ala Arg Val Leu Gin Glu Ala Trp 
115 120 125 

55 

Met Phe Tyr Lys His Thr Arg Arg Lys Glu Ser His Ala Ala Arg Xaa 
130 135 140 



His Gin Arg Xaa Leu Leu Ala Ala He Asn Ala Phe Arg Gin Val Arg 
60 145 150 155 160 
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Leu Lys His Arg Lys Leu Arg Glu Gin Val Asn Ser Met Val Asp lie 
165 170 175 

Ser Lys Met His Met lie Leu Tyr Asp Leu Gin Gin Asn Leu Ser Ser 
180 185 190 

Ser His Arg Ala Leu Glu Lys Gin He Asp Thr Leu Ala Gly Lys Leu 

195 200 205 

Asp Ala Leu Thr Glu Leu Leu Ser Thr Ala Leu Gly Pro Arg Gin Leu 
210 215 220 

Pro Glu Pro Ser Gin Gin Ser Lys 
225 230 



(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 

Tyr Gin Ala His His Val Ser Arg Asn Lys Arg Gly Gin Val Val Gly 
15 10 15 

Thr Arg Gly Gly Phe Arg Gly Cys Thr Val Trp Leu Thr Gly Leu Ser 
20 25 30 

Gly Ala Gly Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 

Leu Gin Cys Glu He Cys Gly Phe Thr Cys Arg Gin Lys Ala Ser Leu 

1 5 10 15 

Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gin Phe Ser 
20 25 30 

Cys Asn He Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala 
35 40 45 

His Lys Ala Lys Ser His Pro Glu Val 
50 55 



(2) INFORMATION FOR SEQ ID NO: 622: 
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(i) SEQUENCE CHARACTERISTICS: 

<A) IjENGTH: 19 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 622: 

He Thr Ser Thr Asp He Leu Gly Thr Asn Pro Glu Ser Leu Thr Gin 
15 10 15 

Pro Ser Asp 



(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 623: 

Asn Ser Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser 
15 10 15 

Lys Ser Tyr 



(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»a: SBQ ID NO: 624: 

Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys He Phe 
15 10 15 

Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met Asn Ser 
20 25 30 

Asp He Leu Gly Ala Thr Thr Glu Val Leu He Glu Asp Ser Asp Ser 
35 40 45 

Ala Gly Pro 
50 



(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE OIARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625: 
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He Gin Tyr Val Arg Cys Glu Met Glu Gly Cys Gly Thr Val Leu Ala 
15 10 15 

His Pro Arg Tyr Leu Gin His His He Lys Tyr Gin His Leu Leu Lys 
20 25 30 

Lys Lys Tyr Val Cys Pro His Pro Ser Cys Gly Arg Leu Phe Arg Leu 
35 40 45 

Gin Lys Gin Leu Leu Arg His Ala Lys His His Thr 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 626: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626: 

Asp Gin Arg Asp Tyr He Cys Glu Tyr Cys Ala Arg Ala Phe Lys Ser 
15 10 15 

Ser His Asn Leu Ala Val His Arg Met He His Thr Gly Glu Lys 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 

Arg Ser Ser Arg Ser Lys Thr Gly Ser Leu Gin Leu He Cys Lys Ser 
15 10 15 

Glu Pro Asn Thr Asp Gin Leu Asp Tyr 
20 25 



(2) INFORMATION FOR SEQ ID NO: 628: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628: 

Leu Gin Cys Glu He Cys Gly Phe Thr Cys Arg Gin Lys Ala Ser Leu 
15 10 15 



Asn Trp His Met Lys Lys His Asp Ala Asp Ser Phe Tyr Gin Phe Ser 
20 25 30 
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Cys Asn lie Cys Gly Lys Lys Phe Glu Lys Lys Asp Ser Val Val Ala 
35 40 45 

His Lys Ala Lys Ser His Pro Glu Val Xaa lie Thr Ser Thr Asp lie 
50 55 60 

Leu Gly Thr Asn Pro Glu Ser Leu Thr Gin Pro Ser Asp Xaa Asn Ser 
65 70 75 80 

Thr Ser Gly Glu Cys Leu Leu Leu Glu Ala Glu Gly Met Ser Lys Ser 
85 90 95 

Tyr Xaa Cys Ser Gly Thr Glu Arg Val Ser Leu Met Ala Asp Gly Lys 
100 105 110 

He Phe Val Gly Ser Gly Ser Ser Gly Gly Thr Glu Gly Leu Val Met 
115 120 125 

Asn Ser Asp He Leu Gly Ala Thr Thr Glu Val Leu He Glu Asp Ser 
130 135 140 

Asp Ser Ala Gly Pro Xaa Gin Arg Asp Tyr He Cys Glu Tyr Cys Ala 
145 150 155 160 

Arg Ala Phe Lys Ser Ser His Asn Leu Ala Val His Arg Met He His 
165 170 175 

Thr Gly Glu Lys His Tyr Xaa 
180 



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 



Gin Tyr Val Arg Cys Glu Met Glu 
1 5 

Pro Arg Tyr Leu Gin His His He 
20 

Lys Tyr Val Cys Pro His Pro Ser 
35 40 

Lys Gin Leu Leu Arg His Ala Lys 
50 55 



Gly Cys Gly Thr Val Leu Ala His 
10 15 

Lys Tyr Gin His Leu Leu Lys Lys 
25 30 

Cys Gly Arg Leu Phe Arg Leu Gin 
45 

His His Thr Asp 
60 



(2) INFORMATION FOR SEQ ID NO: 630: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 

Pro Phe Lys Asp Asp Pro Arg Asp Glu Thr Tyr Lys Pro His Leu Glu 
15 10 15 

Arg Glu Thr Pro Lys Pro Arg Arg Lys Ser Gly 
20 25 



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 

Glu Met Phe Asp Ser Leu Ser Tyr Phe Lys Gly Ser Ser Leu Leu Leu 
15 10 15 

Met Leu Lys Thr Tyr Leu Ser Glu Asp Val Phe Gin His Ala Val Val 
20 25 30 

Leu Tyr Leu His Asn His Ser Tyr Ala Ser lie Gin Ser Asp Asp Leu 
35 40 45 

Trp Asp Ser Phe Asn Glu Val Thr Asn Gin Thr Leu Asp Val Lys Arg 
50 55 60 

Met Met Lys Thr Trp Thr Leu Gin Lys Gly Phe Pro Leu Val Thr Val 
65 70 75 80 

Gin Lys Lys Gly Lys Glu Leu Phe He Gin Gin Glu Arg Phe Phe Leu 
85 90 95 

Asn Met Lys Pro Glu He Gin Pro Ser Asp Thr Arg Tyr Met 
100 105 110 



(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 

Leu Glu Lys Val Ala Ser Val Gly Asn Ser Arg Pro Thr Gly Gin Gin 
15 10 15 

Leu Glu Ser Leu Gly Leu Leu Ala 
20 



(2) INFORMATION FOR SEQ ID NO: 633: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 

Val His Arg Glu Glu Ala Ser Cys Tyr Cys Gin Ala Glu Pro Ser Gly 
^5 10 15 

Asp Leu 



(2) INFOPMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 

Arg Pro Ala Leu Arg Gin Ala Gly Gly Gly Thr Arg Glu Pro Arg Gin 
15 10 15 

Lys Arg Trp Ala Gly Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

Ala Val Asn Phe Arg Pro Gin Arg Ser Gin Ser Met 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTIC»«: SEQ ID NO: 636: 

Met He Thr Asp Val Gin Leu Ala He Phe Ala Asn Met Leu Gly Val 
15 10 15 

Ser Leu Phe Leu Leu Val Val Leu Tyr His Tyr Val Ala Val Asn Asn 
20 25 30 

Pro Lys Lys Gin Glu 
35 
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(2) INFORMATION FOR SEQ ID NO: 637: 

5 (i) SEQUENCE CHARACTERISTICS: 

<A) LENCrTH: 342 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

10 

Glu Glu Met Ala Asp Ser Val Lys Thr Phe Leu Gin Asp Leu Ala Arg 
15 10 15 

Gly He Lys Asp Ser He Trp Gly He Cys Thr He Ser Lys Leu Asp 
15 20 25 30 

Ala Arg He Gin Gin Lys Arg Glu Glu Gin Arg Arg Arg Arg Ala Ser 
35 40 45 

20 Ser Val Leu Ala Gin Arg Arg Ala Gin Ser He Glu Arg Lys Gin Glu 
50 55 60 

Ser Glu Pro Arg He Val Ser Arg He Phe Gin Cys Cys Ala Trp Asn 
65 70 75 80 

25 

Gly Gly Val Phe Trp Phe Ser Leu Leu Leu Phe Tyr Arg Val Phe He 
85 90 95 

Pro Val Leu Gin Ser Val Thr Ala Arg He He Gly Asp Pro Ser Leu 
30 100 105 110 

His Gly Asp Val Trp Ser Trp Leu Glu Phe Phe Leu Thr Ser He Phe 
115 120 125 

35 Ser Ala Leu Trp Val Leu Pro Leu Phe Val Leu Ser Lys Val Val Asn 
130 135 140 

Ala He Trp Phe Gin Asp He Ala Asp Leu Ala Phe Glu Val Ser Gly 
145 150 155 160 

40 

Arg Lys Pro His Pro Phe Pro Ser Val Ser Lys He He Ala Asp Met 
165 170 175 

Leu Phe Asn Leu Leu Leu Gin Ala Leu Phe Leu He Gin Gly Met Phe 
45 180 185 190 

Val Ser Leu Phe Pro He His Leu Val Gly Gin Leu Val Ser Leu Leu 
195 200 205 

50 His Met Ser Leu Leu Tyr Ser Leu Tyr Cys Phe Glu Tyr Arg Trp Phe 
210 215 220 

Asn Lys Gly He Glu Met His Gin Arg Leu Ser Asn He Glu Arg Asn 
225 230 235 240 

55 

Trp Pro Tyr Tyr Phe Gly Phe Gly Leu Pro Leu Ala Phe Leu Thr Ala 
245 250 255 



60 



Met Gin Ser Ser Tyr He He Ser Gly Cys Leu Phe Ser He Leu Phe 
260 265 270 
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Pro Leu Phe lie lie Sex Ala Asn Glu Ala Lys Thr Pro Gly Lys Ala 
275 280 285 

5 Tyr Leu Phe Gin Leu Arg Leu Phe Ser Leu Val Val Phe Leu Ser Asn 
290 295 300 

Arg Leu Phe His Lys Thr Val Tyr Leu Gin Ser Ala Leu Ser Ser Ser 
305 310 315 320 

10 

Thr Ser Ala Glu Lys Phe Pro Ser Pro His Pro Ser Pro Ala Lys Leu 
325 330 335 

Lys Ala Thr Ala Gly His 
IS 340 



(2) INFORMATION FOR SEQ ID NO: 638: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

Met Ala Lys Phe Met Thr Pro Val lie Gin Asp Asn Pro Ser Gly Trp 
15 10 15 

30 Gly Pro Cys Ala Val Pro Glu Gin Phe Arg Asp Met Pro Tyr Gin Pro 
20 25 30 

Phe Ser Lys Gly Asp Arg Leu Gly Lys Val Ala Asp Trp Thr Gly Ala 
35 40 45 

35 

Thr Tyr Gin Asp Lys Arg Tyr Thr Asn Lys Tyr Ser Ser Gin Phe Gly 
50 55 60 

Gly Gly Ser Gin Tyr Ala Tyr Phe His Glu Glu Asp Glu Ser Ser Phe 
40 65 70 75 80 

Gin Leu Val Asp Thr Ala Arg Thr Gin Lys Thr Ala Tyr Gin Arg Asn 

85 90 95 

45 Arg Met Arg Phe Ala Gin Arg Asn Leu Arg Arg Asp Lys Asp Arg Arg 
100 105 110 

Asn Met Leu Gin Phe Asn Leu Gin He Leu Pro Lys Ser Ala Lys Gin 
115 120 125 

50 

Lys Glu Arg Glu Arg He Arg Leu Gin Lys Lys Phe Gin Lys Gin Phe 
130 135 140 

Gly Val Arg Gin Lys Trp Asp Gin Lys Ser Gin Lys Pro Arg Asp Ser 
55 145 150 155 160 

Ser Val Glu Val Arg Ser Asp Trp Glu Val Lys Glu Glu Met Asp Phe 
165 170 175 



60 



Pro Gin Leu Met Lys Met Arg Tyr Leu Glu Val Ser Glu Pro Gin Asp 
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160 



185 



190 



He Glu Cys Cys Gly Ala Leu Glu Tyr Tyr Asp Lys Ala Phe Asp Arg 
195 200 205 

He Thr Thr Arg Ser Glu Lys Pro Leu Arg Xaa Xaa Lys Arg He Phe 
210 215 220 

His Thr Val Thr Thr Thr Asp Asp Pro Val He Arg Lys Leu Ala Lys 
225 230 235 240 

Thr Gin Gly Asn Val Phe Ala Thr Asp Ala He Leu Ala Thr Leu Met 

245 250 255 

Ser Cys Thr Arg Ser Val Tyr Ser Trp Asp He Val Val Gin Arg Val 
260 265 270 

Gly Ser Lys Leu Phe Phe Asp Lys Arg Asp Asn Ser Asp Phe Asp Leu 
275 280 285 

Leu Thr Val Ser Glu Thr Ala Asn Glu Pro Pro Gin Asp Glu Gly Asn 
290 295 300 

Ser Phe Asn Ser Pro Arg Asn Leu Ala Met Glu Ala Thr Tyr He Asn 
305 310 315 320 

His Asn. Phe Ser Gin Gin Cys Leu Arg Met Gly Lys Glu Arg Tyr Asn 
325 330 335 

Phe Pro Asn Pro Asn Pro Phe Val Glu Asp Asp Met Asp Lys Asn Glu 

340 345 350 

He Ala Ser Val Ala Tyr Arg Tyr Arg Ser Gly Lys Leu Gly Asp Asp 
355 360 365 

He Asp Leu He Val Arg Cys Glu His Asp Gly Val Met Thr Gly Ala 
370 375 380 

Asn Gly Glu Val Ser Phe He Asn He Lys Thr Leu Asn Glu Trp Asp 
385 390 395 400 

Ser Arg His Cys Asn Gly Val Asp Trp Arg Gin Lys Leu Asp Ser Gin 
405 410 415 

Arg Gly Ala Val He Ala Thr Glu Leu Lys Asn Asn Ser Tyr Lys Leu 
420 425 430 

Ala Arg Trp Thr Cys Cys Ala Leu Leu Ala Gly Ser Glu Tyr Leu Lys 
435 440 445 

Leu Gly Tyr Val Ser Arg Tyr His Val Lys Asp Ser Ser Arg His Val 
450 455 460 

He Leu Gly Thr Gin Gin Phe Lys Pro Asn Glu Phe Ala Ser Gin He 
465 470 475 480 

Asn Leu Ser Val Glu Asn Ala Trp Gly He Leu Arg Cys Val He Asp 
485 490 495 



He Cys Met Lys Leu Glu Glu Gly Lys Tyr Leu He Leu Lys Asp Pro 
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500 



505 



510 



Asn Lys Gin Val He Arg Val Tyr Ser Leu Pro Asp Gly Thr Phe Ser 
515 520 525 



Ser 



10 

(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLCXSY: linear 
(xi) SEQUENCE DESCRIPTICTJ : SEQ ID NO: 639: 

Lys Lys Arg His Thr Asp Val Gin Phe Tyr Thr Glu Val Gly Glu He 
20 1 5 10 15 

Thr Thr Asp Leu Gly Lys His Gin His Met His Asp Arg Asp Asp Leu 
20 25 30 

25 Tyr Ala Glu Gin Met Glu Arg Glu Met Arg His Lys Leu Lys Thr Ala 
35 40 45 

Phe Lys Asn Phe He Glu Lys Val Glu Ala Leu Thr Lys Glu Glu Leu 
50 55 60 

30 

Glu Phe Glu Val Pro Phe Arg Asp Leu Gly Phe Asn Gly Ala Pro Tyr 
65 70 75 80 

Arg Ser Thr Cys Leu Leu Gin Pro Thr Ser Ser Ala Leu Val Asn Ala 
35 85 90 95 

Thr Glu Trp Pro Pro Phe Val Val Thr Leu Asp Glu Val Glu Leu He 
100 105 110 

40 His Phe Xaa Arg Val Gin Phe His Leu Lys Asn Phe Asp Met Val He 
115 120 125 

Val Tyr Lys Asp Tyr Ser Lys Lys Val Thr Met He Asn Ala He Pro 
130 135 140 

45 

Val Ala Ser Leu Asp Pro He Lys Glu Trp Leu Asn Ser Cys Asp Leu 
145 150 155 160 

Lys Tyr Thr Glu Gly Val Gin Ser Leu Asn Trp Thr Lys He Met Lys 
50 165 170 175 

Thr He Val Asp Asp Pro Glu Gly Phe Phe Glu Gin Gly Gly Trp Ser 
180 185 190 

55 Phe Leu 



60 



<2) IMFORMATIOJ FOR SEQ ID NO: 640: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 640: 

Arg Ser Gly Leu Gly Leu Gly He Thr He Ala Phe Leu Ala Thr Leu 

He Thr Gin Phe Leu Val Tyr Asn Gly Val Tyr Gin Tyr Thr Ser Pro 
20 25 30 

Asp Phe Leu Tyr He Arg Ser Trp Leu Pro Cys He Phe Phe Ser Gly 
35 40 45 

Gly Val Thr Val Gly Asn He Gly Arg Gin Leu Ala Met Gly Val Pro 
50 55 60 

Glu Lys Pro His Ser Asp 
65 70 



(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641: 

Val Thr Gin Pro Lys His Leu Ser Ala Ser Met Gly Gly Ser Val Glu 
15 10 15 

He Pro Phe Ser Phe Tyr Tyr Pro Trp Glu Leu Ala Xaa Xaa Pro Xaa 
20 25 30 

Val Arg He Ser Trp Arg Arg Gly His Phe His Gly Gin Ser Phe Tyr 
35 40 45 

Ser Thr Arg Pro Pro Ser He His Lys Asp Tyr Val Asn Arg Leu Phe 
50 55 60 

Leu Asn Trp Thr Glu Gly Gin Glu Ser Gly Phe Leu Arg He Ser Asn 
^5 70 75 80 

Leu Arg Lys Glu Asp Gin Ser Val Tyr Phe Cys Arg Val Glu Leu Asp 
85 90 95 

Thr Arg Arg Ser Gly 
100 



(2) INFORMATION FOR SEQ ID NO: 642: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY; linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642: 

Met Glu Ala Gin Gin Val Asn Glu Ala Glu Ser Ala Arg Glu Gin Leu 
5 1 5 10 15 

Gin Xaa Leu His Asp Gin He Ala Gly Gin Lys Ala Ser Lys Gin Glu 
20 25 30 

10 Leu Glu Thr Glu Leu Glu Arg Leu Lys Gin Glu Phe His Tyr He Glu 
35 40 45 

Glu Asp Leu Tyr Arg Thr Lys Asn Thr Leu Gin Ser Arg He Lys Asp 
50 55 60 

Arg Asp Glu Glu He Gin Lys Leu Arg Asn Gin Leu Thr Asn Lys Thr 
65 70 75 80 

Leu Ser Asn Ser Ser Gin Ser Glu Leu Glu Asn Arg Leu His Gin Leu 
20 85 90 95 

Thr Glu Thr Leu He Gin Lys Gin Thr Met Leu Glu Ser Leu Ser Thr 
100 105 110 

25 Glu Lys Asn Ser Leu Val Phe Gin Leu Glu Arg Leu Glu Gin Gin Met 
115 120 125 



30 



45 



50 



Asn Ser Ala Ser Gly Ser Ser Ser Asn Gly Ser Ser He Asn Met Ser 
130 135 140 

Gly He Asp Asn Gly Glu Gly Thr Arg Leu Arg Asn Val Pro Val Leu 
145 150 155 160 



Phe Asn Asp Thr Glu Thr Asn Leu Ala Gly Met Tyr Gly Lys Val Arg 
35 165 170 175 

Lys Ala Ala Ser Ser He Asp Gin Phe Ser He Arg Leu Gly He Phe 
180 185 190 

40 Leu Arg Arg Tyr Pro He Ala Arg Val Phe Val He He Tyr Met Ala 

195 200 205 



Leu Leu His Leu Tip Val Met He Val Leu Leu Thr Tyr Thr Pro Glu 
210 215 220 

Met His His Asp Gin Pro Tyr Gly Lys 
225 230 



(2) INFORMATION FOR SEQ ID NO: 643: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCR1PTIC»I : SEQ ID NO: 643: 

He Arg His Glu Gin His Pro Asn Phe Ser Leu Glu Met His Ser Lys 
60 1 5 10 15 
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Gly Ser Ser Leu Leu Leu Phe Leu Pro Gin Leu He Leu He Leu Pro 
20 25 30 

5 Val Cys Ala His Leu His Glu Glu Leu Asn Cys 
35 40 



10 (2) INFORMATION FOR SBQ ID NO: 644: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 



20 



30 



Ser Phe Phe He Ser Glu Glu Lys Gly His Leu Leu Leu Gin Ala Glu 
15 10 15 

Arg His Pro Trp Val Ala Gly Ala Leu Val Gly Val Ser Gly Gly Leu 
20 25 30 



Thr Leu Thr Thr Cys Ser Gly Pro Thr Glu Lys Pro Ala Thr Lys Asn 
25 35 40 45 



Tyr Phe Leu Lys Arg Leu Leu Gin Glu Met His He Arg Ala Asn 
50 55 60 



35 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3to; 



A The indications made below relate to the microorganism referred to in the description 
116 ^ 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q) 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



I Accession Number 97897 



I Date of deposit February 26, 1 997 

^^i^rrinUXlAr^DlCATlOm Qeave blank if not appii^^^^^ This information ^continued on an additional sheet □ 



I n n^iSIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if ike indications are noiforalidesignaiedSiaies) 



E. SEPARATE FURNISHING OF INDICATIONS Oeave blank if not applicable) 



I The indications listed 
\ Number cf Deposit") 



below will be submitted to the Internation al Bureau later {specify the general namre ojtne ma^cations, e.g.. "Accession 



p^This sheet was received with the international application 


|— 1 This sheet was received by the Iniemational Bureau on: 


Authorized officer 

Susan White 

PCT International Division 


Authorized officer 



Form PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 116 > WA 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 1 5, 1 997 



Accession Number 209043 



C. ADDITIONAL INDICATIONS (leave blank {fnoi applicable) This information is continued on an additional sheet □ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specif the general nature of the indications, eg, "Accession 
Number of Deposit*^ 



For receiving Office use only . 



This sheet was received with the international application 



Authorized officer 



Susan White 

per IntemefHonal DMston 



i For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



FonnPCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 119 » ""c N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet Q 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville. Maryland 20852 
United States of America 



Date of deposit September 4, 1 997 



Accession Number 20923 5 ^ 



C. ADDITIONAL INDICATIONS (leave blank if not explicable) This information is continued on an additional sheet □ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Of the indicaiions are not for ail designated States) 



E, SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specif the general nature of the indicaiions, e.g., ''Accession 
Number of Deposit*^ 



For receiving Office use only ■ 



This sheet was received with the international application 



Authorized officer 



8ii8en White 

PCT Internationa? Division 



■ For International Bureau use only < 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Forni PCr/RO/I34 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRu!el3/>w; 



B. IDENTIFICATION OF DEPOSIT 



A. The indications made below relate to the microorganism referred to in the description 
on page 122 , line N/A 



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26. 1 997 



Accession Number 97898 



C. ADDITIONAL INDICATIONS (ieave blank i/not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fl/ihe indications are not for ait designated States) 



E. SEPARATE FURNISHING OF INDICATIONS Oeave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit'^ 



For receiving Office use only , 



Jhis sheet was received with the international application 



Authorized officer 



Sufien White 

PCT International Division 



■ For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Fomi PCT/RO/134 (July 1992) 



wo 98/39448 



PCTAJS98/a4493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 




JVT™s sheet was fcceived with the iniemational application 


j— j This sheet was received by the International Bureau on: 


Authorized officer 

Siisen White 

PCT International Dhrtston 


Authorized officer 



Form PCT/RO/134 (July 1992) 



wo 98/39448 



PCTAJS98/04493 



725 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel3/)w; 



B. IDENTIFICATION OF DEPOSIT 



A. The indications made below relate to the microorganism referred to in the description 
on page 126 N/A 



Further deposits arc identified on an additional sheet Q 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1 997 



Accession Number 97899 




D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (Jeave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specif the general nature of the indications, e.g., "Accession 
Number of Deposit'^ 



For receiving Office use only . 



This sheet was received with Uie international application 



Authorized ofiicer 



Susan White 

PCT International Division 



I For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Foim PCT/RO/134 (July 1992) 



wo 98/39448 



PCT/US98/04493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule Ubis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 126 > ^/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet 



Name of depositary institution 



American Type Cuhure Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Mve 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15,1 997 


Accession Number 209045 


C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is conimued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicadom are not for all designated States) 



|E. separate FURNISHING OF INDICATIONS (leave blan k if not applicable) 

The indications listed below will be submitted to the international Bureau later {specific the general nature of the indications, e.g., "Accession 
I Number of Deposit") 



For receiving Office use only ■ 



lis sheet was received with the international application 



Authorized officer 



Susan White 

POT International Division 



- For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Fonn PCT/RO/134 (July 1992) 



wo 98/39448 



PCT/US98/04493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 130 , line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits arc identified on an additional sheet Q 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit April 28, 1 997 



Accession Number 2090 1 1 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fif the indicadons are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g. '^Accession 
Number of Deposit") 



For receiving Office use only . 



This sheet was received with the international application 



Authorized officer 



Susan White 

PCT tmemattonal Division 



. For International Bureau use only - 



□ 



This sheet was received by Uic International Bureau on: 



Authorized officer 



Vom PCT/RO/134 (July 1992) 



wo 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 



(PCT Rule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 131 , line N/A 


B. IDENTIFICATION OF DEPOSIT 


Further deposits arc identified on an additional sheet 


Name of depositary institution 

American Type Culture Collection 


Address of depositaiy institution (including postal code and country) 


12301 Parklawn Drive 
Rockviile, Maryland 20852 
United States of America 




Date of deposit Febniary 26. 1 997 


Accession Number 97900 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This Information is continued on an additional sheet 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the inmca^ons are not for all designaied States) 



E. SEPARATE FURNISHING OF INDICATIONS Oeave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit*'^ 



For receiving Office use only , 



This sheet was received with the international application 



Authorized officer 



W3T Intemattonal OWUton 



. For International Bureau use only 



□ 



This sheet was received by the International Bureau on; 



Authorized officer 



Fonn PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulel36iy; 



A. The indications made below relate to the microorganism referred to in the description 
on page 137 ^^^^ , line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1 997 



Accession Number 9790 1 



C. ADDITIONAL INDICATIONS {leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit*^ 



For receiving Office use only . 



This sheet was received with the international application 



Authorized officer 



Susan Whito 

PCT Intemattenal DMston 



• For International Bureau use only i 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Fonn PCT/RO/134 (July 1992) 



wo 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 131 ^ > line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q] 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 1 5, 1 997 



Accession Number 209046 



C. ADDITIONAL INDICATIONS Oeave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specif the general nature of the indications, e.g. ''Accession 
Number of Deposit**) 



For receiving Office use only . 



This sheet was received with the international application 



Authorized officer 



Susan White 

PCX tntemaflona) DMsion 



• For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authortzed ofTicer 



Form PCT/RO/134 (July 19W) 



wo 98/39448 



PCT/US98/04493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule \3bis) 



A. The indications made below reJaie lo the microorganism referred to in the description 

137 , line N/A . 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville. Maryland 20852 
United States of America 



Date of deposit May 1 5, 1 997 



Accession Number 209047 



C ADDITIONAL INDICATIONS {leave blank if not applicable) This information is continued on an additional sheet \J 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be su bmitted to the International Bureau later {specijy ike general nature of the indications, e.g.. "Accession 
Number of Deposit") 



For receiving Office use only . 



is sheet was received with the iniemational application 



Authorized oificer SuSdH White 

PCI Intemeitonal DMsion 



. For International Bureau use only 



□ 



This shcci was received by the International Bureau on: 



Authorized officer 



FonnPCT/RO/134 (July 1992) 



wo 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule Ubis) 



on page 137 



I b. identification of deposit 

i Name of depositary institution ^^^^^ xype Culture CoUection 



Further deposits are identified on an additional sheet Q 



of deposiuuy institution {including postal code ondcounlry) 



12301 Parklawn Drive 
RockviHe, Maryland 20852 
United States of Atnenca 



Date of deposit May 22, 1997 



1 Accession Number 209076 



' additional INDICATIONS ^^tto«t.ym,>^;i^>fe; This 



information is continued on an additional sheet Q 



b. DESIGNATED STATES FOR WHICH 



TNnirATlONS ARE MADE (V,H.t.Hlicado^areno,foraU<lesign.ueasmes) 



. o A^.r lamNlSHlNG OF INDICATIONS (lea^ blank if no. applicat,e)_ 



IE. SEPARATE yUKnianii -o i^ications. e .g.. "Accession 

L indications listed below Will be subn.med to U.e.nMua .ional Bureau later (.pec,... .^^^^ 

\ Number of Deposit") 



^ For receiving Office use only , 



This sheet was received with the international application 



Authorized ofTiccr SUUn WhitS 

per intemflttonal DM»ton 



. For International Bureau use only 



□ 



This sheet was received by Oie Imemalional Bureau on: 



Authorized officer 



FonnPCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule \3bis) 




For receiving Office use only 



5 Sheet was received with the international application 



Authorized ofTicer 



PCT Internattonal Dlvtelcw 



For International Bureau use only 



□ 



This sheet was received by the Intcmaiional Bureau on: 



Aulhori2ed officer 



Form PCT/RO/134 (July 1992) 



wo 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuie \3bi5) 



A. The indications made below relate to the microorganism refe^ed to in the description 
1 £A » line pi/A 
on page IW == 

Ftiithcr deposits are identified on an additional sheet Q 



B. IDENTIFICATION OF DEPOSIT 



Name of depositary institution ^^^^.^^^ ^^^^^ Collection 



Address 



of depositary institution {including postal code and country) 



12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997 



C APPlTtONAL INDICATIONS (leave blank 



Accession Number 97904 
7r... .nnnrakle^ This information is conuiiued on an additional sheet Q 



b. DESIGNATED STATES FOR WHICH 



INDICATIONS ARE MADE Ofiheindicaiions are not for all designated States) 



\ Number of Deposit") 



For receiving Office use only . 



-0 



This sheet was received with the international application 



Authorized oflicer SUSBn White 

PCT Intemaltona? DMstoP 



. For International Bureau use only 



□ 



This sheet was received by the Intcmaiional Bureau on: 



Authorized officer 



FonnPCT/RO/l34(July 1992> 



PCT/US98/04493 



WO 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A- The indications 
on page 154 



made below relate to the microorganism referred to in the description 

, line N/A , 



[b. identification of deposit 



Further deposits are identified on an additional sheet Q 



|Nameofdepositao.inst.tuUon . ^^y^. Culture Collection 

Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit July 3. 1997 



"C. iipDITIONAL INDICATIONS {leave blank 



Accession Number 209 1 39 
.r.^W/cft/ri This infonnation is continued on an additional sheet Q 



Id. designated states for which 



INDICATIONS ARE MADE (i/iheMicaiiom are net/or all designated S,ates) 



E SEPARATE FURNISHING OF INDICA HONS rte«« blank if no, applicable) 



Number of Deposit") 



l^^jThis sheet was received with the intcmaiional application 


1 1 This shcei was received by the International Bureau on: 


Authorized officer guggn Whtt0 

PCT Intemfirttonal DMslon 


Authorized officer 



Form PCT/RO/134 (July 1992) 



wo 98/39448 



PCT/US98/04493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rulel36w; 



A The indications made below relate to the microorganism referred to in the description 
* on page 153 - ""^ 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



I Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 1 5 , 1 997 



Accession Number 209049 



e ADDITIONAL INDICATIONS (ieave blank if not applicable) This information is continued on an additional sheet □ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indicaiions are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (ieave blank if not appli cable) ^ 

The indications listed below will be subm itted to the International Bureau later {specify the general nature of the indications, e.g.. "Accession j 
Number of Deposit") 



j^^This sheet was received with the international application 


j j This sheet was received by the Imemaiional Bureau on: 


Authorized officer 

Susan White 

PCT Intematfonrt DMsion 


Authorized officer 



Form PCT/RO/134 (July 1992) 



wo 98/39448 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule \3bis) 



A. The indications made below relaic to the microorganism rcfeaed to in the description 
' on page 153 > ^ne N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits arc identified on an additional sheet Q 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
RocHville. Maryland 20852 
United States of America 



Date of deposit February 26, 1 997 



Accession Number 97903 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., "Accession 
Number of Deposit*^ 



For receiving Office use only , 



0: 



This sheet was received with the imcmational application 



Authorized officer 



Susan White 

PCT Intamatlonal DMfiion 



» For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



FonnPCT/RO/l34 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 142 ^ line N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet Q 



Name of depositary institution 



American Type Culture Collection 



Date of deposit June 1 2, 1 997 


Accession Number 209 1 1 9 


C. ADDITIONAL INDICATIONS (teave blank if not applicable) This information is conunucd on an additional sheet Q 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for aU designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g.. "Accession 
Number of Deposit") 



For receiving Office use only , 



his sheet was received with the international application 



Authorized ofTicer 



Susan White 

PCT International Division 



. For International Bureau use only 



□ 



This sheet was received by the Iniemaiional Bureau on: 



Authorized officer 



Forni PCT/RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule \3bis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 146 .line N/A 



B. IDENTIFICATION OF DEPOSIT Funher deposits are identified on an additional sheet Q 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 26, 1997 



Accession Number 97902 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Of the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g., ''Accession 
Number of Deposit*^ 



For receiving Office use only . 



This sheet was received with the tntemational application 



Authorized oflicer SUSfiH WhW 



PCT International DMslon 



• For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized ofTiccr 



Form PCT/RO/134 (July 1992) 



wo 98/39448 



740 



PCT/US98/04493 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule \3bisJ 



A. The indications made below relate to the microorganism referred to in the description 
on page 146 * hne N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet r] 



Name of depositary institution 

American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit May 15, 1997 



Accession Number 2Q9048 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [J 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (i/the indications are not/or all designated States) 



E, SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later [specify the general nature of the indications, e.g.. "Accession 
Number of Deposit") 



For receiving Office use only , 



lis sheet was received with the international application 



Authorized ofTicer 



Susan WMM 

PCT International DMslon 



• For International Bureau use only 



□ 



This sheet was received by the International Bureau on: 



Authorized officer 



Form PCT/RO/134 (July I992> 



wo 98/3944S 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13te; 



A. The indications made below relate to the microorganism referred to in the description 
on page 160 Jine N/A 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet H] 



Name of depositary institution 



American Type Culture Collection 



Address of depositary institution {including postal code and country) 

12301 Parklawn Drive 
Rockville» Maryland 20852 
United Slates of America 



Date of deposit May 1 5, 1997 


Accession Number 209050 


C. ADDITIONAL INDICATIONS (leay^e blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g., ''Accession 
Number of Deposit") 



For receiving Office use only , 



EC' 



This sheet was received with the international application 



Authorized officer 


Susan White 




PCT intemaMonal OMshm 



. For International Bureau use only 



□ 



This sheet was received by the Iniemational Bureau on: 



Authorized officer 



Form PCr/RO/134 (July 1992) 



wo 98/39448 PCT/DS98/04493 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 136w; 



A. The indications made below relate to the microorganism referred to in the description 

on page 142 , line N/A 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution American Type Culture Collection 



Address of depositary institution (including postal code and country) 

12301 Parklawn Drive 
Rockville, Maryland 20852 
United States of America 



Date of deposit February 12» 1998 



Accession Number 209627 



C, ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet Q 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later {specify the general nature of the indications, e.g. "Accession 
Number of Deposit*^ 



For receiving Office use only . 



EL' 



This sheet was received with the international application 



Authorized officer SUSan WNtS 

PCT Intermrttanftl DMsion 



□ 



— For International Bureau use only 
This sheet was received by the International Bureau on: 



Authorized ofltcer 



Form PCT/RO/I34 (July 1992) 



wo 98/39448 



743 



PCTAJS98/04493 



What Is Claimed Is: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group 
5 consisting of: 

(a) a polynucleotide fragment of SEQ ID NO:X or a polynucleotide fragment of 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X; 

(b) a polynucleotide encoding a polypeptide fragment of SEQ ID NOiY or a 

10 polypeptide fragment encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

(c) a polynucleotide encoding a polypeptide domain of SEQ ID NO: Y or a 
polypeptide domain encoded by the cDNA sequence included in ATCC Deposit No:Z, 
which is hybridizable to SEQ ID NO:X; 

15 (d) a polynucleotide encoding a polypeptide epitope of SEQ ID NO: Y or a 

pplypeptide epitope encoded by the cDN A sequence included in ATCC Deposit No:Z, 
W(uch is hybridizable to SEQ ID NO:X; 

(e) a polynucleotide encoding a polypeptide of SEQ ID NO: Y or the cDNA 
sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X, 

20 having biological activity; 

(f) a polynucleotide which is a variant of SEQ ID NO:X; 

(g) a polynucleotide which is an allelic variant of SEQ ID NO:X; 

(h) a polynucleotide which encodes a species homologue of the SEQ ID NO: Y; 

(i) a polynucleotide enable of hybridizing under sUingent conditions to any 
25 one of the polynucleotides specified in (a)-(h), wherein said polynucleotide does not 

hybridize under stringent conditions to a nucleic acid molecule having a nucleotide 
sequence of only A residues or of only T residues. 

2. The isolated nucleic acid molecule of claim 1 , wherein the 

30 polynucleotide fragment comprises a nucleotide sequence encoding a secreted protein. 

3 . The isolated nucleic acid molecule of claim 1 , wherein the 
polynucleotide fragment comprises a nucleotide sequence encoding the sequence 
identified as SEQ ID NO: Y or the polypeptide encoded by the cDNA sequence included 

35 in ATCC Deposit No:Z, which is hybridizable to SEQ ID NO:X. 
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4 . The isolated nucleic acid molecule of claim 1 . wherein the 
polynucleotide fragment comprises the entire nucleotide sequence of SEQ ID NO:X or 
the cDNA sequence included in ATCC Deposit No:Z, which is hybridizable to SEQ ID 
NO:X. 

5 

5 . The isolated nucleic acid molecule of claim 2. wherein the nucleotide 
sequence comprises sequential nucleotide deletions from either the C-terminus or the N- 
terminus. 

10 6. The isolated nucleic acid molecule of claim 3, wherein the nucleotide 

sequence comprises sequential nucleotide deletions from dtiier tiie C-terminus or tiie N- 
terminus. 

7. A recombinant vector comprising the isolated nucleic acid molecule of 
IS claim 1. 

8. A metiiod of making a recombinant host cell comprising the isolated 
nucleic acid molecule of claim 1. 

20 9. A recombinant host cell produced by tiie method of claim 8. 

10. The recombinant host cell of claim 9 comprising vector sequences. 

11. An isolated polypeptide comprising an amino acid sequence at least 95% 
25 identical to a sequence selected from tiie group consisting of: 

(a) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 

in ATCC Deposit No:Z; 

(b) a polypeptide fragment of SEQ ID NO: Y or the encoded sequence included 

in ATCC Deposit Noi, having biological activity; 
30 (c) a polypeptide domain of SEQ ID NO: Y or the encoded sequence included in 

ATCC Deposit No:Z; 

(d) a polypeptide epitope of SEQ ID NO: Y or die encoded sequence included in 

ATCC Deposit No:Z; 

(e) a secreted form of SEQ ID NO:Y or the encoded sequence included in 

35 ATCC Deposit No:Z; 

(f) a full length protein of SEQ ID NO:Y or die encoded sequence included in 

ATCC Deposit No:Z; 
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(g) a variant of SEQ ID NO: Y; 

(h) an allelic variant of SEQ ID NO: Y; or 

(i) a species homologue of the SEQ ID NO: Y. 

12. The isolated polypeptide of claim 1 1 , wherein the secreted form or the 
full length protein comprises sequential amino acid deletions from either the C-terminus 
or the N-terminus. 

13. An isolated antibody that binds specifically to the isolated polypeptide of 
claim 11. 

14. A recombinant host cell that expresses the isolated polypeptide of claim 

11. 

15. A method of making an isolated polypeptide comprising: 

(a) culturing the recombinant host cell of claim 14 under conditions such that 
said polypeptide is expressed; and 

(b) recovering said polypeptide. 

16. The polypeptide produced by claim 15. 

17. A method for preventing, treating, or ameliorating a medical condition, 
comprising administering to a manmialian subject a therapeutically effective amount of 
the polypeptide of claim 1 1 or the polynucleotide of claim 1 . 

18. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or absence of a mutation in the polynucleotide of 

claim 1; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or absence of said mutation. 

19. A method of diagnosing a pathological condition or a susceptibility to a 
pathological condition in a subject comprising: 

(a) determining the presence or amount of expression of the polypeptide of 
claim 1 1 in a biological sample; and 

(b) diagnosing a pathological condition or a susceptibility to a pathological 
condition based on the presence or amount of expression of the polypeptide. 
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20. A method for identifying a binding partner to the polypeptide of claim 
comprising: 

(a) contacting the polypeptide of claim 1 1 with a binding partner; and 

(b) determining whether the binding partoer effects an activity of the 
polypeptide. 

2 1 . The gene corresponding to the cDN A sequence of SEQ ID NO: Y. 

22. A method of identifying an activity in a biological assay, wherein the 
method comprises: 

(a) expressing SEQ ID NO:X in a cell; 

(b) isolating the supernatant; 

(c) detecting an activity in a biological assay; and 

(d) identifying the protein in the supernatant having the activity. 



23. The product produced by the method of claim 22. ' 



CORRECTED 
VERSION* 



WORLD INlHJJECnJAL PROPERTY ORGANIZATION 
Intenuuxonal Boxeau 



PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ^ : 

C12N 15/12, 5/10, 1/21, COTK 14/47, 
16/18, C12Q 1/68, COIN 33/50, 33/53, 
33/68, A61K 38/17 



A3 



(11) International Publication Number: WO 98/39448 

(43) International Publication Date: 1 1 September 1998 (1 1.09.98) 



(21) International Application Number: PCr/US98/04493 

(22) International Filing Date: 6 March 1998 (06.03.98) 



(30) Priority Data: 






60/040.162 


7 March 1997 (07.03.97) 


US 


60/040,333 


7 March 1997 (07.03.97) 


US 


60/038.621 


7 March 1997 (07.03.97) 


US 


60/040,161 


7 March 1997 (07.03.97) 


US 


60/040,626 


7 March 1997 (07.03.97) 


US 


60/040334 


7 March 1997 (07.03.97) 


US 


60/040,336 


7 March 1997 (07.03.97) 


US 


60/040,163 


7 March 1997 (07j03.97) 


US 


60/043,580 


11 April 1997 (11.04.97) 


US 


60/043,568 


n April 1997(11.04.97) 


US 



(Continued on the following page) 

(71) Applicant (for all designated States except US): HUMAN 

GENOME SOENCrES, INC. [US^JS]; 9410 Key West 
Avenue. Rockville. MD 20850 (US). 

(72) Inventors; and 

(75) Inventors/AppUcants (for US only): RUBEN, Steven, M. 
[US/US]; 18528 Heritage Hills Drive, OIney. MD 20832 
(US). ROSEN. Craig, A. (US/USI; 22400 RoUing Hills 
Road. Laytonsville. MD 20882 (US). FISC^HER. Carrie. L. 
[USAJS]; 5810 HaU Street, Burke, VA 22015 (US). SOP- 
PET, Daniel, R. [USAJS]; 15050 Stillfield, Place, Cenlrc- 
ville. VA 22020 (US). CARTER, Kenneth, C. [USAJS]; 
11601 Brandy Hall Lane, North Potomac. MD 20878 
(US). BEDNARIK, Daniel. P. [USAJSJ; 8822 Blue Sea 
Drive, Columbia. MD 21046 (US). ENDRESS, Gregory. A. 
[USAJS]; 9729 Clagett Farm Drive, Potomac. MD 20854 
(US). YU. Guo-Liang [CNAJS]; 13524 Straw Bale Lane, 
Damestown, MD 20878 (US). NT, Jian [CNAJS); 5502 
Manorfield Road. Rockvillc, MD 20853 (US). I^G, Ping 
[CNAJS]; 4 Relda Court. Gaithersburg, MD 20878 (US). 
YOUNG. Paul. E. [USAJS); 122 Beckwith Street. GaiAers- 
burg. MD 20878 (US). GREENE. John, M. (USAJS]; 872 
Diamond Drive, Gaithersburg. MD 20878 (US). FQUUE, 
Ann, M. [USAJS]; 13203 L Astoria Hill Court. Gennantown. 



MD 20874 (US). DUAN, Roxannc (USAJS]; 4541 Fairiield 
Drive. Bethesda, MD 20814 (US). HU. Jing-Shan (CNAJS); 
1247 Lakeside Drive #3034, Sunnyvale, CA 94086 (US). 
FLORENCE Kimberiy. A. [USAJS]; 12805 AUantic Av- 
enue, Rockvillc. MD 20851 (US). OLSEN, Hcnrik. S. 
[DKAJS]; 1 82 Kendrick Place #24. Gaithersburg, MD 20878 
(US). EBNER. Reinhard [DEAJS]; 9906 Shelbume Tenacc 
#316. Gaithersburg. MD 20878 (US). BREWER, Laurie. A. 
[USAJS]; 14920 Mount Nebo Road, Poolcsvillc. MD 20837 
(US). MOORE, Paul. A. [GBAJS]; Apartment #104, 1908 
Holly Ridge Drive. McLean. VA 22102 (US). SHI, Yanggu 
[CNAJS]; 437 West Side Drive, Gaithersburg, MD 20878 
(US). LAFLEUR, David. W. [USAJSJ; 1615 Q Street, N.W. 
#807. Washington. DC 20009 (US). U, Yi [CNAJS]; 1247 
Lakeside Drive #3034, Sunnyvale, CA 94086 (US). ZENG, 
Zhizhen [CNAJS]; 13950 Saddleview Drive, Gaitheisburg, 
MD 20878 (US). KYAW. HIa [BUAJS]; 520 Sugarbush Cir- 
cle. Frederick. MD 21703 (US). 

(74) Agents: BROOKES. Anders. A. et al.; Human Genome 
Sciences, Inc., 9410 Key West Avenue, Rockville. MD 
10850 (US). 

(81) Designated States: AL. AM. AT. AU, AZ. BA. BB, BG. BR, 
BY. CA. CH. CN. CU, CZ. DE. DK. EE, ES. Fl, GB. GE. 
GH, GM. GW, HU. ID. IL. IS. JP. KE, KG. KP. KR. KZ, 
LC. LK. LR. LS. LT. LU. LV, MD, MG. MK. MN, MW, 
MX. NO. NZ. PL. FT, RO, RU, SD. SE, SG, SI, SK. SL. 
TJ, TM, TR, TT, UA, UG. US. UZ. VN, YU. ZW. ARIPO 
patent (GH. GM, KE. LS. MW, SD, SZ, UG. ZW). Eurasian 
patent (AM, AZ. BY, KG. KZ. MD, RU, TJ, TM). European 
patent (AT. BE, CH. DE. DK. ES. H, FR, GB, GR. IE, IT, 
LU. MC. NL. PT. SE), OAPI patent (BF, BJ. CF, CG, CI, 
CM. GA. GN. ML. MR. NE, SN. TO, TG). 

Published 

With international search report. 

Before the expiration of the time limit for amending the claims 
and to be republished in the event of the receipt of amendments. 
With an indication in relation to deposited biological material 
furnished under Rule 75*" separately from the description. 

(88) Date of publication of the international search report: 

26 November 1998 (26.11.98) 



(54) Title: 186 HUMAN SECRETED PROTEINS 



(57) Abstract 



The present invention relates to 186 novel human secreted proteins and isolated nucleic acids containing the coding regions of the 
genes encoding such proteins. Also provided arc vectors, host cells, antibodies, and recombinant methods for producing human secreted 
proteins. The invention further relates to diagnostic and therapeutic methods useful for diagnosing and treating disorders related to these 
novel human secreted proteins. 



*(ReferTed to in PCT Gazette No. 43/1998. Section n) 



60/043.314 

60/043,569 

60/043.311 

60/043,671 

60/043.674 

60/043.669 

6(y043,312 

60/043,313 

60/043.672 

60^043,578 

60/043,576 

60/043,670 

60/047.600 

60/047,615 

60^)47.597 

60/047.502 

60/047.633 

60/047,583 

60/047,617 

60/047.618 

60/047,503 

60/047.592 

60/047,581 

60/047,584 

60/047,500 

60/047,587 

60/047,492 

60/047.598 

60/047.613 

60/047,582 



n aprill997(ll 
llaprin997(n 
llaprill997(Il 
napriJ1997(n 
11 april 1997(11 
11 april 1997(11 
U april 1997(1 
U april 1997 (II 
11 april 1997 (11 
llaprin997(ll 
11 april 1997(11 
11 april 1997 (11 
23 May 1997(23, 
23 May 1997 (23 
23 May 1997(23 
23 May 1997(23 
23 May 1997 (23, 
23 May 1997(23 
23 May 1997 (23 
23 May 1997 (23. 
23 May 1997 (23, 
23 May 1997 (23. 
23 May 1997(23 
23 May 1997(23, 
23 May 1997(23 
23 May 1997(23. 
23 May 1997 (23. 
23 May 1997(23. 
23 May 1997(23 
23 May 1997 (23 



.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.04.97) US 

.05.97) US 

1.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

.05.97) US 

05.97) US 



60/047,596 

60/047,612 

60/047.632 

60/047.601 

60/047.595 

60/047.599 

60/047.588 

60/047,585 

60/047,586 

60/047,590 

60/047,594 

60/047.589 

60/047,593 

60/047,614 

60/047,501 

60/048,974 

60/048.964 

60/049.610 

60/051.926 

60/052.874 

60/055,724 

60/056,886 

60/056.877 

60/056.889 

60/056,893 

60/056,630 

60/056.878 

60/056.662 

60/056.872 

60/056.882 



23 May 1997 (23.05.97) US 60/056 637 

23 May 1997 (23.05.97) US 60/056 903 

23 May 1997 (23.05.97) US 60/056*888 

23 May 1997 (23.05.97) US 60/056 879 

23 May 1997 (23.05.97) US 60/056.880 

23 May 1997 (23.05.97) US 60/056 894 

23 May 1997 (23.05.97) US 60/056911 

23 May 1997 (23.05.97) US 60/056 636 

23 May 1997 (23.05.97) US 60/056^874 

23 May 1 997 (23.05.97) US 60/056 91 0 

23 May 1997 (23.05.97) US 60/056^864 

23 May 1997 (23.05.97) US 60/056.631 

23 May 1997 (23.05.97) US 60/056!845 

23 May 1 997 (23.05.97) US 60/056 892 

23 May 1 997 (23.05.97) US 60/056!632 

06 June 1997 (06.06.97) US 60/056,664 

06 June 1 997 (06.06.97) US 60/056.876 

13 June 1997 (13.06.97) US 60/056.881 

08 July 1997 (08.07.97) US 60/056 909 

16 July 1997 (16.07.97) US 60/056.875 

18 August 1997 (18.08.97) US 60/056^862 

22 August 1997 (22.08.97) US 60/056,887 

22 August 1997 (22.08.97) US 60/056.908 

22 August 1997 (22.08.97) US 60/056,884 

22 August 1997 (22.08.97) US 60/057761 

22 August 1 997 (22.08.97) US 60/057,650 

22 August 1997 (22.08.97) US 60/057^669 

22 August 1997 (22.08.97) US 60/058,785 

22 August 1997 (22.08.97) US 60/061 060 
22 August 1997(22.08.97) US 



22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1 997 (22.08.97) US 
22 August 1 997 (22.08.97) US 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1 997 (22.08.97) US 
22 August 1 997 (22.08.97) US 
22 August 1997(22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997(22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997(22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997 (22.08.97) 
22 August 1997(22.08.97) 
22 August 1997(22.08.97) 
22 August 1997(22.08.97) 
22 August 1997(22.08.97) 
05 September 1997 (05 09.97).US 
05 September 1997 (05 09.97). US 
05 September 1997 (05 09.97).US 
12 September 1997 (12.09.97) US 
02 October 1997 (02.10.97) US 



US 
US 

us 
us 
us 
us 



us 
us 



us 
us 
us 

us 
us 
us 
us 
us 
us 
us 
us 
us 



AL 

AM 

AT 

AU 

AZ 

BA 

BB 

BE 

BF 

BG 

BJ 

BR 

BY 

CA 

CF 

CC 

CH 

CI 

CM 

CN 

CU 

CZ 

DE 

DK 

EE 



FOR THE PURPOSES OF INFORMATION ONLY 
Codes used to identify States party to the PCT on the from pages of pamphlets publishing international applications under the PCT, 



Albania es 

Annenia pi 

Austria fji 

Ausiralia q\ 

Azeitaijan gb 

Bosnia and Herzegovina GE 

Barbados gH 

Belghun gn 

Burkina Paso GR 

Bulgaria HU 

Benin i£ 

Brazil iL 

Belarus is 

Canada ix 

Central African Republic JP 

Congo KE 

Switzerland KG 

COtc d'Tvoire KP 
Cameroon 

China KR 

Cuba KZ 

Czech Republic LC 

Gennany li 

Denmark tK 

Estonia lr 



Spain 
Finland 
France 
Gabon 

United Kingdom 

Georgia 

Ghana 

Guinea 

Greece 

Hungary 

Ireland 

Israel 

Iceland 

Italy 

Jm)an 

Kenya 

Kyrgyzsian 

Democratic People's 

Republic of Korea 

Republic of Korea 

Kazakstan 

Saint Lucia 

Liechtenstein 

Sri Lanka 

Liberia 



LS 
LT 
LU 
LV 
MC 
MD 
MG 
MK 

ML 

MN 

MR 

MW 

MX 

NE 

NL 

NO 

NZ 

PL 

PT 

RO 

RU 

SD 

SE 

SG 



Lesotho 

Lithuania 

Luxembourg 

Latvia 

Monaco 

Republic of Moldova 

Madagascar 

Hie former Yugoslav 

Republic of Macedonia 

Mali 

Mongolia 

Mauritania 

Malawi 

Mexico 

Niger 

Netherlands 

Norway 

New Zealand 

Poland 

Portugal 

Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 



SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


SZ 


Swaziland 


TD 


Chad 


TG 


Togo 


TJ 


Tajildstan 


TM 


Turkmenistan 


TR 


Turiccy 


TT 


Trinidad and Tobago 


UA 


Ukraine 


UG 


Uganda 


US 


United States of America 


uz 


Uzbekistan 


VN 


Viet Nam 


YU 


Yugoslavia 


zw 


Zimbabwe 



INTERNATIONAL SEARCH REPORT 



IntMnaU-nal AppHcatlon No 

PCT,-3 98/04493 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC 6 C12N15/12 C12N5/10 C12N1/21 C07K14/47 
C12Q1/68 G01N33/50 G01N33/53 G01N33/68 

Aooording to International Patent Cia—ificatiort (IPC) or to both nationaJ elas»ifioation and IPO 



C07K16/18 
A61K38/17 



B. RELOS SEARCHED 



Minmwm dooumentation searched (classification system folbwed by classilioation aytrfaola) 

IPC 6 C12N C07K C12Q GOIN A61K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Eleetronio data base oonsutted during the intemational search (name of database and, where practioal, search temn used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of dooument, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



L. HILLIER ET AL.: "The WashU-Merck EST 
Project" 

EMBL SEQUENCE DATABASE, 
25 January 1996, HEIDELBERG, FR6, 
XP002068330 

yy50d03.sl Homo sapiens cDNA clone 276955 
3'; Accession no.- N39235; 

L. HILLIER ET AL. : "The WashU-Merck EST 
Project" 

EMBL SEQUENCE DATABASE, 
3 July 1995, HEIDELBERG, FRG, XP002068331 
yni24h05.sl Homo sapiens cDNA clone 49109 
3*; Accession no. H16400; 



-I" 



1-3, 
7-11,21 



1-3, 
7-11,21 



X I Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



* Special categoriea of oited dooumente : 

'A* document defining the generai state of the art which is not 
considered to be of particular retevanca 

*E' eariterdoounwnt but published on or after the intemational 
filing date 

'L' document which may throw doubts on priority claim(s) or 
which is cited to eatablish the put)lioation date of another 
citation or other special reason (as speoified) 

*0* document referring to an oral disolosure* use, exhiiition or 
other means 

*P' document published prior to the international filing date but 
later than the priority date claimed 



*T* later document published after the intemational filing date 
or priority date and not in oonfliot with the application but 
cited to understand the principle or theory underiying the 
Bivention 

"X' docuntent of particular reievanoe; the otaimed invention 
cannot be considered novel or cannot be oonaidered to 
involve an inventive step when the document is token alone 

"Y* document of particular relevance; the claimed invention 

cannot be oonaidered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

•4" document member of the same patent family 



Date of the actual oompletion of thelnternationai search 



17 June 1998 



Data of maiiing of the intemationaJ search report 



2 ft. 09. W 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL-2280HV Rijawijk 
Tel. (f31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (-l-a 1-70) 340-3016 



Authorized officer 



H0RNIG H. 



Ponn PCT/tSA/2tO (second •haei) (July t992) 



page 1 of 2 



INTERNATIONAL SEARCH REPORT 



Intematinnal Application No 

PCT; 98/04493 



C.(Comintiattcm) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * Citation of dootitnent, with mdioalion, where appropriate, of tha relevant 



Relevant to claim No. 



EP 0 679 716 A (MATSUBARA KENICHI ;OKUBO 
KOUSAKU (JP)) 2 November 1995 
SEQ ID no. 3616 

see page 1144, line 10 - line 24; claims 
1-6 

WO 97 07198 A (GENETICS INSTITUT) 27 

February 1997 

see the whole document 

WO 97 04097 A (GENETICS INST) 5 February 
1997 

see the whole document 

US 5 536 637 A (JACOBS KENNETH) 16 July 
1996 

see the whole document 

JACOBS K ET AL: "A novel method for 
isolating eukaryotic cONA clones encoding 
secreted proteins." 

KEYSTONE SYMPOSIUM ON DENDRITIC CELLS; 
ANTIGEN PRESENTING CELLS OF T AND B 
LYMPHOCYTES, TAOS, NEW MEXICO, USA, MARCH 
10-16, 1995. JOURNAL OF CELLULAR 
BIOCHEMISTRY SUPPLEMENT 0 (21A). 1995. 19. 
ISSN: 0733-1959, XP002027246 
abstract no. C1-2G7 
see abstract 

WO 90 14432 A (GENETICS INST) 29 November 
1990 

see the whole document 

WO 96 17925 A (IMMUNEX CORP) 13 June 1996 
see the whole document 



1-3. 
7-10,21 



1-23 



1-23 



1-23 



1-23 



1-23 



1-23 



Form PCT/iSA/210 (oontinuatnn of second ehest) (July 1992) 



page 2 of 



2 



INTERNATIONAL SEARCH REPORT 



Ir' mational application No. 

PCT/US 98/04493 



Box I Observations where certain claims were found unsearchable (Continuation o1 item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



1. X Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Remark: Although claim 17 is directed to a method of treatment of the 
human/animal body, the search has been carried out and based 
on the alleged effects of the compound/composition. 

2. Claims Nos.: ^, ... • * » u 
— because they relate to parts of the International Application that do not comply with the prescribed requirements to such 

an extent that no meaningful Intemational Search can be carried out, specifically: 



3. I Claims Nos.: ^ * »b t cj/ \ 
because they are dependent daims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box l| Observations where unity of invention is lacldng (Continuation of item 2 of first sheet) 



This International Searching Authority found muftifrfe inventions in this intemational application, as follows: 

see further information, sheet 



1 . I I As ail required additional search fees were timely paid by the applicant, this Intemational Search Report covers aJI 
I — ■ searchable claims. 



2. Q As all searohaWe claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. ' 



3. I I As only some of the required adcfilional search tees were timely paid by the applicant, this Intemational Search Report 
I — > covers only those claims tor which fees were paid, spadfieally claims Nos.: 



4. X No required additional search fees were timely paid by the applicant. Consequently, this Intemational Search Report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



1-23 partially 

Remark on Protest Q additional search fees were accompanied by the applicant's protest. 

j I No protest accompanied the payment of additional search fees. 



Form PCT/lSA/21 0 (continuation of first sheet (1 )) (July 1992) 



IrtemationalApplcationNo. PCT/ US 98/04493 



FURTHER INFORMATION CONTINUED FROM PCT/I8A/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

Invention 1. Claims: (1-23) partially 

-An isolated nucleic acid molecule comprising a 
polynucleotide having a nucleotide sequence at least 95% 
identical to a sequence selected from the group of SEQ ID 
nos. 11, 197; wherein said polynucleotide fragment comprises 
a nucleotide sequence encoding a secreted protein encoding 
the sequence selected from the goup of SEQ ID nos. 313, 499 
or the polypeptide encoded by the cDNA sequence included in 
ATCC Deposit no: 97897/209043, which is hybridizable to SEQ 
ID nos. 11 and/or 197; a recombinant vector comprising said 
isolated nucleic acid molecule; a method of making a 
recontoinant host cell comprising said isolated nucleic acid 
molecule; a recombinant host cell comprising said vector; an 
isolated polypeptide comprising an amino acid sequence at 
least 95% identical to a sequence selected from the group of 
SEQ ID nos. 313, 499; an isolated antibody that binds 
specifically to said isolated polyeptitdes; a reconomant 
host cell that expresses said isolated polypeptides; a 
method of making said polypeptides; a method for preventing, 
treating or ameliorating a medical condition which 
comprises administering to a mamnalian subject a 
therapeutically effective amount of said polypeptides; a 
method of diagnosing a pathological condition or i 
susceptibility to a pathological condition in a subject 
using said polynucleotides and/or polypeptide sequences; a 
method for identifying a binding partner to said 
polypeptides; a gene corresponding to the cONA sequences of 
SEQ ID nos. 11 and/or 197; a method for identifying an 
activity in a biological assay, by using the expression of 
SEQ ID no. 313 and/or 499; 

Inventions 2 to 186. Claims: (1-23) partially 

-Idem as subject 1 but limited to gene nos. 2 to 186 
respectively cDNA clone sequences HBGBW52 to HFAMH74. 
(Invention 2 is limited to SEQ ID nos. 12.198,314 and 500; 
Invention 3 is limited to SEQ ID nos. 13, 199, 315 and 50l; 

; Invention 186 is limited to SEQ ID nos. 196, 312, 

498* and 614;) 
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